Diff of /markup/html/whatpm/Whatpm/ChangeLog

revision 1.261 by wakaba, Sat Jul 19 11:48:23 2008 UTC
revision 1.332 by wakaba, Sun Sep 21 11:55:49 2008 UTC
# Line 1  Line 1 
1    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
2    
3            * ContentChecker.pm: |contextmenu| validness is now checked using
4            |id| and |id_type| properties, and |menu| property is removed.
5    
6    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
7    
8            * ContentChecker.pm: Prepare for |form| |name| attribute's
9            duplication checking.
10    
11    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
12    
13            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
14            unsupported charset="" parameter value (HTML5 revision 2131).
15    
16    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
17    
18            * HTML.pm.src: Remaining places where U+000B was allowed as a space
19            character are fixed (cf. HTML5 revision 1738).
20    
21            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
22            space characters (HTML5 revision 1738).
23    
24    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
25    
26            * HTML.pm.src: The "anything else" case for the "after after body"
27            insertion mode was not updated to switch to the "in body"
28            insertion mode.  U+000B is no longer a space character for the
29            purposes of the tree construction phase (HTML5 revision 1738).
30    
31    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
32    
33            * HTML.pm.src: U+000B is no longer a space character (HTML5
34            revision 1738).
35    
36    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
37    
38            * ContentType.pm: 0x0B is no longer a space character (HTML5
39            revision 1738).
40    
41            * HTML.pm.src: U+000B is no longer a space character for the
42            algorithm for extracting an encoding from a Content-Type (HTML5
43            revision 1738).
44    
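The U+000B entries above all narrow one character set. A minimal sketch
in Python (not the Perl that Whatpm itself uses), assuming the
five-character space set HTML5 defines once U+000B is dropped. The names
HTML_SPACE_CHARS, is_html_space and strip_html_space are hypothetical
and do not come from the Whatpm sources:

```python
# Space characters after the change: TAB, LF, FF, CR, SPACE.
# U+000B (vertical tab) is deliberately absent from this set.
HTML_SPACE_CHARS = "\t\n\f\r "


def is_html_space(ch):
    """Return True if ch is a single HTML5 space character."""
    return len(ch) == 1 and ch in HTML_SPACE_CHARS


def strip_html_space(text):
    """Strip only HTML5 space characters from both ends of text."""
    return text.strip(HTML_SPACE_CHARS)
```

Note that Python's own str.strip() with no argument still treats U+000B
as white space, so a checker written in Python would need an explicit
set like the one above.
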
45    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
46    
47            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
48    
49    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
50    
51            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
52    
53            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
54            matching.
55    
56    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
57    
58            * LangTag.pm: Warn for private use language subtags.  Error level
59            typos fixed.  Support for Suppress-Script field.
60    
61            * mklangreg.pl: Support for dumping of nested structure.
62    
63    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
64    
65            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
66            recommended case as per RFC 4646.
67    
68    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
69    
70            * LangTag.pm (check_rfc4646_langtag): New method.
71    
72    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
73    
74            * mklangreg.pl: New script.
75    
76            * Makefile: Updated for creation of the module for language subtag
77            registry.
78            
79    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
80    
81            * Makefile: WebIDL.html added.
82    
83            * WebIDL.pod: New documentation.
84    
85    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
86    
87            * WebIDL.pm: Checker's error types are redefined.
88    
89    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
90    
91            * WebIDL.pm: Parser's error types are redefined.  Some
92            forward-compatible parsing bugs are fixed.  Some unreachable code
93            is commented out.
94    
95    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
96    
97            * WebIDL.pm: Support for the remaining extended attributes is
98            added.  A forward interface declaration that has an extended
99            attribute is considered not to satisfy the definition.  It seems
100            that multiple extended attributes with the same name are not
101            allowed unless explicitly permitted, though this is not explicitly
102            mentioned in the spec.
103    
104    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
105    
106            * WebIDL.pm: Unescapes extended attribute names and extended
107            attribute identifiers.  Preserve whether an extended attribute has
108            an argument list or not.  Support for extended attributes:
109            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
110            NameGetter, NameSetter, and Null.
111            (has_argument_list): New attribute.
112            (idl_text): Stringify the argument list, if any, even if it is
113            empty.
114    
115    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
116    
117            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
118            |{s_kwd}| in DATA_STATE as default.
119    
120    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
121    
122            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
123            by |{s_kwd}| in DATA_STATE.
124    
125    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
126    
127            * HTML.pm.src: Shorten keys.
128    
129    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
130    
131            * HTML.pm.src: Remove checking for control character, surrogate
132            pair, or noncharacter code points and non-Unicode code
133            points (they should be handled by Whatpm::Charset::UnicodeChecker).
134            (parse_char_stream): Support for the |$get_wrapper| argument and
135            character stream error handlers.
136    
137    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
138    
139            * ContentChecker.pm: Don't call |load_ns_module|
140            for null-namespace elements/attributes.
141    
142            * HTML.pm.src: Factor out $disallowed_control_chars
143            as a hash.
144    
145    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
146    
147            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
148            and |{next_char}| initializations are moved to initialization
149            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
150            with |parse_char_stream|.
151    
152    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
153    
154            * HTML.pm.src (parse_char_stream): Make |set_next_char|
155            invoke |manakai_read_until|, not only |read|, where
156            possible, to decrease the number of |read| method calls.
157    
158            * mkhtmlparser.pl: Related changes to the aforementioned
159            modification.
160    
161    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
162    
163            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
164            now reports character errors.
165    
166    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
167    
168            * HTML.pm.src: Non-white-space character tokens with leading
169            white space in the "before head" insertion mode were not
170            correctly handled.
171            (set_inner_html): Reimplemented using CharString decodehandle
172            class.  Support for $get_wrapper argument.  Support
173            for |{read_until}| feature.
174    
175    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
176    
177            * HTML.pm.src: Make a "bare ero" error for unknown
178            entities point to the "&" character.
179    
180    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
181    
182            * HTML.pm.src: It turns out that U+FFFD doesn't have to
183            be added to the list of excluded characters.
184    
185    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
186    
187            * HTML.pm.src ($char_onerror): Give the character decoder's |line|
188            and |column| higher priority than those set by the
189            tokenizer's input handler.
190            ($self->{read_until}): Exclude U+FFFD (but this might
191            not be necessary, since now we do line/column fixup in
192            the character decode handle).
193    
194    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
195    
196            * HTML.pm.src: Use |{read_until}| where possible.
197    
198    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
199    
200            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
201            and |manakai_getc_until| to |manakai_read_until| to
202            reduce the number of string copies.
203    
204    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
205    
206            * HTML.pm.src (parse_char_string): Use newly created
207            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
208            standard feature to |open| a string as a filehandle,
209            since Perl's string filehandle does not seem to support the
210            |ungetc| method correctly.
211            (parse_char_stream): Define |{getc_until}| method.
212            (DATA_STATE): Experimental support for |getc_until| feature.
213    
214    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
215    
216            * HTML.pm.src: Check points added to newly added branches.
217    
218    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
219    
220            * HTML.pm.src: Remove |{char}|, which is no longer used.
221            Replace |{entity_in_attr}| and |{last_attribute_value_state}|
222            with |{prev_state}|.
223    
224            * mkhtmlparser.pl: Remove |{char}| feature.
225            Remove |!!!back-next-input-character;| macro.
226    
227    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
228    
229            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
230            entity related tokenizer states in favor of new states
231            implementing the consume character reference algorithm.
232    
233    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
234    
235            * HTML.pm.src: "Consume a character reference" algorithm is
236            now implemented as a tokenizer's state, rather than
237            a method, with minimum changes (more changes will
238            be made, in due course).  "Bogus comment state"'s inner
239            loop gets removed.
240    
241    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
242    
243            * HTML.pm.src: Move |PUBLIC| and |SYSTEM| keyword tokenizing
244            into their own tokenizer states.
245    
246    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
247    
248            * HTML.pm.src: |CDATA_SECTION_STATE| (formerly |CDATA_BLOCK_STATE|)
249            is split into three states.
250    
251    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
252    
253            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is split into
254            itself and the new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
255            the tokenizer no longer has to push back next input
256            characters in those states.
257    
258    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
259    
260            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| is split
261            into four states so that the tokenizer no longer has to push
262            back next input characters in that state.
263    
264    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
265    
266            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
267            which can be used to insert some wrapper between the character
268            stream handle and the tokenizer.  (It is currently not supported
269            for |set_inner_html| for |Element|s).
270    
271    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
272    
273            * HTML.pm.src: Ignore punctuation in charset names.
274    
275    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
276    
277            * ContentChecker.pm: Support for charset-layer error levels.
278    
279            * HTML.pm.src: Don't specify |text| argument for the
280            |chardecode:fallback| error, since it is not the encoding
281            being used alternatively.
282    
283    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
284    
285            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
286    
287    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
288    
289            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
290    
291    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
292    
293            * HTML.pm.src: Bug fix and sync with the spec with regard
294            to after after frameset insertion mode processing (HTML5
295            revision 1909).  Note that the implementation was wrong
296            per the old spec before the r1909 changes.
297    
298    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
299    
300            * HTMLTable.pm: scope=auto algorithm fix synced with the
301            spec (HTML5 revision 2093).
302            ($process_row): Algorithm step numbers synced with the
303            spec (HTML5 revision 2092).
304    
305    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
306    
307            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
308            revision 2094).
309    
310    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
311    
312            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
313    
314    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
315    
316            * HTML.pm.src: '"' and "'" at the end of attribute
317            name (after another attribute) now raise parse error (HTML5
318            revision 2123).  Empty unquoted attribute values are no
319            longer allowed (HTML5 revision 2122).
320    
321    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
322    
323            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
324            revision 2130).
325    
326    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
327    
328            * ContentChecker.pm: |xml:lang| attribute value must be the same
329            as |lang| attribute value for HTML elements (HTML5 revision 2062
330            and so on).
331    
332    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
333    
334            * ContentChecker.pm: Error level definition for |xml_id_error|
335            was missing.
336    
337            * URIChecker.pm: The end of the URL should be marked as the
338            error location for an empty path error.  The position
339            between the userinfo and the port components should be
340            marked as the error location for an empty host error.
341    
342    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
343    
344            * URIChecker.pm: Set error parameters indicating where in the
345            value each error occurs.  Report the unknown address format
346            error at the warning level, since address formats are rarely
347            added.  Path segments starting with "/.." were misinterpreted
348            as a dot-segment.
349    
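The dot-segment fix above can be illustrated by comparing whole path
segments rather than prefixes. This is a sketch in Python, not the
actual URIChecker.pm code; the function name dot_segments is
hypothetical, and the assumption is only that a dot-segment is a
segment that is exactly "." or "..":

```python
def dot_segments(path):
    """Return the segments of path that are exactly "." or ".."."""
    # Split on "/" and compare each whole segment, so that a segment
    # such as "..foo" is not mistaken for the dot-segment "..".
    return [seg for seg in path.split("/") if seg in (".", "..")]
```

For example, dot_segments("/..foo/bar") is empty, while
dot_segments("/a/../b/.") contains ".." and ".".
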
350    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
351    
352            * URIChecker.pm (check_iri_reference): Requires
353            |Message::DOM::DOMImplementation|.
354    
355    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
356    
357            * IMTChecker.pm: Updated for the new error reporting architecture.
358    
359            * ContentChecker.pm: Error levels for IMTs are added.
360    
361    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
362    
363            * H2H.pm (_shift_token): Support for unquoted HTML attribute
364            values.
365    
366    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
367    
368            * CacheManifest.pm: Support for new style of error
369            reports.
370    
371            * HTML.pm.src: Set line=1, column=1 to the document node.
372    
373    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
374    
375            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
376            and URL checkers.  Support for more error levels for bogus
377            language tag and URL "standards".
378    
379            * LangTag.pm, URIChecker.pm: Support for new style error
380            level reporting.
381    
382    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
383    
384            * ContentChecker.pm: Support for RDF/XML error levels.
385    
386            * HTMLTable.pm, RDFXML.pm: Support for new style of error level
387            specifying.  Error types are revised.
388    
389    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
390    
391            * ContentChecker.pm: All error reporting method calls are
392            renewed.
393    
394    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
395    
396            * HTML.pm.src: All error type names and "text" parameters
397            are revised.  Use new style for "level" specification.
398    
399            * mkhtmlparser.pl: Use new style for "level" specification.
400    
401    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
402    
403            * WebIDL.pm (parse_char_string): Simplified error
404            reporting process for broken ignored valuetype definition.
405            (Valuetype idl_text): Support for special "DOMString" name.
406    
407    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
408    
409            * WebIDL.pm ($get_scoped_name): Append "::::" if the last
410            terminal of the ScopedName is "DOMString", so that whether
411            the last part of the scoped name is "DOMString" or "_DOMString"
412            can be determined later.  This is necessary to determine whether
413            a |typedef| definition should be ignored or not.
414            (parse_char_string): Unescape the identifier of
415            exception members.
416            ($resolve): Return undef for builtin types and sequence<T>
417            types (we might not have to do this, however...).
418            (check): Support checking for Exceptions, Valuetypes,
419            and Typedefs.
420            ($serialize_type): Support for "DOMString::::" syntax.
421            (Typedef idl_text): Output Type as "DOMString" if it
422            is really "DOMString" (i.e. its internal representation
423            is "::DOMString::").
424    
425    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
426    
427            * WebIDL.pm ($resolve): New code, based on resolve code
428            for constant types in the |check| method.
429            (check): Support for checking of attributes, operations, and
430            arguments.
431            (Attribute/Operation idl_text): Exception names in getraises,
432            setraises, and raises clauses are serialized by the
433            |$serialize_type| code.
434    
435    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
436    
437            * WebIDL.pm ($integer): Order of alternatives is changed to match
438            hexadecimal numbers (the original pattern, taken from the spec,
439            did not work for hexadecimal numbers, because the "0" prefix
440            matches, followed by the [0-7]* part (as an empty string), and
441            therefore the pattern does not match the remaining "x..." part
442            of a "0x..." integer literal).
443            ($get_type): It now returns a string, not an array reference,
444            for regular types and |sequence| types (i.e. it in any case
445            returns a string).
446            ($get_next_token): The second item in the array that represents
447            an integer or float token is now a Perl number value, not the
448            original string representation of the number.
449            (check): Support for const value consistency checking.
450            No extended attribute is defined for constants.
451            (Node subclasses): Use simple strings rather than array references
452            for default data type values.
453            ($serialize_type): Type values are now simple strings.
454            (value): If the new attribute value is a false value, then
455            a FALSE value is set to the attribute.
456    
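The $integer ordering problem described above can be reproduced with
any regular expression engine that tries alternatives left to right.
The sketch below is in Python rather than Perl, and both patterns are
approximations written for illustration; the exact pattern from the
spec is not quoted in this ChangeLog. The names OCTAL_FIRST, HEX_FIRST
and lex_integer are hypothetical:

```python
import re

# Octal alternative first: "0" matches, [0-7]* matches the empty string,
# and the match ends there, so the hexadecimal alternative is never tried.
OCTAL_FIRST = re.compile(r"-?0[0-7]*|-?0[Xx][0-9A-Fa-f]+|-?[1-9][0-9]*")

# Hexadecimal alternative first: "0x..." is consumed as a whole.
HEX_FIRST = re.compile(r"-?0[Xx][0-9A-Fa-f]+|-?0[0-7]*|-?[1-9][0-9]*")


def lex_integer(pattern, text):
    """Return the integer literal matched at the start of text, or None."""
    m = pattern.match(text)
    return m.group(0) if m else None
```

With the input "0x1F", the octal-first pattern matches only "0" and
leaves "x1F" behind, while the hex-first pattern matches the whole
literal. Octal and decimal literals match the same way under both
orderings.
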
457    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
458    
459            * WebIDL.pm ($get_scoped_name): Scoped names are now stored
460            in their stringified format ("scoped name" as defined in the
461            spec).  Note that a future version of this module should not use
462            array references for type values and the |type_text| attribute
463            should be made obsolete.
464            (parse_char_string): Unescape attribute names.
465            (check): Support for checking of whether inherited interfaces
466            are actually defined or not.  Support for checking of whether
467            interface member identifiers are duplicated or not.
468            ($serialize_type): Scoped names are returned as is.  A future
469            version of this code should escape identifiers other than "DOMString",
470            otherwise the idl_text would be non-conforming.
471    
472    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
473    
474            * WebIDL.pm (parse_char_string): Set line/column numbers
475            to generated nodes.  Unescape identifiers.  Extended attributes
476            for Definition's were ignored.
477            (append_child): Set |parent_node| attribute.
478            (parent_node): New attribute.
479            (check): Support interface/exception members.  Support
480            extended attributes.  Support definition identifier uniqueness
481            constraint.
482            (qualified_name): New attribute.
483            (Interface/Exception idl_text): Extended attributes were
484            not prepended to the returned text.
485    
486    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
487    
488            * WebIDL.pm (parse_char_string): Set line/column numbers
489            to interface object experimentally.  s/shift/pop/g; shift
490            would produce wrong results.  Support for interface forward
491            declarations was missing.  Broken interface declarations
492            with no block were not ignored entirely.
493            (Whatpm::WebIDL::Node): New abstract class.  This class
494            makes things easier.
495            (child_nodes): New attribute.  Unlike DOM's attribute with
496            the same name, this attribute returns a dead list of nodes for
497            simplicity.
498            (get_user_data, set_user_data): New methods.
499            (Module idl_text): A SPACE character should be inserted
500            before the |{| character.
501            (Interface idl_text): Support for interface forward declarations.
502            (is_forward_declaration): New attribute.
503    
504    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
505    
506            * WebIDL.pm (type_text): Better serializer.
507    
508    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
509    
510            * WebIDL.pm: Revise forward-compatible parsing so that
511            it can now handle broken extended attributes and the like.
512    
513    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
514    
515            * WebIDL.pm: Real support for extended attributes.
