/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.252 by wakaba, Sun Jun 1 06:47:08 2008 UTC revision 1.316 by wakaba, Tue Sep 16 10:43:18 2008 UTC
# Line 1  Line 1 
1    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
2    
3            * WebIDL.pm: Support for the reminding extended attributes are
4            added.  It does not satisfy the definition that a forward
5            interface declaration has an extended attribute.  It seems that
6            unless explicitly allowed multiple extended attributes with the
7            same name is not allowed, though it is not explicitly mentioned in
8            the spec.
9    
10    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
11    
12            * WebIDL.pm: Unescapes extended attribute names and extended
13            attribute identifiers.  Preserve whether an extended attribute has
14            an argument list of not.  Support for extended attributes:
15            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
16            NameGetter, NameSetter, and Null.
17            (has_argument_list): New attribute.
18            (idl_text): Stringifies argument lists, if any, even if it is
19            empty.
20    
21    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
22    
23            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
24            |{s_kwd}| in DATA_STATE as default.
25    
26    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
27    
28            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
29            by |{s_kwd}| in DATA_STATE.
30    
31    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
32    
33            * HTML.pm.src: Shorten keys.
34    
35    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
36    
37            * HTML.pm.src: Remove checking for control character, surrogate
38            pair, or noncharacter code points and non-Unicode code
39            points (they should be handled by Whatpm::Charset::UnicodeChecker).
40            (parse_char_stream): Support for the |$get_wrapper| argument and
41            character stream error handlers.
42    
43    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
44    
45            * ContentChecker.pm: Don't call |loda_ns_module|
46            for null-namespace elements/attributes.
47    
48            * HTML.pm.src: Fact out $disallowed_control_chars
49            as a hash.
50    
51    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
52    
53            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
54            and |{next_char}| initializations are moved to initialization
55            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
56            with |parse_char_stream|.
57    
58    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
59    
60            * HTML.pm.src (parse_char_stream): Make |set_next_char|
61            invoke |manakai_read_until|, not only |read|, where
62            possible, to decrease the number of |read| method calls.
63    
64            * mkhtmlparser.pl: Related changes to the aforementioned
65            modification.
66    
67    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
68    
69            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
70            would report character error from now.
71    
72    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
73    
74            * HTML.pm.src: White-space-leaded non-white-space character
75            tokens in "before head insertion mode" was not
76            correctly handled.
77            (set_inner_html): Reimplemented using CharString decodehandle
78            class.  Support for $get_wrapper argument.  Support
79            for |{read_until}| feature.
80    
81    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
82    
83            * HTML.pm.src: Make a "bare ero" error for unknown
84            entities point the "&" character.
85    
86    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
87    
88            * HTML.pm.src: It turns out that U+FFFD don't have to
89            be added to the list of excluded characters.
90    
91    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
92    
93            * HTML.pm.src ($char_onerror): Have character decoder's |line|
94            and |column| a higher priority than the one set by the
95            tokenizer's input handler.
96            ($self->{read_until}): Exclude U+FFFD (but this might
97            not be necessary, since now we do line/column fixup in
98            the character decode handle).
99    
100    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
101    
102            * HTML.pm.src: Use |{read_until}| where possible.
103    
104    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
105    
106            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
107            and |manakai_getc_until| to |manakai_read_until| to
108            reduce the number of string copies.
109    
110    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
111    
112            * HTML.pm.src (parse_char_string): Use newly created
113            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
114            standard feature to |open| a string as a filehandle,
115            since Perl's string filehandle seems not supporting |ungetc|
116            method correctly.
117            (parse_char_stream): Define |{getc_until}| method.
118            (DATA_STATE): Experimental support for |getc_until| feature.
119    
120    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
121    
122            * HTML.pm.src: Check points added to newly added branches.
123    
124    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
125    
126            * HTML.pm.src: Remove |{char}|, which is no longer used.
127            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
128            and replaced by |{prev_state}|.
129    
130            * mkhtmlparser.pl: Remove |{char}| feature.
131            Remove |!!!back-next-input-character;| macro.
132    
133    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
134    
135            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
136            entity related tokenizer states in favor of new states
137            implementing the consume character reference algorithm.
138    
139    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
140    
141            * HTML.pm.src: "Consume a character reference" algorithm is
142            now implemented as a tokenizer's state, rather than
143            a method, with minimum changes (more changes will
144            be made, in due course).  "Bogus comment state"'s inner
145            loop gets removed.
146    
147    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
148    
149            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
150            into their own tokenizer states.
151    
152    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
153    
154            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
155            is split into three states.
156    
157    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
158    
159            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
160            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
161            no longer does the tokenizer have to push back next input
162            characters in those states.
163    
164    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
165    
166            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
167            into four states so that no longer does the tokenizer have to push
168            back next input characters in that state.
169    
170    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
171    
172            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
173            which can be used to insert some wrapper between the character
174            stream handle and the tokenizer.  (It is currently not supported
175            for |set_inner_html| for |Element|s).
176    
177    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
178    
179            * HTML.pm.src: Ignore punctuations in charset names.
180    
181    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
182    
183            * ContentChecker.pm: Support for charset-layer error levels.
184    
185            * HTML.pm.src: Don't specify |text| argument for the
186            |chardecode:fallback| error, since it is not the encoding
187            being used alternatively.
188    
189    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
190    
191            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
192    
193    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
194    
195            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
196    
197    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
198    
199            * HTML.pm.src: Bug fix and sync with the spec with regard
200            to after after frameset insertion mode processing (HTML5
201            revision 1909).  Note that the implementation was wrong
202            per the old spec before the r1909 changes.
203    
204    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
205    
206            * HTMLTable.pm: scope=auto algorithm fix synced with the
207            spec (HTML5 revision 2093).
208            ($process_row): Algorithm step numbers synced with the
209            spec (HTML5 revision 2092).
210    
211    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
212    
213            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
214            revision 2094).
215    
216    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
217    
218            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
219    
220    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
221    
222            * HTML.pm.src: '"' and "'" at the end of attribute
223            name (after another attribute) now raise parse error (HTML5
224            revision 2123).  Empty unquoted attribute values are no
225            longer allowed (HTML5 revision 2122).
226    
227    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
228    
229            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
230            revision 2130).
231    
232    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
233    
234            * ContentChecker.pm: |xml:lang| attribute value must be same
235            as |lang| attribute value for HTML elements (HTML5 revision 2062
236            and so on).
237    
238    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
239    
240            * ContentChecker.pm: Error level definition for |xml_id_error|
241            was missing.
242    
243            * URIChecker.pm: The end of the URL should be marked as the
244            error location for an empty path error.  The position
245            between the userinfo and the port components should be
246            marked as the error location for an empty host error.
247    
248    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
249    
250            * URIChecker.pm: Set parameters representing where in the
251            value the error occurs for errors.  Report unknown
252            address format error in warning level, since address
253            formats are rarely added.  Path segments starting with "/.."
254            were misinterpreted as a dot-segment.
255    
256    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
257    
258            * URIChecker.pm (check_iri_reference): Requires
259            |Message::DOM::DOMImplementation|.
260    
261    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
262    
263            * IMTChecker.pm: Updated for the new error reporting architecture.
264    
265            * ContentChecker.pm: Error levels for IMTs are added.
266    
267    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
268    
269            * H2H.pm (_shift_token): Support for unquoted HTML attribute
270            values.
271    
272    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
273    
274            * CacheManifest.pm: Support for new style of error
275            reports.
276    
277            * HTML.pm.src: Set line=1, column=1 to the document node.
278    
279    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
280    
281            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
282            and URL checkers.  Support for more error levels for bogus
283            langauge tag and URL "standards".
284    
285            * LangTag.pm, URIChecker.pm: Support for new style error
286            level reporting.
287    
288    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
289    
290            * ContentChecker.pm: Support for RDF/XML error levels.
291    
292            * HTMLTable.pm, RDFXML.pm: Support for new style of error level
293            specifying.  Error types are revised.
294    
295    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
296    
297            * ContentChecker.pm: All error reporting method calls are
298            renewed.
299    
300    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
301    
302            * HTML.pm.src: All error type names and "text" parameters
303            are revised.  Use new style for "level" specification.
304    
305            * mkhtmlparser.pl: Use new style for "level" specification.
306    
307    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
308    
309            * WebIDL.pm (parse_char_string): Simplified error
310            reporting process for broken ignored valuetype definition.
311            (Valuetype idl_text): Support for special "DOMString" name.
312    
313    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
314    
315            * WebIDL.pm ($get_scoped_name): Append "::::" if the last
316            terminal of the ScopedName is "DOMString", such that whether
317            the last part of the scoped name is "DOMString" or "_DOMString"
318            later.  It is necessary to determine whether a |typedef|
319            definition should be ignored or not.
320            (parse_char_string): Unescape the identifier of
321            exception members.
322            ($resolve): Return undef for builtin types and sequence<T>
323            types (we might not have to do this, however...).
324            (check): Support checking for Exceptions, Valuetypes,
325            and Typedefs.
326            ($serialize_type): Support for "DOMString::::" syntax.
327            (Typedef idl_text): Output Type as "DOMString" if it
328            is really "DOMString" (i.e. its internal representation
329            is "::DOMString::").
330    
331    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
332    
333            * WebIDL.pm ($resolve): New code, based on resolve code
334            for constant types in the |check| method.
335            (check): Support for checking of attributes, operations, and
336            arguments.
337            (Attribute/Operation idl_text): Exception names in getraises,
338            setraises, and raises clauses is serizlied by |$serialize_type|
339            code.
340    
341    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
342    
343            * WebIDL.pm ($integer): Order of selections are changed to match
344            hexadecimal numbers (the original pattern, taken from the spec,
345            was not work for hexadecimal numbers, because the "0" prefix
346            matches to the [0-7]* part (as an empty string) and therefore
347            it does not match with remaining "x..." part of a "0x..." integer
348            literal.
349            ($get_type): It now returns a string, not an array reference,
350            for regular types and |sequence| types (i.e. it in any case
351            returns a string).
352            ($get_next_token): The second item in the array that represents
353            a integer or float token is now a Perl number value, not the
354            original string representation of the number.
355            (check): Support for const value consistency checking.
356            No extended attribute is defined for constants.
357            (Node subclasses): Use simple strings rather than array references
358            for default data type values.
359            ($serialize_type): Type values are now simple strings.
360            (value): If the new attribute value is a false value, then
361            a FALSE value is set to the attribute.
362    
363    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
364    
365            * WebIDL.pm ($get_scoped_name): Now scoped names are stored
366            in its stringified format ("scoped name" as defined in the
367            spec).  Note that future version of this module should not use
368            array references for type values and the |type_text| attribute
369            should be made obsolete.
370            (parse_char_string): Unescape attribute names.
371            (check): Support for checking of whether inherited interfaces
372            are actually defined or not.  Support for checking of whether
373            interface member identifiers are duplicated or not.
374            ($serialize_type): Scoped names are returned as is.  A future
375            version of this code should escape identifiers other than "DOMString",
376            otherwise the idl_text would be non-conforming.
377    
378    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
379    
380            * WebIDL.pm (parse_char_string): Set line/column numbers
381            to generated nodes.  Unescape identifiers.  Extended attributes
382            for Definition's were ignored.
383            (append_child): Set |parent_node| attribute.
384            (parent_node): New attribute.
385            (check): Support interface/exception members.  Support
386            extended attributes.  Support definition identifier uniqueness
387            constraint.
388            (qualified_name): New attribute.
389            (Interface/Exception idl_text): Extended attributes were
390            not prepended to the returned text.
391    
392    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
393    
394            * WebIDL.pm (parse_char_string): Set line/column numbers
395            to interface object experimentally.  s/shift/pop/g, shift
396            would make things wrong.  Support for interface forward
397            declarations was missing.  Broken interface declarations
398            with no block were not ignored entirely.
399            (Whatpm::WebIDL::Node): New abstract class.  This class
400            makes things easier.
401            (child_nodes): New attribute.  Unlike DOM's attribute with
402            same name, this attribute returns a dead list of nodes for
403            simplicity.
404            (get_user_data, set_user_data): New methods.
405            (Module idl_text): A SPACE character should be inserted
406            before the |{| character.
407            (Interface idl_text): Support for interface forward declarations.
408            (is_forward_declaration): New attribute.
409    
410    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
411    
412            * WebIDL.pm (type_text): Better serializer.
413    
414    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
415    
416            * WebIDL.pm: Revise forward-compatible parsing so that
417            it now can handle broken extended attributes and as such.
418    
419    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
420    
421            * WebIDL.pm: Real support for extended attributes.
422            Support for extended attributes with arguments.
423    
424    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
425    
426            * WebIDL.pm: Support for |exception| syntax.
427            (Interface->idl_text): Tentative support for inheritances.
428    
429    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
430    
431            * WebIDL.pm: Hierarchical scoped name support was broken.
432            Support for raises, setraises, and getraises syntaxes.
433    
434    2008-07-18  Wakaba  <wakaba@suika.fam.cx>
435    
436            * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
437            proper support for types, extended attributes, and exceptions yet).
438            WebIDL parser, version 1 (no support for exceptions yet,
439            no proper support for extended attributes yet).
440    
441    2008-07-09  Wakaba  <wakaba@suika.fam.cx>
442    
443            * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
444    
445    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
446    
447            * WebIDL.pm: Support for valuetype and const.
448    
449    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
450            
451            * WebIDL.pm: New module.
452    
453    2008-06-15  Wakaba  <wakaba@suika.fam.cx>
454    
455            * Makefile (Entities.html): URI changed.
456    
457    2008-06-08  Wakaba  <wakaba@suika.fam.cx>
458    
459            * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
460    
461  2008-06-01  Wakaba  <wakaba@suika.fam.cx>  2008-06-01  Wakaba  <wakaba@suika.fam.cx>
462    
463          * HTML.pm.src (_get_next_token): A parse error was missing.          * HTML.pm.src (_get_next_token): A parse error was missing.

Legend:
Removed from v.1.252  
changed lines
  Added in v.1.316

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24