/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.237 by wakaba, Sun May 18 03:46:26 2008 UTC revision 1.310 by wakaba, Mon Sep 15 02:54:12 2008 UTC
# Line 1  Line 1 
1    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
2    
3            * ContentChecker.pm: Don't call |loda_ns_module|
4            for null-namespace elements/attributes.
5    
6            * HTML.pm.src: Fact out $disallowed_control_chars
7            as a hash.
8    
9    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
10    
11            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
12            and |{next_char}| initializations are moved to initialization
13            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
14            with |parse_char_stream|.
15    
16    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
17    
18            * HTML.pm.src (parse_char_stream): Make |set_next_char|
19            invoke |manakai_read_until|, not only |read|, where
20            possible, to decrease the number of |read| method calls.
21    
22            * mkhtmlparser.pl: Related changes to the aforementioned
23            modification.
24    
25    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
26    
27            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
28            would report character error from now.
29    
30    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
31    
32            * HTML.pm.src: White-space-leaded non-white-space character
33            tokens in "before head insertion mode" was not
34            correctly handled.
35            (set_inner_html): Reimplemented using CharString decodehandle
36            class.  Support for $get_wrapper argument.  Support
37            for |{read_until}| feature.
38    
39    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
40    
41            * HTML.pm.src: Make a "bare ero" error for unknown
42            entities point the "&" character.
43    
44    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
45    
46            * HTML.pm.src: It turns out that U+FFFD don't have to
47            be added to the list of excluded characters.
48    
49    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
50    
51            * HTML.pm.src ($char_onerror): Have character decoder's |line|
52            and |column| a higher priority than the one set by the
53            tokenizer's input handler.
54            ($self->{read_until}): Exclude U+FFFD (but this might
55            not be necessary, since now we do line/column fixup in
56            the character decode handle).
57    
58    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
59    
60            * HTML.pm.src: Use |{read_until}| where possible.
61    
62    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
63    
64            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
65            and |manakai_getc_until| to |manakai_read_until| to
66            reduce the number of string copies.
67    
68    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
69    
70            * HTML.pm.src (parse_char_string): Use newly created
71            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
72            standard feature to |open| a string as a filehandle,
73            since Perl's string filehandle seems not supporting |ungetc|
74            method correctly.
75            (parse_char_stream): Define |{getc_until}| method.
76            (DATA_STATE): Experimental support for |getc_until| feature.
77    
78    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
79    
80            * HTML.pm.src: Check points added to newly added branches.
81    
82    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
83    
84            * HTML.pm.src: Remove |{char}|, which is no longer used.
85            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
86            and replaced by |{prev_state}|.
87    
88            * mkhtmlparser.pl: Remove |{char}| feature.
89            Remove |!!!back-next-input-character;| macro.
90    
91    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
92    
93            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
94            entity related tokenizer states in favor of new states
95            implementing the consume character reference algorithm.
96    
97    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
98    
99            * HTML.pm.src: "Consume a character reference" algorithm is
100            now implemented as a tokenizer's state, rather than
101            a method, with minimum changes (more changes will
102            be made, in due course).  "Bogus comment state"'s inner
103            loop gets removed.
104    
105    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
106    
107            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
108            into their own tokenizer states.
109    
110    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
111    
112            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
113            is split into three states.
114    
115    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
116    
117            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
118            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
119            no longer does the tokenizer have to push back next input
120            characters in those states.
121    
122    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
123    
124            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
125            into four states so that no longer does the tokenizer have to push
126            back next input characters in that state.
127    
128    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
129    
130            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
131            which can be used to insert some wrapper between the character
132            stream handle and the tokenizer.  (It is currently not supported
133            for |set_inner_html| for |Element|s).
134    
135    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
136    
137            * HTML.pm.src: Ignore punctuations in charset names.
138    
139    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
140    
141            * ContentChecker.pm: Support for charset-layer error levels.
142    
143            * HTML.pm.src: Don't specify |text| argument for the
144            |chardecode:fallback| error, since it is not the encoding
145            being used alternatively.
146    
147    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
148    
149            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
150    
151    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
152    
153            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
154    
155    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
156    
157            * HTML.pm.src: Bug fix and sync with the spec with regard
158            to after after frameset insertion mode processing (HTML5
159            revision 1909).  Note that the implementation was wrong
160            per the old spec before the r1909 changes.
161    
162    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
163    
164            * HTMLTable.pm: scope=auto algorithm fix synced with the
165            spec (HTML5 revision 2093).
166            ($process_row): Algorithm step numbers synced with the
167            spec (HTML5 revision 2092).
168    
169    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
170    
171            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
172            revision 2094).
173    
174    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
175    
176            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
177    
178    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
179    
180            * HTML.pm.src: '"' and "'" at the end of attribute
181            name (after another attribute) now raise parse error (HTML5
182            revision 2123).  Empty unquoted attribute values are no
183            longer allowed (HTML5 revision 2122).
184    
185    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
186    
187            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
188            revision 2130).
189    
190    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
191    
192            * ContentChecker.pm: |xml:lang| attribute value must be same
193            as |lang| attribute value for HTML elements (HTML5 revision 2062
194            and so on).
195    
196    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
197    
198            * ContentChecker.pm: Error level definition for |xml_id_error|
199            was missing.
200    
201            * URIChecker.pm: The end of the URL should be marked as the
202            error location for an empty path error.  The position
203            between the userinfo and the port components should be
204            marked as the error location for an empty host error.
205    
206    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
207    
208            * URIChecker.pm: Set parameters representing where in the
209            value the error occurs for errors.  Report unknown
210            address format error in warning level, since address
211            formats are rarely added.  Path segments starting with "/.."
212            were misinterpreted as a dot-segment.
213    
214    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
215    
216            * URIChecker.pm (check_iri_reference): Requires
217            |Message::DOM::DOMImplementation|.
218    
219    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
220    
221            * IMTChecker.pm: Updated for the new error reporting architecture.
222    
223            * ContentChecker.pm: Error levels for IMTs are added.
224    
225    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
226    
227            * H2H.pm (_shift_token): Support for unquoted HTML attribute
228            values.
229    
230    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
231    
232            * CacheManifest.pm: Support for new style of error
233            reports.
234    
235            * HTML.pm.src: Set line=1, column=1 to the document node.
236    
237    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
238    
239            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
240            and URL checkers.  Support for more error levels for bogus
241            langauge tag and URL "standards".
242    
243            * LangTag.pm, URIChecker.pm: Support for new style error
244            level reporting.
245    
246    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
247    
248            * ContentChecker.pm: Support for RDF/XML error levels.
249    
250            * HTMLTable.pm, RDFXML.pm: Support for new style of error level
251            specifying.  Error types are revised.
252    
253    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
254    
255            * ContentChecker.pm: All error reporting method calls are
256            renewed.
257    
258    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
259    
260            * HTML.pm.src: All error type names and "text" parameters
261            are revised.  Use new style for "level" specification.
262    
263            * mkhtmlparser.pl: Use new style for "level" specification.
264    
265    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
266    
267            * WebIDL.pm (parse_char_string): Simplified error
268            reporting process for broken ignored valuetype definition.
269            (Valuetype idl_text): Support for special "DOMString" name.
270    
271    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
272    
273            * WebIDL.pm ($get_scoped_name): Append "::::" if the last
274            terminal of the ScopedName is "DOMString", such that whether
275            the last part of the scoped name is "DOMString" or "_DOMString"
276            later.  It is necessary to determine whether a |typedef|
277            definition should be ignored or not.
278            (parse_char_string): Unescape the identifier of
279            exception members.
280            ($resolve): Return undef for builtin types and sequence<T>
281            types (we might not have to do this, however...).
282            (check): Support checking for Exceptions, Valuetypes,
283            and Typedefs.
284            ($serialize_type): Support for "DOMString::::" syntax.
285            (Typedef idl_text): Output Type as "DOMString" if it
286            is really "DOMString" (i.e. its internal representation
287            is "::DOMString::").
288    
289    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
290    
291            * WebIDL.pm ($resolve): New code, based on resolve code
292            for constant types in the |check| method.
293            (check): Support for checking of attributes, operations, and
294            arguments.
295            (Attribute/Operation idl_text): Exception names in getraises,
296            setraises, and raises clauses is serizlied by |$serialize_type|
297            code.
298    
299    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
300    
301            * WebIDL.pm ($integer): Order of selections are changed to match
302            hexadecimal numbers (the original pattern, taken from the spec,
303            was not work for hexadecimal numbers, because the "0" prefix
304            matches to the [0-7]* part (as an empty string) and therefore
305            it does not match with remaining "x..." part of a "0x..." integer
306            literal.
307            ($get_type): It now returns a string, not an array reference,
308            for regular types and |sequence| types (i.e. it in any case
309            returns a string).
310            ($get_next_token): The second item in the array that represents
311            a integer or float token is now a Perl number value, not the
312            original string representation of the number.
313            (check): Support for const value consistency checking.
314            No extended attribute is defined for constants.
315            (Node subclasses): Use simple strings rather than array references
316            for default data type values.
317            ($serialize_type): Type values are now simple strings.
318            (value): If the new attribute value is a false value, then
319            a FALSE value is set to the attribute.
320    
321    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
322    
323            * WebIDL.pm ($get_scoped_name): Now scoped names are stored
324            in its stringified format ("scoped name" as defined in the
325            spec).  Note that future version of this module should not use
326            array references for type values and the |type_text| attribute
327            should be made obsolete.
328            (parse_char_string): Unescape attribute names.
329            (check): Support for checking of whether inherited interfaces
330            are actually defined or not.  Support for checking of whether
331            interface member identifiers are duplicated or not.
332            ($serialize_type): Scoped names are returned as is.  A future
333            version of this code should escape identifiers other than "DOMString",
334            otherwise the idl_text would be non-conforming.
335    
336    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
337    
338            * WebIDL.pm (parse_char_string): Set line/column numbers
339            to generated nodes.  Unescape identifiers.  Extended attributes
340            for Definition's were ignored.
341            (append_child): Set |parent_node| attribute.
342            (parent_node): New attribute.
343            (check): Support interface/exception members.  Support
344            extended attributes.  Support definition identifier uniqueness
345            constraint.
346            (qualified_name): New attribute.
347            (Interface/Exception idl_text): Extended attributes were
348            not prepended to the returned text.
349    
350    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
351    
352            * WebIDL.pm (parse_char_string): Set line/column numbers
353            to interface object experimentally.  s/shift/pop/g, shift
354            would make things wrong.  Support for interface forward
355            declarations was missing.  Broken interface declarations
356            with no block were not ignored entirely.
357            (Whatpm::WebIDL::Node): New abstract class.  This class
358            makes things easier.
359            (child_nodes): New attribute.  Unlike DOM's attribute with
360            same name, this attribute returns a dead list of nodes for
361            simplicity.
362            (get_user_data, set_user_data): New methods.
363            (Module idl_text): A SPACE character should be inserted
364            before the |{| character.
365            (Interface idl_text): Support for interface forward declarations.
366            (is_forward_declaration): New attribute.
367    
368    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
369    
370            * WebIDL.pm (type_text): Better serializer.
371    
372    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
373    
374            * WebIDL.pm: Revise forward-compatible parsing so that
375            it now can handle broken extended attributes and as such.
376    
377    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
378    
379            * WebIDL.pm: Real support for extended attributes.
380            Support for extended attributes with arguments.
381    
382    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
383    
384            * WebIDL.pm: Support for |exception| syntax.
385            (Interface->idl_text): Tentative support for inheritances.
386    
387    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
388    
389            * WebIDL.pm: Hierarchical scoped name support was broken.
390            Support for raises, setraises, and getraises syntaxes.
391    
392    2008-07-18  Wakaba  <wakaba@suika.fam.cx>
393    
394            * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
395            proper support for types, extended attributes, and exceptions yet).
396            WebIDL parser, version 1 (no support for exceptions yet,
397            no proper support for extended attributes yet).
398    
399    2008-07-09  Wakaba  <wakaba@suika.fam.cx>
400    
401            * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
402    
403    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
404    
405            * WebIDL.pm: Support for valuetype and const.
406    
407    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
408            
409            * WebIDL.pm: New module.
410    
411    2008-06-15  Wakaba  <wakaba@suika.fam.cx>
412    
413            * Makefile (Entities.html): URI changed.
414    
415    2008-06-08  Wakaba  <wakaba@suika.fam.cx>
416    
417            * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
418    
419    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
420    
421            * HTML.pm.src (_get_next_token): A parse error was missing.
422    
423    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
424    
425            * mklinktypelist.pl: rel=contact is no longer part of the HTML5
426            spec (commented out). (HTML5 revision 1711).
427    
428    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
429    
430            * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
431    
432            * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
433            as UTF-16 (HTML5 revision 1701).
434    
435    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
436    
437            * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
438            1692).
439    
440    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
441    
442            * HTML.pm.src: The secondary insertion mode used when switching
443            to foreign content is the "in body" insertion mode (HTML5 revision
444            1696).
445    
446    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
447    
448            * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
449            revision 1697).
450    
451    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
452    
453            * HTML.pm.src: Support for end-of-file token in foreign content
454            insertion mode (HTML5 revision 1693).  Update SVG camelCase
455            attribute list (HTML5 revision 1700).  <textarea> closes
456            </select> (HTML5 revision 1699).  More start tags close in
457            foreign content insertion mode (HTML5 revision 1698).
458    
459    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
460    
461            * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
462    
463    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
464    
465            * HTML.pm.src: More robust charset parameter detection (HTML5
466            revision 1674).
467    
468    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
469    
470            * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
471            revision 1676).
472    
473    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
474    
475            * HTML.pm.src: Ignore language part of public identifiers for
476            quriks mode detection (HTML5 revision 1679).
477    
478    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
479    
480            * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
481            revision 1685).
482    
483    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
484    
485            * HTML.pm.src: Support for EOF in new states for tags (HTML5
486            revision 1684).
487    
488    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
489    
490            * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
491            work (HTML5 revision 1690).
492    
493    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
494    
495            * HTML.pm.src (_tree_construction_main): Change handling of
496            end tags in head insertion modes (HTML5 revision 1686).
497            (parse_char_string): Bug fix for non-utf8 character string handlings.
498            (parse_char_stream): |ungetc| does not work well for this context.
499    
500    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
501    
502            * HTML.pm.src (parse_byte_string): Redefined to invoke
503            |parse_byte_stream|.
504            (parse_byte_stream): New method.
505    
506  2008-05-18  Wakaba  <wakaba@suika.fam.cx>  2008-05-18  Wakaba  <wakaba@suika.fam.cx>
507    
508          * HTML.pm.src (parse_byte_string): Fix the column number reported          * HTML.pm.src (parse_byte_string): Fix the column number reported

Legend:
Removed from v.1.237  
changed lines
  Added in v.1.310

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24