/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.237 by wakaba, Sun May 18 03:46:26 2008 UTC revision 1.309 by wakaba, Sun Sep 14 14:35:43 2008 UTC
# Line 1  Line 1 
1    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
4            and |{next_char}| initializations are moved to initialization
5            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
6            with |parse_char_stream|.
7    
8    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
9    
10            * HTML.pm.src (parse_char_stream): Make |set_next_char|
11            invoke |manakai_read_until|, not only |read|, where
12            possible, to decrease the number of |read| method calls.
13    
14            * mkhtmlparser.pl: Related changes to the aforementioned
15            modification.
16    
17    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
18    
19            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
20            would report character error from now.
21    
22    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
23    
24            * HTML.pm.src: White-space-leaded non-white-space character
25            tokens in "before head insertion mode" was not
26            correctly handled.
27            (set_inner_html): Reimplemented using CharString decodehandle
28            class.  Support for $get_wrapper argument.  Support
29            for |{read_until}| feature.
30    
31    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
32    
33            * HTML.pm.src: Make a "bare ero" error for unknown
34            entities point the "&" character.
35    
36    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
37    
38            * HTML.pm.src: It turns out that U+FFFD don't have to
39            be added to the list of excluded characters.
40    
41    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
42    
43            * HTML.pm.src ($char_onerror): Have character decoder's |line|
44            and |column| a higher priority than the one set by the
45            tokenizer's input handler.
46            ($self->{read_until}): Exclude U+FFFD (but this might
47            not be necessary, since now we do line/column fixup in
48            the character decode handle).
49    
50    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
51    
52            * HTML.pm.src: Use |{read_until}| where possible.
53    
54    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
55    
56            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
57            and |manakai_getc_until| to |manakai_read_until| to
58            reduce the number of string copies.
59    
60    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
61    
62            * HTML.pm.src (parse_char_string): Use newly created
63            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
64            standard feature to |open| a string as a filehandle,
65            since Perl's string filehandle seems not supporting |ungetc|
66            method correctly.
67            (parse_char_stream): Define |{getc_until}| method.
68            (DATA_STATE): Experimental support for |getc_until| feature.
69    
70    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
71    
72            * HTML.pm.src: Check points added to newly added branches.
73    
74    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
75    
76            * HTML.pm.src: Remove |{char}|, which is no longer used.
77            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
78            and replaced by |{prev_state}|.
79    
80            * mkhtmlparser.pl: Remove |{char}| feature.
81            Remove |!!!back-next-input-character;| macro.
82    
83    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
84    
85            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
86            entity related tokenizer states in favor of new states
87            implementing the consume character reference algorithm.
88    
89    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
90    
91            * HTML.pm.src: "Consume a character reference" algorithm is
92            now implemented as a tokenizer's state, rather than
93            a method, with minimum changes (more changes will
94            be made, in due course).  "Bogus comment state"'s inner
95            loop gets removed.
96    
97    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
98    
99            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
100            into their own tokenizer states.
101    
102    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
103    
104            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
105            is split into three states.
106    
107    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
108    
109            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
110            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
111            no longer does the tokenizer have to push back next input
112            characters in those states.
113    
114    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
115    
116            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
117            into four states so that no longer does the tokenizer have to push
118            back next input characters in that state.
119    
120    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
121    
122            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
123            which can be used to insert some wrapper between the character
124            stream handle and the tokenizer.  (It is currently not supported
125            for |set_inner_html| for |Element|s).
126    
127    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
128    
129            * HTML.pm.src: Ignore punctuations in charset names.
130    
131    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
132    
133            * ContentChecker.pm: Support for charset-layer error levels.
134    
135            * HTML.pm.src: Don't specify |text| argument for the
136            |chardecode:fallback| error, since it is not the encoding
137            being used alternatively.
138    
139    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
140    
141            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
142    
143    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
144    
145            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
146    
147    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
148    
149            * HTML.pm.src: Bug fix and sync with the spec with regard
150            to after after frameset insertion mode processing (HTML5
151            revision 1909).  Note that the implementation was wrong
152            per the old spec before the r1909 changes.
153    
154    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
155    
156            * HTMLTable.pm: scope=auto algorithm fix synced with the
157            spec (HTML5 revision 2093).
158            ($process_row): Algorithm step numbers synced with the
159            spec (HTML5 revision 2092).
160    
161    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
162    
163            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
164            revision 2094).
165    
166    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
167    
168            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
169    
170    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
171    
172            * HTML.pm.src: '"' and "'" at the end of attribute
173            name (after another attribute) now raise parse error (HTML5
174            revision 2123).  Empty unquoted attribute values are no
175            longer allowed (HTML5 revision 2122).
176    
177    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
178    
179            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
180            revision 2130).
181    
182    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
183    
184            * ContentChecker.pm: |xml:lang| attribute value must be same
185            as |lang| attribute value for HTML elements (HTML5 revision 2062
186            and so on).
187    
188    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
189    
190            * ContentChecker.pm: Error level definition for |xml_id_error|
191            was missing.
192    
193            * URIChecker.pm: The end of the URL should be marked as the
194            error location for an empty path error.  The position
195            between the userinfo and the port components should be
196            marked as the error location for an empty host error.
197    
198    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
199    
200            * URIChecker.pm: Set parameters representing where in the
201            value the error occurs for errors.  Report unknown
202            address format error in warning level, since address
203            formats are rarely added.  Path segments starting with "/.."
204            were misinterpreted as a dot-segment.
205    
206    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
207    
208            * URIChecker.pm (check_iri_reference): Requires
209            |Message::DOM::DOMImplementation|.
210    
211    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
212    
213            * IMTChecker.pm: Updated for the new error reporting architecture.
214    
215            * ContentChecker.pm: Error levels for IMTs are added.
216    
217    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
218    
219            * H2H.pm (_shift_token): Support for unquoted HTML attribute
220            values.
221    
222    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
223    
224            * CacheManifest.pm: Support for new style of error
225            reports.
226    
227            * HTML.pm.src: Set line=1, column=1 to the document node.
228    
229    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
230    
231            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
232            and URL checkers.  Support for more error levels for bogus
233            langauge tag and URL "standards".
234    
235            * LangTag.pm, URIChecker.pm: Support for new style error
236            level reporting.
237    
238    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
239    
240            * ContentChecker.pm: Support for RDF/XML error levels.
241    
242            * HTMLTable.pm, RDFXML.pm: Support for new style of error level
243            specifying.  Error types are revised.
244    
245    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
246    
247            * ContentChecker.pm: All error reporting method calls are
248            renewed.
249    
250    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
251    
252            * HTML.pm.src: All error type names and "text" parameters
253            are revised.  Use new style for "level" specification.
254    
255            * mkhtmlparser.pl: Use new style for "level" specification.
256    
257    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
258    
259            * WebIDL.pm (parse_char_string): Simplified error
260            reporting process for broken ignored valuetype definition.
261            (Valuetype idl_text): Support for special "DOMString" name.
262    
263    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
264    
265            * WebIDL.pm ($get_scoped_name): Append "::::" if the last
266            terminal of the ScopedName is "DOMString", such that whether
267            the last part of the scoped name is "DOMString" or "_DOMString"
268            later.  It is necessary to determine whether a |typedef|
269            definition should be ignored or not.
270            (parse_char_string): Unescape the identifier of
271            exception members.
272            ($resolve): Return undef for builtin types and sequence<T>
273            types (we might not have to do this, however...).
274            (check): Support checking for Exceptions, Valuetypes,
275            and Typedefs.
276            ($serialize_type): Support for "DOMString::::" syntax.
277            (Typedef idl_text): Output Type as "DOMString" if it
278            is really "DOMString" (i.e. its internal representation
279            is "::DOMString::").
280    
281    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
282    
283            * WebIDL.pm ($resolve): New code, based on resolve code
284            for constant types in the |check| method.
285            (check): Support for checking of attributes, operations, and
286            arguments.
287            (Attribute/Operation idl_text): Exception names in getraises,
288            setraises, and raises clauses is serizlied by |$serialize_type|
289            code.
290    
291    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
292    
293            * WebIDL.pm ($integer): Order of selections are changed to match
294            hexadecimal numbers (the original pattern, taken from the spec,
295            was not work for hexadecimal numbers, because the "0" prefix
296            matches to the [0-7]* part (as an empty string) and therefore
297            it does not match with remaining "x..." part of a "0x..." integer
298            literal.
299            ($get_type): It now returns a string, not an array reference,
300            for regular types and |sequence| types (i.e. it in any case
301            returns a string).
302            ($get_next_token): The second item in the array that represents
303            a integer or float token is now a Perl number value, not the
304            original string representation of the number.
305            (check): Support for const value consistency checking.
306            No extended attribute is defined for constants.
307            (Node subclasses): Use simple strings rather than array references
308            for default data type values.
309            ($serialize_type): Type values are now simple strings.
310            (value): If the new attribute value is a false value, then
311            a FALSE value is set to the attribute.
312    
313    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
314    
315            * WebIDL.pm ($get_scoped_name): Now scoped names are stored
316            in its stringified format ("scoped name" as defined in the
317            spec).  Note that future version of this module should not use
318            array references for type values and the |type_text| attribute
319            should be made obsolete.
320            (parse_char_string): Unescape attribute names.
321            (check): Support for checking of whether inherited interfaces
322            are actually defined or not.  Support for checking of whether
323            interface member identifiers are duplicated or not.
324            ($serialize_type): Scoped names are returned as is.  A future
325            version of this code should escape identifiers other than "DOMString",
326            otherwise the idl_text would be non-conforming.
327    
328    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
329    
330            * WebIDL.pm (parse_char_string): Set line/column numbers
331            to generated nodes.  Unescape identifiers.  Extended attributes
332            for Definition's were ignored.
333            (append_child): Set |parent_node| attribute.
334            (parent_node): New attribute.
335            (check): Support interface/exception members.  Support
336            extended attributes.  Support definition identifier uniqueness
337            constraint.
338            (qualified_name): New attribute.
339            (Interface/Exception idl_text): Extended attributes were
340            not prepended to the returned text.
341    
342    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
343    
344            * WebIDL.pm (parse_char_string): Set line/column numbers
345            to interface object experimentally.  s/shift/pop/g, shift
346            would make things wrong.  Support for interface forward
347            declarations was missing.  Broken interface declarations
348            with no block were not ignored entirely.
349            (Whatpm::WebIDL::Node): New abstract class.  This class
350            makes things easier.
351            (child_nodes): New attribute.  Unlike DOM's attribute with
352            same name, this attribute returns a dead list of nodes for
353            simplicity.
354            (get_user_data, set_user_data): New methods.
355            (Module idl_text): A SPACE character should be inserted
356            before the |{| character.
357            (Interface idl_text): Support for interface forward declarations.
358            (is_forward_declaration): New attribute.
359    
360    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
361    
362            * WebIDL.pm (type_text): Better serializer.
363    
364    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
365    
366            * WebIDL.pm: Revise forward-compatible parsing so that
367            it now can handle broken extended attributes and as such.
368    
369    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
370    
371            * WebIDL.pm: Real support for extended attributes.
372            Support for extended attributes with arguments.
373    
374    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
375    
376            * WebIDL.pm: Support for |exception| syntax.
377            (Interface->idl_text): Tentative support for inheritances.
378    
379    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
380    
381            * WebIDL.pm: Hierarchical scoped name support was broken.
382            Support for raises, setraises, and getraises syntaxes.
383    
384    2008-07-18  Wakaba  <wakaba@suika.fam.cx>
385    
386            * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
387            proper support for types, extended attributes, and exceptions yet).
388            WebIDL parser, version 1 (no support for exceptions yet,
389            no proper support for extended attributes yet).
390    
391    2008-07-09  Wakaba  <wakaba@suika.fam.cx>
392    
393            * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
394    
395    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
396    
397            * WebIDL.pm: Support for valuetype and const.
398    
399    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
400            
401            * WebIDL.pm: New module.
402    
403    2008-06-15  Wakaba  <wakaba@suika.fam.cx>
404    
405            * Makefile (Entities.html): URI changed.
406    
407    2008-06-08  Wakaba  <wakaba@suika.fam.cx>
408    
409            * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
410    
411    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
412    
413            * HTML.pm.src (_get_next_token): A parse error was missing.
414    
415    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
416    
417            * mklinktypelist.pl: rel=contact is no longer part of the HTML5
418            spec (commented out). (HTML5 revision 1711).
419    
420    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
421    
422            * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
423    
424            * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
425            as UTF-16 (HTML5 revision 1701).
426    
427    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
428    
429            * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
430            1692).
431    
432    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
433    
434            * HTML.pm.src: The secondary insertion mode used when switching
435            to foreign content is the "in body" insertion mode (HTML5 revision
436            1696).
437    
438    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
439    
440            * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
441            revision 1697).
442    
443    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
444    
445            * HTML.pm.src: Support for end-of-file token in foreign content
446            insertion mode (HTML5 revision 1693).  Update SVG camelCase
447            attribute list (HTML5 revision 1700).  <textarea> closes
448            </select> (HTML5 revision 1699).  More start tags close in
449            foreign content insertion mode (HTML5 revision 1698).
450    
451    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
452    
453            * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
454    
455    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
456    
457            * HTML.pm.src: More robust charset parameter detection (HTML5
458            revision 1674).
459    
460    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
461    
462            * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
463            revision 1676).
464    
465    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
466    
467            * HTML.pm.src: Ignore language part of public identifiers for
468            quriks mode detection (HTML5 revision 1679).
469    
470    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
471    
472            * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
473            revision 1685).
474    
475    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
476    
477            * HTML.pm.src: Support for EOF in new states for tags (HTML5
478            revision 1684).
479    
480    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
481    
482            * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
483            work (HTML5 revision 1690).
484    
485    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
486    
487            * HTML.pm.src (_tree_construction_main): Change handling of
488            end tags in head insertion modes (HTML5 revision 1686).
489            (parse_char_string): Bug fix for non-utf8 character string handlings.
490            (parse_char_stream): |ungetc| does not work well for this context.
491    
492    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
493    
494            * HTML.pm.src (parse_byte_string): Redefined to invoke
495            |parse_byte_stream|.
496            (parse_byte_stream): New method.
497    
498  2008-05-18  Wakaba  <wakaba@suika.fam.cx>  2008-05-18  Wakaba  <wakaba@suika.fam.cx>
499    
500          * HTML.pm.src (parse_byte_string): Fix the column number reported          * HTML.pm.src (parse_byte_string): Fix the column number reported

Legend:
Removed from v.1.237  
changed lines
  Added in v.1.309

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24