/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.224 by wakaba, Mon May 5 06:12:43 2008 UTC revision 1.301 by wakaba, Sun Sep 14 03:07:57 2008 UTC
# Line 1  Line 1 
1    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
4            and |manakai_getc_until| to |manakai_read_until| to
5            reduce the number of string copies.
6    
7    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
8    
9            * HTML.pm.src (parse_char_string): Use newly created
10            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
11            standard feature to |open| a string as a filehandle,
12            since Perl's string filehandle seems not supporting |ungetc|
13            method correctly.
14            (parse_char_stream): Define |{getc_until}| method.
15            (DATA_STATE): Experimental support for |getc_until| feature.
16    
17    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
18    
19            * HTML.pm.src: Check points added to newly added branches.
20    
21    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
22    
23            * HTML.pm.src: Remove |{char}|, which is no longer used.
24            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
25            and replaced by |{prev_state}|.
26    
27            * mkhtmlparser.pl: Remove |{char}| feature.
28            Remove |!!!back-next-input-character;| macro.
29    
30    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
31    
32            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
33            entity related tokenizer states in favor of new states
34            implementing the consume character reference algorithm.
35    
36    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
37    
38            * HTML.pm.src: "Consume a character reference" algorithm is
39            now implemented as a tokenizer's state, rather than
40            a method, with minimum changes (more changes will
41            be made, in due course).  "Bogus comment state"'s inner
42            loop gets removed.
43    
44    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
45    
46            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
47            into their own tokenizer states.
48    
49    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
50    
51            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
52            is split into three states.
53    
54    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
55    
56            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
57            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
58            no longer does the tokenizer have to push back next input
59            characters in those states.
60    
61    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
62    
63            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
64            into four states so that no longer does the tokenizer have to push
65            back next input characters in that state.
66    
67    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
68    
69            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
70            which can be used to insert some wrapper between the character
71            stream handle and the tokenizer.  (It is currently not supported
72            for |set_inner_html| for |Element|s).
73    
74    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
75    
76            * HTML.pm.src: Ignore punctuations in charset names.
77    
78    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
79    
80            * ContentChecker.pm: Support for charset-layer error levels.
81    
82            * HTML.pm.src: Don't specify |text| argument for the
83            |chardecode:fallback| error, since it is not the encoding
84            being used alternatively.
85    
86    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
87    
88            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
89    
90    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
91    
92            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
93    
94    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
95    
96            * HTML.pm.src: Bug fix and sync with the spec with regard
97            to after after frameset insertion mode processing (HTML5
98            revision 1909).  Note that the implementation was wrong
99            per the old spec before the r1909 changes.
100    
101    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
102    
103            * HTMLTable.pm: scope=auto algorithm fix synced with the
104            spec (HTML5 revision 2093).
105            ($process_row): Algorithm step numbers synced with the
106            spec (HTML5 revision 2092).
107    
108    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
109    
110            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
111            revision 2094).
112    
113    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
114    
115            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
116    
117    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
118    
119            * HTML.pm.src: '"' and "'" at the end of attribute
120            name (after another attribute) now raise parse error (HTML5
121            revision 2123).  Empty unquoted attribute values are no
122            longer allowed (HTML5 revision 2122).
123    
124    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
125    
126            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
127            revision 2130).
128    
129    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
130    
131            * ContentChecker.pm: |xml:lang| attribute value must be same
132            as |lang| attribute value for HTML elements (HTML5 revision 2062
133            and so on).
134    
135    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
136    
137            * ContentChecker.pm: Error level definition for |xml_id_error|
138            was missing.
139    
140            * URIChecker.pm: The end of the URL should be marked as the
141            error location for an empty path error.  The position
142            between the userinfo and the port components should be
143            marked as the error location for an empty host error.
144    
145    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
146    
147            * URIChecker.pm: Set parameters representing where in the
148            value the error occurs for errors.  Report unknown
149            address format error in warning level, since address
150            formats are rarely added.  Path segments starting with "/.."
151            were misinterpreted as a dot-segment.
152    
153    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
154    
155            * URIChecker.pm (check_iri_reference): Requires
156            |Message::DOM::DOMImplementation|.
157    
158    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
159    
160            * IMTChecker.pm: Updated for the new error reporting architecture.
161    
162            * ContentChecker.pm: Error levels for IMTs are added.
163    
164    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
165    
166            * H2H.pm (_shift_token): Support for unquoted HTML attribute
167            values.
168    
169    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
170    
171            * CacheManifest.pm: Support for new style of error
172            reports.
173    
174            * HTML.pm.src: Set line=1, column=1 to the document node.
175    
176    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
177    
178            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
179            and URL checkers.  Support for more error levels for bogus
180            langauge tag and URL "standards".
181    
182            * LangTag.pm, URIChecker.pm: Support for new style error
183            level reporting.
184    
185    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
186    
187            * ContentChecker.pm: Support for RDF/XML error levels.
188    
189            * HTMLTable.pm, RDFXML.pm: Support for new style of error level
190            specifying.  Error types are revised.
191    
192    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
193    
194            * ContentChecker.pm: All error reporting method calls are
195            renewed.
196    
197    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
198    
199            * HTML.pm.src: All error type names and "text" parameters
200            are revised.  Use new style for "level" specification.
201    
202            * mkhtmlparser.pl: Use new style for "level" specification.
203    
204    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
205    
206            * WebIDL.pm (parse_char_string): Simplified error
207            reporting process for broken ignored valuetype definition.
208            (Valuetype idl_text): Support for special "DOMString" name.
209    
210    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
211    
212            * WebIDL.pm ($get_scoped_name): Append "::::" if the last
213            terminal of the ScopedName is "DOMString", such that whether
214            the last part of the scoped name is "DOMString" or "_DOMString"
215            later.  It is necessary to determine whether a |typedef|
216            definition should be ignored or not.
217            (parse_char_string): Unescape the identifier of
218            exception members.
219            ($resolve): Return undef for builtin types and sequence<T>
220            types (we might not have to do this, however...).
221            (check): Support checking for Exceptions, Valuetypes,
222            and Typedefs.
223            ($serialize_type): Support for "DOMString::::" syntax.
224            (Typedef idl_text): Output Type as "DOMString" if it
225            is really "DOMString" (i.e. its internal representation
226            is "::DOMString::").
227    
228    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
229    
230            * WebIDL.pm ($resolve): New code, based on resolve code
231            for constant types in the |check| method.
232            (check): Support for checking of attributes, operations, and
233            arguments.
234            (Attribute/Operation idl_text): Exception names in getraises,
235            setraises, and raises clauses is serizlied by |$serialize_type|
236            code.
237    
238    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
239    
240            * WebIDL.pm ($integer): Order of selections are changed to match
241            hexadecimal numbers (the original pattern, taken from the spec,
242            was not work for hexadecimal numbers, because the "0" prefix
243            matches to the [0-7]* part (as an empty string) and therefore
244            it does not match with remaining "x..." part of a "0x..." integer
245            literal.
246            ($get_type): It now returns a string, not an array reference,
247            for regular types and |sequence| types (i.e. it in any case
248            returns a string).
249            ($get_next_token): The second item in the array that represents
250            a integer or float token is now a Perl number value, not the
251            original string representation of the number.
252            (check): Support for const value consistency checking.
253            No extended attribute is defined for constants.
254            (Node subclasses): Use simple strings rather than array references
255            for default data type values.
256            ($serialize_type): Type values are now simple strings.
257            (value): If the new attribute value is a false value, then
258            a FALSE value is set to the attribute.
259    
260    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
261    
262            * WebIDL.pm ($get_scoped_name): Now scoped names are stored
263            in its stringified format ("scoped name" as defined in the
264            spec).  Note that future version of this module should not use
265            array references for type values and the |type_text| attribute
266            should be made obsolete.
267            (parse_char_string): Unescape attribute names.
268            (check): Support for checking of whether inherited interfaces
269            are actually defined or not.  Support for checking of whether
270            interface member identifiers are duplicated or not.
271            ($serialize_type): Scoped names are returned as is.  A future
272            version of this code should escape identifiers other than "DOMString",
273            otherwise the idl_text would be non-conforming.
274    
275    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
276    
277            * WebIDL.pm (parse_char_string): Set line/column numbers
278            to generated nodes.  Unescape identifiers.  Extended attributes
279            for Definition's were ignored.
280            (append_child): Set |parent_node| attribute.
281            (parent_node): New attribute.
282            (check): Support interface/exception members.  Support
283            extended attributes.  Support definition identifier uniqueness
284            constraint.
285            (qualified_name): New attribute.
286            (Interface/Exception idl_text): Extended attributes were
287            not prepended to the returned text.
288    
289    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
290    
291            * WebIDL.pm (parse_char_string): Set line/column numbers
292            to interface object experimentally.  s/shift/pop/g, shift
293            would make things wrong.  Support for interface forward
294            declarations was missing.  Broken interface declarations
295            with no block were not ignored entirely.
296            (Whatpm::WebIDL::Node): New abstract class.  This class
297            makes things easier.
298            (child_nodes): New attribute.  Unlike DOM's attribute with
299            same name, this attribute returns a dead list of nodes for
300            simplicity.
301            (get_user_data, set_user_data): New methods.
302            (Module idl_text): A SPACE character should be inserted
303            before the |{| character.
304            (Interface idl_text): Support for interface forward declarations.
305            (is_forward_declaration): New attribute.
306    
307    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
308    
309            * WebIDL.pm (type_text): Better serializer.
310    
311    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
312    
313            * WebIDL.pm: Revise forward-compatible parsing so that
314            it now can handle broken extended attributes and as such.
315    
316    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
317    
318            * WebIDL.pm: Real support for extended attributes.
319            Support for extended attributes with arguments.
320    
321    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
322    
323            * WebIDL.pm: Support for |exception| syntax.
324            (Interface->idl_text): Tentative support for inheritances.
325    
326    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
327    
328            * WebIDL.pm: Hierarchical scoped name support was broken.
329            Support for raises, setraises, and getraises syntaxes.
330    
331    2008-07-18  Wakaba  <wakaba@suika.fam.cx>
332    
333            * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
334            proper support for types, extended attributes, and exceptions yet).
335            WebIDL parser, version 1 (no support for exceptions yet,
336            no proper support for extended attributes yet).
337    
338    2008-07-09  Wakaba  <wakaba@suika.fam.cx>
339    
340            * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
341    
342    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
343    
344            * WebIDL.pm: Support for valuetype and const.
345    
346    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
347            
348            * WebIDL.pm: New module.
349    
350    2008-06-15  Wakaba  <wakaba@suika.fam.cx>
351    
352            * Makefile (Entities.html): URI changed.
353    
354    2008-06-08  Wakaba  <wakaba@suika.fam.cx>
355    
356            * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
357    
358    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
359    
360            * HTML.pm.src (_get_next_token): A parse error was missing.
361    
362    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
363    
364            * mklinktypelist.pl: rel=contact is no longer part of the HTML5
365            spec (commented out). (HTML5 revision 1711).
366    
367    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
368    
369            * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
370    
371            * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
372            as UTF-16 (HTML5 revision 1701).
373    
374    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
375    
376            * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
377            1692).
378    
379    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
380    
381            * HTML.pm.src: The secondary insertion mode used when switching
382            to foreign content is the "in body" insertion mode (HTML5 revision
383            1696).
384    
385    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
386    
387            * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
388            revision 1697).
389    
390    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
391    
392            * HTML.pm.src: Support for end-of-file token in foreign content
393            insertion mode (HTML5 revision 1693).  Update SVG camelCase
394            attribute list (HTML5 revision 1700).  <textarea> closes
395            </select> (HTML5 revision 1699).  More start tags close in
396            foreign content insertion mode (HTML5 revision 1698).
397    
398    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
399    
400            * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
401    
402    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
403    
404            * HTML.pm.src: More robust charset parameter detection (HTML5
405            revision 1674).
406    
407    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
408    
409            * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
410            revision 1676).
411    
412    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
413    
414            * HTML.pm.src: Ignore language part of public identifiers for
415            quriks mode detection (HTML5 revision 1679).
416    
417    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
418    
419            * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
420            revision 1685).
421    
422    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
423    
424            * HTML.pm.src: Support for EOF in new states for tags (HTML5
425            revision 1684).
426    
427    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
428    
429            * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
430            work (HTML5 revision 1690).
431    
432    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
433    
434            * HTML.pm.src (_tree_construction_main): Change handling of
435            end tags in head insertion modes (HTML5 revision 1686).
436            (parse_char_string): Bug fix for non-utf8 character string handlings.
437            (parse_char_stream): |ungetc| does not work well for this context.
438    
439    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
440    
441            * HTML.pm.src (parse_byte_string): Redefined to invoke
442            |parse_byte_stream|.
443            (parse_byte_stream): New method.
444    
445    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
446    
447            * HTML.pm.src (parse_byte_string): Fix the column number reported
448            by encoding layer error reporter.
449    
450    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
451    
452            * HTML.pm.src (parse_byte_string): Use streaming decoder
453            rather than converting the whole byte string and then parsing.
454            Propagate errors in character encoding layer.
455            (get_next_token): Precise error reporting for |bare stago| error.
456    
457    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
458    
459            * HTML.pm.src (parse_char_stream): New method.
460            (parse_char_string): This method is now defined as an invocation
461            of the |parse_char_stream| method.
462    
463    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
464    
465            * HTML.pm.src (parse_byte_string): Report various status
466            of the sniffing as info-level errors.  Support for new
467            decoding framework in parser resestting.
468            (new): Various default error levels were not set.
469    
470    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
471    
472            * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
473            algorithm, except for the actual sniffing, is implemented
474            with new framework with Message::Charset::Info.
475    
476    2008-05-16  Wakaba  <wakaba@suika.fam.cx>
477    
478            * CacheManifest.pm (_parse): Drop fragment identifiers from
479            URIs in fallback section (HTML5 revision 1596).
480    
481    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
482    
483            * Makefile (Entities.html): URI has changed.
484    
485    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
486    
487            * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
488            1553).
489    
490    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
491    
492            * ContentChecker.pm: Noted that those returned in |table| are
493            no longer table elements, but table objects returned
494            by Whatpm::HTMLTable.
495    
496            * HTMLTable.pm (form_table): Return table element node
497            as |$table->{element}|.
498            (assign_header): Support for the |headers=""| attribute.
499    
500    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
501    
502            * HTMLTable.pm (assign_header): New function; first version
503            with no support for headers="".
504            (form_table): Include table width and height to the returned
505            table object for covenience.  Indexing in column assignement was wrong.
506            Set whether a data cell is empty or not for convenience.
507    
508    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
509    
510            * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
511    
512    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
513    
514            * HTMLTable.pm: How table model errors are detected is
515            changed (HTML5 revision 1387).
516    
517    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
518    
519            * HTMLTable.pm: The algorithm now moves |tfoot| elements
520            to the end of the table (HTML5 revision 1380).
521    
522  2008-05-05  Wakaba  <wakaba@suika.fam.cx>  2008-05-05  Wakaba  <wakaba@suika.fam.cx>
523    
524          * HTMLTable.pm: The algorithm is now 0-based indexing, instead          * HTMLTable.pm: The algorithm is now 0-based indexing, instead

Legend:
Removed from v.1.224  
changed lines
  Added in v.1.301

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24