/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.248 by wakaba, Sun May 25 06:03:56 2008 UTC revision 1.326 by wakaba, Sat Sep 20 07:54:47 2008 UTC
# Line 1  Line 1 
1    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
2    
3            * ContentType.pm: 0x0B is no longer a space character (HTML5
4            revision 1738).
5    
6            * HTML.pm.src: U+000B is no longer a space character for the
7            algorithm for extracting an encoding from a Content-Type (HTML5
8            revision 1738).
9    
10    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
11    
12            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
13    
14    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
15    
16            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
17    
18            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
19            matching.
20    
21    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
22    
23            * LangTag.pm: Warn for private use language subtags.  Error level
24            typos fixed.  Support for Suppress-Script field.
25    
26            * mklangreg.pl: Support for dumping of nested structure.
27    
28    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
29    
30            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
31            recommended case as per RFC 4646.
32    
33    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
34    
35            * LangTag.pm (check_rfc4646_langtag): New method.
36    
37    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
38    
39            * mklangreg.pl: New script.
40    
41            * Makefile: Updated for creation of the module for language subtag
42            registry.
43            
44    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
45    
46            * Makefile: WebIDL.html added.
47    
48            * WebIDL.pod: New documentation.
49    
50    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
51    
52            * WebIDL.pm: Checker's error types are redefined.
53    
54    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
55    
56            * WebIDL.pm: Parser's error types are redefined.  Some forward
57            compatible parsing bugs are fixed.  Some unreachable codes are
58            commented out.
59    
60    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
61    
62            * WebIDL.pm: Support for the reminding extended attributes are
63            added.  It does not satisfy the definition that a forward
64            interface declaration has an extended attribute.  It seems that
65            unless explicitly allowed multiple extended attributes with the
66            same name is not allowed, though it is not explicitly mentioned in
67            the spec.
68    
69    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
70    
71            * WebIDL.pm: Unescapes extended attribute names and extended
72            attribute identifiers.  Preserve whether an extended attribute has
73            an argument list of not.  Support for extended attributes:
74            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
75            NameGetter, NameSetter, and Null.
76            (has_argument_list): New attribute.
77            (idl_text): Stringifies argument lists, if any, even if it is
78            empty.
79    
80    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
81    
82            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
83            |{s_kwd}| in DATA_STATE as default.
84    
85    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
86    
87            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
88            by |{s_kwd}| in DATA_STATE.
89    
90    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
91    
92            * HTML.pm.src: Shorten keys.
93    
94    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
95    
96            * HTML.pm.src: Remove checking for control character, surrogate
97            pair, or noncharacter code points and non-Unicode code
98            points (they should be handled by Whatpm::Charset::UnicodeChecker).
99            (parse_char_stream): Support for the |$get_wrapper| argument and
100            character stream error handlers.
101    
102    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
103    
104            * ContentChecker.pm: Don't call |loda_ns_module|
105            for null-namespace elements/attributes.
106    
107            * HTML.pm.src: Fact out $disallowed_control_chars
108            as a hash.
109    
110    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
111    
112            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
113            and |{next_char}| initializations are moved to initialization
114            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
115            with |parse_char_stream|.
116    
117    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
118    
119            * HTML.pm.src (parse_char_stream): Make |set_next_char|
120            invoke |manakai_read_until|, not only |read|, where
121            possible, to decrease the number of |read| method calls.
122    
123            * mkhtmlparser.pl: Related changes to the aforementioned
124            modification.
125    
126    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
127    
128            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
129            would report character error from now.
130    
131    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
132    
133            * HTML.pm.src: White-space-leaded non-white-space character
134            tokens in "before head insertion mode" was not
135            correctly handled.
136            (set_inner_html): Reimplemented using CharString decodehandle
137            class.  Support for $get_wrapper argument.  Support
138            for |{read_until}| feature.
139    
140    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
141    
142            * HTML.pm.src: Make a "bare ero" error for unknown
143            entities point the "&" character.
144    
145    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
146    
147            * HTML.pm.src: It turns out that U+FFFD don't have to
148            be added to the list of excluded characters.
149    
150    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
151    
152            * HTML.pm.src ($char_onerror): Have character decoder's |line|
153            and |column| a higher priority than the one set by the
154            tokenizer's input handler.
155            ($self->{read_until}): Exclude U+FFFD (but this might
156            not be necessary, since now we do line/column fixup in
157            the character decode handle).
158    
159    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
160    
161            * HTML.pm.src: Use |{read_until}| where possible.
162    
163    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
164    
165            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
166            and |manakai_getc_until| to |manakai_read_until| to
167            reduce the number of string copies.
168    
169    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
170    
171            * HTML.pm.src (parse_char_string): Use newly created
172            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
173            standard feature to |open| a string as a filehandle,
174            since Perl's string filehandle seems not supporting |ungetc|
175            method correctly.
176            (parse_char_stream): Define |{getc_until}| method.
177            (DATA_STATE): Experimental support for |getc_until| feature.
178    
179    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
180    
181            * HTML.pm.src: Check points added to newly added branches.
182    
183    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
184    
185            * HTML.pm.src: Remove |{char}|, which is no longer used.
186            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
187            and replaced by |{prev_state}|.
188    
189            * mkhtmlparser.pl: Remove |{char}| feature.
190            Remove |!!!back-next-input-character;| macro.
191    
192    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
193    
194            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
195            entity related tokenizer states in favor of new states
196            implementing the consume character reference algorithm.
197    
198    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
199    
200            * HTML.pm.src: "Consume a character reference" algorithm is
201            now implemented as a tokenizer's state, rather than
202            a method, with minimum changes (more changes will
203            be made, in due course).  "Bogus comment state"'s inner
204            loop gets removed.
205    
206    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
207    
208            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
209            into their own tokenizer states.
210    
211    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
212    
213            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
214            is split into three states.
215    
216    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
217    
218            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
219            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
220            no longer does the tokenizer have to push back next input
221            characters in those states.
222    
223    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
224    
225            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
226            into four states so that no longer does the tokenizer have to push
227            back next input characters in that state.
228    
229    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
230    
231            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
232            which can be used to insert some wrapper between the character
233            stream handle and the tokenizer.  (It is currently not supported
234            for |set_inner_html| for |Element|s).
235    
236    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
237    
238            * HTML.pm.src: Ignore punctuations in charset names.
239    
240    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
241    
242            * ContentChecker.pm: Support for charset-layer error levels.
243    
244            * HTML.pm.src: Don't specify |text| argument for the
245            |chardecode:fallback| error, since it is not the encoding
246            being used alternatively.
247    
248    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
249    
250            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
251    
252    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
253    
254            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
255    
256    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
257    
258            * HTML.pm.src: Bug fix and sync with the spec with regard
259            to after after frameset insertion mode processing (HTML5
260            revision 1909).  Note that the implementation was wrong
261            per the old spec before the r1909 changes.
262    
263    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
264    
265            * HTMLTable.pm: scope=auto algorithm fix synced with the
266            spec (HTML5 revision 2093).
267            ($process_row): Algorithm step numbers synced with the
268            spec (HTML5 revision 2092).
269    
270    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
271    
272            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
273            revision 2094).
274    
275    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
276    
277            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
278    
279    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
280    
281            * HTML.pm.src: '"' and "'" at the end of attribute
282            name (after another attribute) now raise parse error (HTML5
283            revision 2123).  Empty unquoted attribute values are no
284            longer allowed (HTML5 revision 2122).
285    
286    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
287    
288            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
289            revision 2130).
290    
291    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
292    
293            * ContentChecker.pm: |xml:lang| attribute value must be same
294            as |lang| attribute value for HTML elements (HTML5 revision 2062
295            and so on).
296    
297    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
298    
299            * ContentChecker.pm: Error level definition for |xml_id_error|
300            was missing.
301    
302            * URIChecker.pm: The end of the URL should be marked as the
303            error location for an empty path error.  The position
304            between the userinfo and the port components should be
305            marked as the error location for an empty host error.
306    
307    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
308    
309            * URIChecker.pm: Set parameters representing where in the
310            value the error occurs for errors.  Report unknown
311            address format error in warning level, since address
312            formats are rarely added.  Path segments starting with "/.."
313            were misinterpreted as a dot-segment.
314    
315    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
316    
317            * URIChecker.pm (check_iri_reference): Requires
318            |Message::DOM::DOMImplementation|.
319    
320    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
321    
322            * IMTChecker.pm: Updated for the new error reporting architecture.
323    
324            * ContentChecker.pm: Error levels for IMTs are added.
325    
326    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
327    
328            * H2H.pm (_shift_token): Support for unquoted HTML attribute
329            values.
330    
331    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
332    
333            * CacheManifest.pm: Support for new style of error
334            reports.
335    
336            * HTML.pm.src: Set line=1, column=1 to the document node.
337    
338    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
339    
340            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
341            and URL checkers.  Support for more error levels for bogus
342            langauge tag and URL "standards".
343    
344            * LangTag.pm, URIChecker.pm: Support for new style error
345            level reporting.
346    
347    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
348    
349            * ContentChecker.pm: Support for RDF/XML error levels.
350    
351            * HTMLTable.pm, RDFXML.pm: Support for new style of error level
352            specifying.  Error types are revised.
353    
354    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
355    
356            * ContentChecker.pm: All error reporting method calls are
357            renewed.
358    
359    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
360    
361            * HTML.pm.src: All error type names and "text" parameters
362            are revised.  Use new style for "level" specification.
363    
364            * mkhtmlparser.pl: Use new style for "level" specification.
365    
366    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
367    
368            * WebIDL.pm (parse_char_string): Simplified error
369            reporting process for broken ignored valuetype definition.
370            (Valuetype idl_text): Support for special "DOMString" name.
371    
372    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
373    
374            * WebIDL.pm ($get_scoped_name): Append "::::" if the last
375            terminal of the ScopedName is "DOMString", such that whether
376            the last part of the scoped name is "DOMString" or "_DOMString"
377            later.  It is necessary to determine whether a |typedef|
378            definition should be ignored or not.
379            (parse_char_string): Unescape the identifier of
380            exception members.
381            ($resolve): Return undef for builtin types and sequence<T>
382            types (we might not have to do this, however...).
383            (check): Support checking for Exceptions, Valuetypes,
384            and Typedefs.
385            ($serialize_type): Support for "DOMString::::" syntax.
386            (Typedef idl_text): Output Type as "DOMString" if it
387            is really "DOMString" (i.e. its internal representation
388            is "::DOMString::").
389    
390    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
391    
392            * WebIDL.pm ($resolve): New code, based on resolve code
393            for constant types in the |check| method.
394            (check): Support for checking of attributes, operations, and
395            arguments.
396            (Attribute/Operation idl_text): Exception names in getraises,
397            setraises, and raises clauses is serizlied by |$serialize_type|
398            code.
399    
400    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
401    
402            * WebIDL.pm ($integer): Order of selections are changed to match
403            hexadecimal numbers (the original pattern, taken from the spec,
404            was not work for hexadecimal numbers, because the "0" prefix
405            matches to the [0-7]* part (as an empty string) and therefore
406            it does not match with remaining "x..." part of a "0x..." integer
407            literal.
408            ($get_type): It now returns a string, not an array reference,
409            for regular types and |sequence| types (i.e. it in any case
410            returns a string).
411            ($get_next_token): The second item in the array that represents
412            a integer or float token is now a Perl number value, not the
413            original string representation of the number.
414            (check): Support for const value consistency checking.
415            No extended attribute is defined for constants.
416            (Node subclasses): Use simple strings rather than array references
417            for default data type values.
418            ($serialize_type): Type values are now simple strings.
419            (value): If the new attribute value is a false value, then
420            a FALSE value is set to the attribute.
421    
422    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
423    
424            * WebIDL.pm ($get_scoped_name): Now scoped names are stored
425            in its stringified format ("scoped name" as defined in the
426            spec).  Note that future version of this module should not use
427            array references for type values and the |type_text| attribute
428            should be made obsolete.
429            (parse_char_string): Unescape attribute names.
430            (check): Support for checking of whether inherited interfaces
431            are actually defined or not.  Support for checking of whether
432            interface member identifiers are duplicated or not.
433            ($serialize_type): Scoped names are returned as is.  A future
434            version of this code should escape identifiers other than "DOMString",
435            otherwise the idl_text would be non-conforming.
436    
437    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
438    
439            * WebIDL.pm (parse_char_string): Set line/column numbers
440            to generated nodes.  Unescape identifiers.  Extended attributes
441            for Definition's were ignored.
442            (append_child): Set |parent_node| attribute.
443            (parent_node): New attribute.
444            (check): Support interface/exception members.  Support
445            extended attributes.  Support definition identifier uniqueness
446            constraint.
447            (qualified_name): New attribute.
448            (Interface/Exception idl_text): Extended attributes were
449            not prepended to the returned text.
450    
451    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
452    
453            * WebIDL.pm (parse_char_string): Set line/column numbers
454            to interface object experimentally.  s/shift/pop/g, shift
455            would make things wrong.  Support for interface forward
456            declarations was missing.  Broken interface declarations
457            with no block were not ignored entirely.
458            (Whatpm::WebIDL::Node): New abstract class.  This class
459            makes things easier.
460            (child_nodes): New attribute.  Unlike DOM's attribute with
461            same name, this attribute returns a dead list of nodes for
462            simplicity.
463            (get_user_data, set_user_data): New methods.
464            (Module idl_text): A SPACE character should be inserted
465            before the |{| character.
466            (Interface idl_text): Support for interface forward declarations.
467            (is_forward_declaration): New attribute.
468    
469    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
470    
471            * WebIDL.pm (type_text): Better serializer.
472    
473    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
474    
475            * WebIDL.pm: Revise forward-compatible parsing so that
476            it now can handle broken extended attributes and as such.
477    
478    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
479    
480            * WebIDL.pm: Real support for extended attributes.
481            Support for extended attributes with arguments.
482    
483    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
484    
485            * WebIDL.pm: Support for |exception| syntax.
486            (Interface->idl_text): Tentative support for inheritances.
487    
488    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
489    
490            * WebIDL.pm: Hierarchical scoped name support was broken.
491            Support for raises, setraises, and getraises syntaxes.
492    
493    2008-07-18  Wakaba  <wakaba@suika.fam.cx>
494    
495            * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
496            proper support for types, extended attributes, and exceptions yet).
497            WebIDL parser, version 1 (no support for exceptions yet,
498            no proper support for extended attributes yet).
499    
500    2008-07-09  Wakaba  <wakaba@suika.fam.cx>
501    
502            * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
503    
504    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
505    
506            * WebIDL.pm: Support for valuetype and const.
507    
508    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
509            
510            * WebIDL.pm: New module.
511    
512    2008-06-15  Wakaba  <wakaba@suika.fam.cx>
513    
514            * Makefile (Entities.html): URI changed.
515    
516    2008-06-08  Wakaba  <wakaba@suika.fam.cx>
517    
518            * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
519    
520    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
521    
522            * HTML.pm.src (_get_next_token): A parse error was missing.
523    
524    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
525    
526            * mklinktypelist.pl: rel=contact is no longer part of the HTML5
527            spec (commented out). (HTML5 revision 1711).
528    
529    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
530    
531            * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
532    
533            * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
534            as UTF-16 (HTML5 revision 1701).
535    
536    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
537    
538            * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
539            1692).
540    
541    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
542    
543            * HTML.pm.src: The secondary insertion mode used when switching
544            to foreign content is the "in body" insertion mode (HTML5 revision
545            1696).
546    
547  2008-05-25  Wakaba  <wakaba@suika.fam.cx>  2008-05-25  Wakaba  <wakaba@suika.fam.cx>
548    
549          * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5          * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5

Legend:
Removed from v.1.248  
changed lines
  Added in v.1.326

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24