Diff of /markup/html/whatpm/Whatpm/ChangeLog

revision 1.266 by wakaba, Sat Aug 2 12:51:52 2008 UTC revision 1.324 by wakaba, Thu Sep 18 14:32:48 2008 UTC
# Line 1  Line 1 
1    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
2    
3            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
4    
5            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
6            matching.
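
A minimal sketch of why length-sorted 'Prefix' values ease matching, written in Python rather than Perl. The function names and the longest-first order are assumptions for illustration; the log does not say which direction mklangreg.pl sorts in.

```python
def sort_prefixes(prefixes):
    """Sort prefixes longest first (assumed order), ties broken alphabetically."""
    return sorted(prefixes, key=lambda p: (-len(p), p))


def best_prefix(tag, prefixes):
    """Return the first (hence longest) prefix that the tag starts with.

    A prefix matches only on a subtag boundary: the tag either equals the
    prefix or continues with "-". Comparison is case-insensitive.
    """
    tag_l = tag.lower()
    for prefix in sort_prefixes(prefixes):
        prefix_l = prefix.lower()
        if tag_l == prefix_l or tag_l.startswith(prefix_l + "-"):
            return prefix
    return None


# Illustrative values only.
print(best_prefix("sl-rozaj-biske", ["sl", "sl-rozaj"]))
```

With the list sorted this way, the first hit is already the most specific prefix, so the matching loop can stop early.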
7    
8    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
9    
10            * LangTag.pm: Warn for private use language subtags.  Error level
11            typos fixed.  Support for Suppress-Script field.
12    
13            * mklangreg.pl: Support for dumping of nested structure.
14    
15    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
16    
17            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
18            recommended case as per RFC 4646.
19    
20    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
21    
22            * LangTag.pm (check_rfc4646_langtag): New method.
23    
24    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
25    
26            * mklangreg.pl: New script.
27    
28            * Makefile: Updated for creation of the module for language subtag
29            registry.
30            
31    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
32    
33            * Makefile: WebIDL.html added.
34    
35            * WebIDL.pod: New documentation.
36    
37    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
38    
39            * WebIDL.pm: Checker's error types are redefined.
40    
41    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
42    
43            * WebIDL.pm: Parser's error types are redefined.  Some forward
44            compatible parsing bugs are fixed.  Some unreachable code is
45            commented out.
46    
47    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
48    
49            * WebIDL.pm: Support for the remaining extended attributes is
50            added.  It does not satisfy the definition that a forward
51            interface declaration has an extended attribute.  It seems that,
52            unless explicitly allowed, multiple extended attributes with the
53            same name are not allowed, though it is not explicitly mentioned in
54            the spec.
55    
56    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
57    
58            * WebIDL.pm: Unescape extended attribute names and extended
59            attribute identifiers.  Preserve whether an extended attribute has
60            an argument list or not.  Support for extended attributes:
61            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
62            NameGetter, NameSetter, and Null.
63            (has_argument_list): New attribute.
64            (idl_text): Stringify argument lists, if any, even if they are
65            empty.
66    
67    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
68    
69            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
70            |{s_kwd}| in DATA_STATE as default.
71    
72    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
73    
74            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
75            by |{s_kwd}| in DATA_STATE.
76    
77    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
78    
79            * HTML.pm.src: Shorten keys.
80    
81    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
82    
83            * HTML.pm.src: Remove checking for control character, surrogate
84            pair, or noncharacter code points and non-Unicode code
85            points (they should be handled by Whatpm::Charset::UnicodeChecker).
86            (parse_char_stream): Support for the |$get_wrapper| argument and
87            character stream error handlers.
88    
89    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
90    
91            * ContentChecker.pm: Don't call |load_ns_module|
92            for null-namespace elements/attributes.
93    
94            * HTML.pm.src: Factor out $disallowed_control_chars
95            as a hash.
96    
97    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
98    
99            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
100            and |{next_char}| initializations are moved to initialization
101            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
102            with |parse_char_stream|.
103    
104    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
105    
106            * HTML.pm.src (parse_char_stream): Make |set_next_char|
107            invoke |manakai_read_until|, not only |read|, where
108            possible, to decrease the number of |read| method calls.
109    
110            * mkhtmlparser.pl: Related changes to the aforementioned
111            modification.
112    
113    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
114    
115            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
116            now reports character errors.
117    
118    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
119    
120            * HTML.pm.src: Non-white-space character tokens with leading
121            white space in the "before head" insertion mode were not
122            correctly handled.
123            (set_inner_html): Reimplemented using CharString decodehandle
124            class.  Support for $get_wrapper argument.  Support
125            for |{read_until}| feature.
126    
127    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
128    
129            * HTML.pm.src: Make a "bare ero" error for unknown
130            entities point to the "&" character.
131    
132    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
133    
134            * HTML.pm.src: It turns out that U+FFFD doesn't have to
135            be added to the list of excluded characters.
136    
137    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
138    
139            * HTML.pm.src ($char_onerror): Give the character decoder's |line|
140            and |column| a higher priority than those set by the
141            tokenizer's input handler.
142            ($self->{read_until}): Exclude U+FFFD (but this might
143            not be necessary, since now we do line/column fixup in
144            the character decode handle).
145    
146    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
147    
148            * HTML.pm.src: Use |{read_until}| where possible.
149    
150    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
151    
152            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
153            and |manakai_getc_until| to |manakai_read_until| to
154            reduce the number of string copies.
155    
156    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
157    
158            * HTML.pm.src (parse_char_string): Use newly created
159            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
160            standard feature to |open| a string as a filehandle,
161            since Perl's string filehandle does not seem to support the
162            |ungetc| method correctly.
163            (parse_char_stream): Define |{getc_until}| method.
164            (DATA_STATE): Experimental support for |getc_until| feature.
165    
166    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
167    
168            * HTML.pm.src: Check points added to newly added branches.
169    
170    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
171    
172            * HTML.pm.src: Remove |{char}|, which is no longer used.
173            Remove |{entity_in_attr}| and |{last_attribute_value_state}|,
174            which are replaced by |{prev_state}|.
175    
176            * mkhtmlparser.pl: Remove |{char}| feature.
177            Remove |!!!back-next-input-character;| macro.
178    
179    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
180    
181            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
182            entity related tokenizer states in favor of new states
183            implementing the consume character reference algorithm.
184    
185    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
186    
187            * HTML.pm.src: "Consume a character reference" algorithm is
188            now implemented as a tokenizer's state, rather than
189            a method, with minimum changes (more changes will
190            be made, in due course).  "Bogus comment state"'s inner
191            loop gets removed.
192    
193    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
194    
195            * HTML.pm.src: Move |PUBLIC| and |SYSTEM| keyword tokenizing
196            into their own tokenizer states.
197    
198    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
199    
200            * HTML.pm.src: |CDATA_SECTION_STATE| (formerly |CDATA_BLOCK_STATE|)
201            is split into three states.
202    
203    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
204    
205            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
206            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
207            the tokenizer no longer has to push back next input
208            characters in those states.
209    
210    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
211    
212            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| is broken
213            into four states so that the tokenizer no longer has to push
214            back next input characters in that state.
215    
216    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
217    
218            * HTML.pm.src: Methods now accept an additional parameter, $get_wrapper,
219            which can be used to insert some wrapper between the character
220            stream handle and the tokenizer.  (It is currently not supported
221            for |set_inner_html| for |Element|s).
222    
223    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
224    
225            * HTML.pm.src: Ignore punctuation in charset names.
226    
227    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
228    
229            * ContentChecker.pm: Support for charset-layer error levels.
230    
231            * HTML.pm.src: Don't specify |text| argument for the
232            |chardecode:fallback| error, since it is not the encoding
233            being used alternatively.
234    
235    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
236    
237            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
238    
239    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
240    
241            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
242    
243    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
244    
245            * HTML.pm.src: Bug fix and sync with the spec with regard
246            to after after frameset insertion mode processing (HTML5
247            revision 1909).  Note that the implementation was wrong
248            per the old spec before the r1909 changes.
249    
250    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
251    
252            * HTMLTable.pm: scope=auto algorithm fix synced with the
253            spec (HTML5 revision 2093).
254            ($process_row): Algorithm step numbers synced with the
255            spec (HTML5 revision 2092).
256    
257    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
258    
259            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
260            revision 2094).
261    
262    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
263    
264            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
265    
266    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
267    
268            * HTML.pm.src: '"' and "'" at the end of attribute
269            name (after another attribute) now raise parse error (HTML5
270            revision 2123).  Empty unquoted attribute values are no
271            longer allowed (HTML5 revision 2122).
272    
273    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
274    
275            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
276            revision 2130).
277    
278    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
279    
280            * ContentChecker.pm: |xml:lang| attribute value must be the same
281            as |lang| attribute value for HTML elements (HTML5 revision 2062
282            and so on).
283    
284    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
285    
286            * ContentChecker.pm: Error level definition for |xml_id_error|
287            was missing.
288    
289            * URIChecker.pm: The end of the URL should be marked as the
290            error location for an empty path error.  The position
291            between the userinfo and the port components should be
292            marked as the error location for an empty host error.
293    
294    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
295    
296            * URIChecker.pm: Set parameters representing where in the
297            value the error occurs for errors.  Report unknown
298            address format error in warning level, since address
299            formats are rarely added.  Path segments starting with "/.."
300            were misinterpreted as a dot-segment.
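
The dot-segment fix above can be illustrated with a short sketch. It is written in Python rather than Perl and is not the URIChecker.pm code; the function name is hypothetical, and the assumption is that a segment counts as a dot-segment only when it is exactly "." or "..", not merely when it begins with "..".

```python
def dot_segments(path):
    """Return (index, segment) pairs for segments that are exactly "." or "..".

    Segments such as "..b" or "...": start with dots but are ordinary
    path segments, so they are not reported.
    """
    return [
        (index, segment)
        for index, segment in enumerate(path.split("/"))
        if segment in (".", "..")
    ]


print(dot_segments("/a/../b/./c"))  # real dot-segments are reported
print(dot_segments("/a/..b/c"))     # "..b" is an ordinary segment
```

Comparing whole segments, rather than testing for a "/.." substring, avoids the false positive described in the entry.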
301    
302    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
303    
304            * URIChecker.pm (check_iri_reference): Requires
305            |Message::DOM::DOMImplementation|.
306    
307    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
308    
309            * IMTChecker.pm: Updated for the new error reporting architecture.
310    
311            * ContentChecker.pm: Error levels for IMTs are added.
312    
313    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
314    
315            * H2H.pm (_shift_token): Support for unquoted HTML attribute
316            values.
317    
318    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
319    
320            * CacheManifest.pm: Support for new style of error
321            reports.
322    
323            * HTML.pm.src: Set line=1, column=1 to the document node.
324    
325    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
326    
327            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
328            and URL checkers.  Support for more error levels for bogus
329            language tag and URL "standards".
330    
331            * LangTag.pm, URIChecker.pm: Support for new style error
332            level reporting.
333    
334    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
335    
336            * ContentChecker.pm: Support for RDF/XML error levels.
337    
338            * HTMLTable.pm, RDFXML.pm: Support for new style of error level
339            specifying.  Error types are revised.
340    
341    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
342    
343            * ContentChecker.pm: All error reporting method calls are
344            renewed.
345    
346    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
347    
348            * HTML.pm.src: All error type names and "text" parameters
349            are revised.  Use new style for "level" specification.
350    
351            * mkhtmlparser.pl: Use new style for "level" specification.
352    
353    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
354    
355            * WebIDL.pm (parse_char_string): Simplified error
356            reporting process for broken ignored valuetype definition.
357            (Valuetype idl_text): Support for special "DOMString" name.
358    
359    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
360    
361            * WebIDL.pm ($get_scoped_name): Append "::::" if the last
362            terminal of the ScopedName is "DOMString", so that whether
363            the last part of the scoped name is "DOMString" or "_DOMString"
364            can be determined later.  It is necessary to determine whether a
365            |typedef| definition should be ignored or not.
366            (parse_char_string): Unescape the identifier of
367            exception members.
368            ($resolve): Return undef for builtin types and sequence<T>
369            types (we might not have to do this, however...).
370            (check): Support checking for Exceptions, Valuetypes,
371            and Typedefs.
372            ($serialize_type): Support for "DOMString::::" syntax.
373            (Typedef idl_text): Output Type as "DOMString" if it
374            is really "DOMString" (i.e. its internal representation
375            is "::DOMString::").
376    
377    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
378    
379            * WebIDL.pm ($resolve): New code, based on resolve code
380            for constant types in the |check| method.
381            (check): Support for checking of attributes, operations, and
382            arguments.
383            (Attribute/Operation idl_text): Exception names in getraises,
384            setraises, and raises clauses are serialized by |$serialize_type|
385            code.
386    
387    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
388    
389            * WebIDL.pm ($integer): Order of alternatives is changed to match
390            hexadecimal numbers (the original pattern, taken from the spec,
391            did not work for hexadecimal numbers, because after the "0" prefix
392            the [0-7]* part matches (as an empty string) and therefore
393            the pattern does not match the remaining "x..." part of a "0x..."
394            integer literal).
395            ($get_type): It now returns a string, not an array reference,
396            for regular types and |sequence| types (i.e. it in any case
397            returns a string).
398            ($get_next_token): The second item in the array that represents
399            an integer or float token is now a Perl number value, not the
400            original string representation of the number.
401            (check): Support for const value consistency checking.
402            No extended attribute is defined for constants.
403            (Node subclasses): Use simple strings rather than array references
404            for default data type values.
405            ($serialize_type): Type values are now simple strings.
406            (value): If the new attribute value is a false value, then
407            a FALSE value is set to the attribute.
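
The alternation-order problem in the ($integer) item of this entry can be reproduced in a small sketch. It is written in Python rather than Perl, and both patterns are hypothetical reconstructions from the description; the actual `$integer` pattern in WebIDL.pm is not shown in this log.

```python
import re

# Assumed spec-style order: after "0", the octal alternative [0-7]* is tried
# first and succeeds with an empty match, so the match stops before "x".
SPEC_ORDER = re.compile(r"-?0([0-7]*|[Xx][0-9A-Fa-f]+)|-?[1-9][0-9]*")

# Same alternatives with the hexadecimal branch tried first.
HEX_FIRST = re.compile(r"-?0([Xx][0-9A-Fa-f]+|[0-7]*)|-?[1-9][0-9]*")


def leading_integer(pattern, text):
    """Return the integer literal matched at the start of text, or None."""
    match = pattern.match(text)
    return match.group(0) if match else None


for literal in ("0x1F", "017", "42"):
    print(literal, leading_integer(SPEC_ORDER, literal),
          leading_integer(HEX_FIRST, literal))
```

Regex alternation in both Perl and Python takes the first alternative that allows a match, not the longest one, which is why reordering the branches is enough.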
408    
409    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
410    
411            * WebIDL.pm ($get_scoped_name): Now scoped names are stored
