/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.267 by wakaba, Sat Aug 2 15:14:23 2008 UTC revision 1.334 by wakaba, Mon Sep 22 06:04:28 2008 UTC
# Line 1  Line 1 
1    2008-09-22  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src: Character references for non-space C0 characters,
4            including U+000B VT, DEL character, noncharacter code points, are
5            now converted to the U+FFFD character (cf. HTML5 revision 2138).
6    
7    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
8    
9            * ContentChecker.pm: |form=""| check support added.
10    
11    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
12    
13            * ContentChecker.pm: |contextmenu| validness is now checked using
14            |id| and |id_type| properties, and |menu| property is removed.
15    
16    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
17    
18            * ContentChecker.pm: Prepare for |form| |name| attribute's
19            duplication checking.
20    
21    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
22    
23            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
24            unsupported charset="" parameter value (HTML5 revision 2131).
25    
26    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
27    
28            * HTML.pm.src: Reminding places where U+000B is allowed as a space
29            character is fixed (cf. HTML5 revision 1738).
30    
31            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
32            space characters (HTML5 revision 1738).
33    
34    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
35    
36            * HTML.pm.src: The "anything else" case for the "after after body"
37            insertion mode was not updated to swtich to the "in body"
38            insertion mode.  U+000B is no longer a space character for the
39            purpose of tree construction phase (HTML5 revision 1738).
40    
41    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
42    
43            * HTML.pm.src: U+000B is no longer a space character (HTML5
44            revision 1738).
45    
46    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
47    
48            * ContentType.pm: 0x0B is no longer a space character (HTML5
49            revision 1738).
50    
51            * HTML.pm.src: U+000B is no longer a space character for the
52            algorithm for extracting an encoding from a Content-Type (HTML5
53            revision 1738).
54    
55    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
56    
57            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
58    
59    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
60    
61            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
62    
63            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
64            matching.
65    
66    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
67    
68            * LangTag.pm: Warn for private use language subtags.  Error level
69            typos fixed.  Support for Suppress-Script field.
70    
71            * mklangreg.pl: Support for dumping of nested structure.
72    
73    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
74    
75            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
76            recommended case as per RFC 4646.
77    
78    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
79    
80            * LangTag.pm (check_rfc4646_langtag): New method.
81    
82    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
83    
84            * mklangreg.pl: New script.
85    
86            * Makefile: Updated for creation of the module for language subtag
87            registry.
88            
89    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
90    
91            * Makefile: WebIDL.html added.
92    
93            * WebIDL.pod: New documentation.
94    
95    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
96    
97            * WebIDL.pm: Checker's error types are redefined.
98    
99    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
100    
101            * WebIDL.pm: Parser's error types are redefined.  Some forward
102            compatible parsing bugs are fixed.  Some unreachable codes are
103            commented out.
104    
105    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
106    
107            * WebIDL.pm: Support for the reminding extended attributes are
108            added.  It does not satisfy the definition that a forward
109            interface declaration has an extended attribute.  It seems that
110            unless explicitly allowed multiple extended attributes with the
111            same name is not allowed, though it is not explicitly mentioned in
112            the spec.
113    
114    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
115    
116            * WebIDL.pm: Unescapes extended attribute names and extended
117            attribute identifiers.  Preserve whether an extended attribute has
118            an argument list of not.  Support for extended attributes:
119            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
120            NameGetter, NameSetter, and Null.
121            (has_argument_list): New attribute.
122            (idl_text): Stringifies argument lists, if any, even if it is
123            empty.
124    
125    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
126    
127            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
128            |{s_kwd}| in DATA_STATE as default.
129    
130    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
131    
132            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
133            by |{s_kwd}| in DATA_STATE.
134    
135    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
136    
137            * HTML.pm.src: Shorten keys.
138    
139    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
140    
141            * HTML.pm.src: Remove checking for control character, surrogate
142            pair, or noncharacter code points and non-Unicode code
143            points (they should be handled by Whatpm::Charset::UnicodeChecker).
144            (parse_char_stream): Support for the |$get_wrapper| argument and
145            character stream error handlers.
146    
147    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
148    
149            * ContentChecker.pm: Don't call |loda_ns_module|
150            for null-namespace elements/attributes.
151    
152            * HTML.pm.src: Fact out $disallowed_control_chars
153            as a hash.
154    
155    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
156    
157            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
158            and |{next_char}| initializations are moved to initialization
159            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
160            with |parse_char_stream|.
161    
162    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
163    
164            * HTML.pm.src (parse_char_stream): Make |set_next_char|
165            invoke |manakai_read_until|, not only |read|, where
166            possible, to decrease the number of |read| method calls.
167    
168            * mkhtmlparser.pl: Related changes to the aforementioned
169            modification.
170    
171    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
172    
173            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
174            would report character error from now.
175    
176    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
177    
178            * HTML.pm.src: White-space-leaded non-white-space character
179            tokens in "before head insertion mode" was not
180            correctly handled.
181            (set_inner_html): Reimplemented using CharString decodehandle
182            class.  Support for $get_wrapper argument.  Support
183            for |{read_until}| feature.
184    
185    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
186    
187            * HTML.pm.src: Make a "bare ero" error for unknown
188            entities point the "&" character.
189    
190    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
191    
192            * HTML.pm.src: It turns out that U+FFFD don't have to
193            be added to the list of excluded characters.
194    
195    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
196    
197            * HTML.pm.src ($char_onerror): Have character decoder's |line|
198            and |column| a higher priority than the one set by the
199            tokenizer's input handler.
200            ($self->{read_until}): Exclude U+FFFD (but this might
201            not be necessary, since now we do line/column fixup in
202            the character decode handle).
203    
204    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
205    
206            * HTML.pm.src: Use |{read_until}| where possible.
207    
208    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
209    
210            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
211            and |manakai_getc_until| to |manakai_read_until| to
212            reduce the number of string copies.
213    
214    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
215    
216            * HTML.pm.src (parse_char_string): Use newly created
217            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
218            standard feature to |open| a string as a filehandle,
219            since Perl's string filehandle seems not supporting |ungetc|
220            method correctly.
221            (parse_char_stream): Define |{getc_until}| method.
222            (DATA_STATE): Experimental support for |getc_until| feature.
223    
224    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
225    
226            * HTML.pm.src: Check points added to newly added branches.
227    
228    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
229    
230            * HTML.pm.src: Remove |{char}|, which is no longer used.
231            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
232            and replaced by |{prev_state}|.
233    
234            * mkhtmlparser.pl: Remove |{char}| feature.
235            Remove |!!!back-next-input-character;| macro.
236    
237    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
238    
239            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
240            entity related tokenizer states in favor of new states
241            implementing the consume character reference algorithm.
242    
243    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
244    
245            * HTML.pm.src: "Consume a character reference" algorithm is
246            now implemented as a tokenizer's state, rather than
247            a method, with minimum changes (more changes will
248            be made, in due course).  "Bogus comment state"'s inner
249            loop gets removed.
250    
251    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
252    
253            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
254            into their own tokenizer states.
255    
256    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
257    
258            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
259            is split into three states.
260    
261    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
262    
263            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
264            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
265            no longer does the tokenizer have to push back next input
266            characters in those states.
267    
268    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
269    
270            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
271            into four states so that no longer does the tokenizer have to push
272            back next input characters in that state.
273    
274    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
275    
276            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
277            which can be used to insert some wrapper between the character
278            stream handle and the tokenizer.  (It is currently not supported
279            for |set_inner_html| for |Element|s).
280    
281    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
282    
283            * HTML.pm.src: Ignore punctuations in charset names.
284    
285    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
286    
287            * ContentChecker.pm: Support for charset-layer error levels.
288    
289            * HTML.pm.src: Don't specify |text| argument for the
290            |chardecode:fallback| error, since it is not the encoding
291            being used alternatively.
292    
293    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
294    
295            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
296    
297    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
298    
299            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
300    
301    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
302    
303            * HTML.pm.src: Bug fix and sync with the spec with regard
304            to after after frameset insertion mode processing (HTML5
305            revision 1909).  Note that the implementation was wrong
306            per the old spec before the r1909 changes.
307    
308    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
309    
310            * HTMLTable.pm: scope=auto algorithm fix synced with the
311            spec (HTML5 revision 2093).
312            ($process_row): Algorithm step numbers synced with the
313            spec (HTML5 revision 2092).
314    
315    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
316    
317            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
318            revision 2094).
319    
320    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
321    
322            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
323    
324    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
325    
326            * HTML.pm.src: '"' and "'" at the end of attribute
327            name (after another attribute) now raise parse error (HTML5
328            revision 2123).  Empty unquoted attribute values are no
329            longer allowed (HTML5 revision 2122).
330    
331    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
332    
333            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
334            revision 2130).
335    
336    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
337    
338            * ContentChecker.pm: |xml:lang| attribute value must be same
339            as |lang| attribute value for HTML elements (HTML5 revision 2062
340            and so on).
341    
342    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
343    
344            * ContentChecker.pm: Error level definition for |xml_id_error|
345            was missing.
346    
347            * URIChecker.pm: The end of the URL should be marked as the
348            error location for an empty path error.  The position
349            between the userinfo and the port components should be
350            marked as the error location for an empty host error.
351    
352    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
353    
354            * URIChecker.pm: Set parameters representing where in the
355            value the error occurs for errors.  Report unknown
356            address format error in warning level, since address
357            formats are rarely added.  Path segments starting with "/.."
358            were misinterpreted as a dot-segment.
359    
360    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
361    
362            * URIChecker.pm (check_iri_reference): Requires
363            |Message::DOM::DOMImplementation|.
364    
365    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
366    
367            * IMTChecker.pm: Updated for the new error reporting architecture.
368    
369            * ContentChecker.pm: Error levels for IMTs are added.
370    
371    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
372    
373            * H2H.pm (_shift_token): Support for unquoted HTML attribute
374            values.
375    
376    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
377    
378            * CacheManifest.pm: Support for new style of error
379            reports.
380    
381            * HTML.pm.src: Set line=1, column=1 to the document node.
382    
383    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
384    
385            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
386            and URL checkers.  Support for more error levels for bogus
387            langauge tag and URL "standards".
388    
389            * LangTag.pm, URIChecker.pm: Support for new style error
390            level reporting.
391    
392    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
393    
394            * ContentChecker.pm: Support for RDF/XML error levels.
395    
396            * HTMLTable.pm, RDFXML.pm: Support for new style of error level
397            specifying.  Error types are revised.
398    
399    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
400    
401            * ContentChecker.pm: All error reporting method calls are
402            renewed.
403    
404    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
405    
406            * HTML.pm.src: All error type names and "text" parameters
407            are revised.  Use new style for "level" specification.
408    
409            * mkhtmlparser.pl: Use new style for "level" specification.
410    
411    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
412    
413            * WebIDL.pm (parse_char_string): Simplified error
414            reporting process for broken ignored valuetype definition.
415            (Valuetype idl_text): Support for special "DOMString" name.
416    
417    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
418    
419            * WebIDL.pm ($get_scoped_name): Append "::::" if the last
420            terminal of the ScopedName is "DOMString", such that whether
421            the last part of the scoped name is "DOMString" or "_DOMString"
422            later.  It is necessary to determine whether a |typedef|
423            definition should be ignored or not.
424            (parse_char_string): Unescape the identifier of
425            exception members.
426            ($resolve): Return undef for builtin types and sequence<T>
427            types (we might not have to do this, however...).
428            (check): Support checking for Exceptions, Valuetypes,
429            and Typedefs.
430            ($serialize_type): Support for "DOMString::::" syntax.
431            (Typedef idl_text): Output Type as "DOMString" if it
432            is really "DOMString" (i.e. its internal representation
433            is "::DOMString::").
434    
435    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
436    
437            * WebIDL.pm ($resolve): New code, based on resolve code
438            for constant types in the |check| method.
439            (check): Support for checking of attributes, operations, and
440            arguments.
441            (Attribute/Operation idl_text): Exception names in getraises,
442            setraises, and raises clauses is serizlied by |$serialize_type|
443            code.
444    
445  2008-08-02  Wakaba  <wakaba@suika.fam.cx>  2008-08-02  Wakaba  <wakaba@suika.fam.cx>
446    
447          * WebIDL.pm ($integer): Order of selections are changed to match          * WebIDL.pm ($integer): Order of selections are changed to match

Legend:
Removed from v.1.267  
changed lines
  Added in v.1.334

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24