Diff of /markup/html/whatpm/Whatpm/ChangeLog

revision 1.267 by wakaba, Sat Aug 2 15:14:23 2008 UTC revision 1.327 by wakaba, Sat Sep 20 09:28:29 2008 UTC
# Line 1  Line 1 
1    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src: U+000B is no longer a space character (HTML5
4            revision 1738).
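
For illustration, the change above can be sketched as a membership test. This is a Python sketch, not the Perl in HTML.pm.src; the names `HTML5_SPACE_CHARACTERS` and `is_html5_space_character` are hypothetical, and the set assumes that the remaining space characters are U+0009, U+000A, U+000C, U+000D, and U+0020.

```python
# Assumed set of HTML5 "space characters" once U+000B is dropped:
# TAB, LF, FF, CR, and SPACE.
HTML5_SPACE_CHARACTERS = frozenset("\t\n\f\r ")


def is_html5_space_character(ch: str) -> bool:
    """Return True if ch is a single character from the assumed set."""
    return len(ch) == 1 and ch in HTML5_SPACE_CHARACTERS


print(is_html5_space_character("\x0c"))  # U+000C FORM FEED
print(is_html5_space_character("\x0b"))  # U+000B LINE TABULATION
```

Under this assumption U+000C is still treated as a space character while U+000B is not.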
5    
6    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
7    
8            * ContentType.pm: 0x0B is no longer a space character (HTML5
9            revision 1738).
10    
11            * HTML.pm.src: U+000B is no longer a space character for the
12            algorithm for extracting an encoding from a Content-Type (HTML5
13            revision 1738).
14    
15    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
16    
17            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
18    
19    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
20    
21            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
22    
23            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
24            matching.
25    
26    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
27    
28            * LangTag.pm: Warn for private use language subtags.  Error level
29            typos fixed.  Support for Suppress-Script field.
30    
31            * mklangreg.pl: Support for dumping of nested structure.
32    
33    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
34    
35            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
36            recommended case as per RFC 4646.
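
A rough sketch of what a recommended-case check could look like, in Python rather than the Perl of LangTag.pm. It assumes the common length-based convention (two-letter subtags after the first in upper case, four-letter subtags in title case, everything else and everything after a singleton in lower case); how `check_rfc4646_langtag` actually decides this is not stated in the entry, and `recommended_case` and `is_in_recommended_case` are hypothetical names.

```python
def recommended_case(tag: str) -> str:
    """Rewrite a language tag using the assumed length-based convention."""
    result = []
    seen_singleton = False  # once set, all later subtags stay lower case
    for index, subtag in enumerate(tag.split("-")):
        lowered = subtag.lower()
        if index > 0 and not seen_singleton and len(subtag) == 2:
            result.append(lowered.upper())       # region-like subtag
        elif index > 0 and not seen_singleton and len(subtag) == 4:
            result.append(lowered.capitalize())  # script-like subtag
        else:
            result.append(lowered)
        if len(subtag) == 1:
            seen_singleton = True
    return "-".join(result)


def is_in_recommended_case(tag: str) -> bool:
    """True if the tag is already written in the assumed recommended case."""
    return tag == recommended_case(tag)


print(recommended_case("ZH-hant-tw"))
print(is_in_recommended_case("en-us"))
```

A checker built this way would only warn about case, since language tags are compared case-insensitively.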
37    
38    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
39    
40            * LangTag.pm (check_rfc4646_langtag): New method.
41    
42    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
43    
44            * mklangreg.pl: New script.
45    
46            * Makefile: Updated for creation of the module for language subtag
47            registry.
48            
49    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
50    
51            * Makefile: WebIDL.html added.
52    
53            * WebIDL.pod: New documentation.
54    
55    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
56    
57            * WebIDL.pm: Checker's error types are redefined.
58    
59    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
60    
61            * WebIDL.pm: Parser's error types are redefined.  Some
62            forward-compatible parsing bugs are fixed.  Some unreachable code
63            is commented out.
64    
65    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
66    
67            * WebIDL.pm: Support for the remaining extended attributes is
68            added.  It does not satisfy the definition that a forward
69            interface declaration has an extended attribute.  It seems that,
70            unless explicitly allowed, multiple extended attributes with the
71            same name are not allowed, though this is not explicitly mentioned
72            in the spec.
73    
74    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
75    
76            * WebIDL.pm: Unescapes extended attribute names and extended
77            attribute identifiers.  Preserve whether an extended attribute has
78            an argument list or not.  Support for extended attributes:
79            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
80            NameGetter, NameSetter, and Null.
81            (has_argument_list): New attribute.
82            (idl_text): Stringifies argument lists, if any, even if it is
83            empty.
84    
85    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
86    
87            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
88            |{s_kwd}| in DATA_STATE as default.
89    
90    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
91    
92            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
93            by |{s_kwd}| in DATA_STATE.
94    
95    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
96    
97            * HTML.pm.src: Shorten keys.
98    
99    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
100    
101            * HTML.pm.src: Remove checking for control character, surrogate
102            pair, or noncharacter code points and non-Unicode code
103            points (they should be handled by Whatpm::Charset::UnicodeChecker).
104            (parse_char_stream): Support for the |$get_wrapper| argument and
105            character stream error handlers.
106    
107    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
108    
109            * ContentChecker.pm: Don't call |load_ns_module|
110            for null-namespace elements/attributes.
111    
112            * HTML.pm.src: Factor out $disallowed_control_chars
113            as a hash.
114    
115    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
116    
117            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
118            and |{next_char}| initializations are moved to initialization
119            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
120            with |parse_char_stream|.
121    
122    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
123    
124            * HTML.pm.src (parse_char_stream): Make |set_next_char|
125            invoke |manakai_read_until|, not only |read|, where
126            possible, to decrease the number of |read| method calls.
127    
128            * mkhtmlparser.pl: Related changes to the aforementioned
129            modification.
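
The idea of reading a whole run of characters in one call, instead of one |read| per character, can be sketched as follows. This is a hypothetical Python model, not the Perl implementation; the `CharStream` class, its `read_calls` counter, and the `stop_chars` parameter are assumptions made for illustration only.

```python
class CharStream:
    """A toy character stream that counts how often it is asked for data."""

    def __init__(self, text: str) -> None:
        self.text = text
        self.pos = 0
        self.read_calls = 0

    def read(self, count: int = 1) -> str:
        """Return up to count characters and advance the position."""
        self.read_calls += 1
        chunk = self.text[self.pos:self.pos + count]
        self.pos += len(chunk)
        return chunk

    def read_until(self, stop_chars: str) -> str:
        """Return the run of characters before the next stop character."""
        self.read_calls += 1
        start = self.pos
        while self.pos < len(self.text) and self.text[self.pos] not in stop_chars:
            self.pos += 1
        return self.text[start:self.pos]


stream = CharStream("hello <b>")
run = stream.read_until("<&")  # one call for the whole text run
nxt = stream.read(1)           # the stop character is still unread
print(repr(run), repr(nxt), stream.read_calls)
```

In this model a text run of any length costs one call rather than one call per character.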
130    
131    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
132    
133            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
134            now reports character errors.
135    
136    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
137    
138            * HTML.pm.src: Non-white-space character tokens with leading
139            white space in the "before head" insertion mode were not
140            correctly handled.
141            (set_inner_html): Reimplemented using CharString decodehandle
142            class.  Support for $get_wrapper argument.  Support
143            for |{read_until}| feature.
144    
145    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
146    
147            * HTML.pm.src: Make a "bare ero" error for unknown
148            entities point to the "&" character.
149    
150    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
151    
152            * HTML.pm.src: It turns out that U+FFFD doesn't have to
153            be added to the list of excluded characters.
154    
155    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
156    
157            * HTML.pm.src ($char_onerror): Give the character decoder's |line|
158            and |column| higher priority than those set by the
159            tokenizer's input handler.
160            ($self->{read_until}): Exclude U+FFFD (but this might
161            not be necessary, since now we do line/column fixup in
162            the character decode handle).
163    
164    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
165    
166            * HTML.pm.src: Use |{read_until}| where possible.
167    
168    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
169    
170            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
171            and |manakai_getc_until| to |manakai_read_until| to
172            reduce the number of string copies.
173    
174    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
175    
176            * HTML.pm.src (parse_char_string): Use newly created
177            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
178            standard feature to |open| a string as a filehandle,
179            since Perl's string filehandle does not seem to support the
180            |ungetc| method correctly.
181            (parse_char_stream): Define |{getc_until}| method.
182            (DATA_STATE): Experimental support for |getc_until| feature.
183    
184    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
185    
186            * HTML.pm.src: Check points added to newly added branches.
187    
188    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
189    
190            * HTML.pm.src: Remove |{char}|, which is no longer used.
191            Replace |{entity_in_attr}| and |{last_attribute_value_state}|
192            with |{prev_state}|.
193    
194            * mkhtmlparser.pl: Remove |{char}| feature.
195            Remove |!!!back-next-input-character;| macro.
196    
197    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
198    
199            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
200            entity related tokenizer states in favor of new states
201            implementing the consume character reference algorithm.
202    
203    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
204    
205            * HTML.pm.src: "Consume a character reference" algorithm is
206            now implemented as a tokenizer's state, rather than
207            a method, with minimum changes (more changes will
208            be made, in due course).  "Bogus comment state"'s inner
209            loop gets removed.
210    
211    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
212    
213            * HTML.pm.src: Move |PUBLIC| and |SYSTEM| keyword tokenizing
214            into their own tokenizer states.
215    
216    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
217    
218            * HTML.pm.src: |CDATA_SECTION_STATE| (formerly |CDATA_BLOCK_STATE|)
219            is split into three states.
220    
221    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
222    
223            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
224            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
225            the tokenizer no longer has to push back next input
226            characters in those states.
227    
228    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
229    
230            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| is broken
231            into four states so that the tokenizer no longer has to push
232            back next input characters in that state.
233    
234    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
235    
236            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
237            which can be used to insert some wrapper between the character
238            stream handle and the tokenizer.  (It is currently not supported
239            for |set_inner_html| for |Element|s).
240    
241    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
242    
243            * HTML.pm.src: Ignore punctuation in charset names.
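
One way to picture this is a comparison that strips punctuation before matching. The Python sketch below is illustrative only: the exact set of ignored characters used by HTML.pm.src is not given in the entry, so ASCII punctuation (`string.punctuation`) and case folding are assumptions, and `normalize_charset_name` and `charset_names_match` are hypothetical helpers.

```python
import string

# Translation table that deletes every ASCII punctuation character.
_STRIP_PUNCTUATION = str.maketrans("", "", string.punctuation)


def normalize_charset_name(name: str) -> str:
    """Drop ASCII punctuation and lower-case the name for comparison."""
    return name.translate(_STRIP_PUNCTUATION).lower()


def charset_names_match(left: str, right: str) -> bool:
    """Compare two charset names, ignoring punctuation and case."""
    return normalize_charset_name(left) == normalize_charset_name(right)


print(charset_names_match("UTF-8", "utf8"))
print(normalize_charset_name("ISO_8859-1"))
```

With such a comparison, spellings like "UTF-8" and "utf8" would be treated as the same name.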
244    
245    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
246    
247            * ContentChecker.pm: Support for charset-layer error levels.
248    
249            * HTML.pm.src: Don't specify |text| argument for the
250            |chardecode:fallback| error, since it is not the encoding
251            being used alternatively.
252    
253    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
254    
255            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
256    
257    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
258    
259            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
260    
261    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
262    
263            * HTML.pm.src: Bug fix and sync with the spec with regard
264            to after after frameset insertion mode processing (HTML5
265            revision 1909).  Note that the implementation was wrong
266            per the old spec before the r1909 changes.
267    
268    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
269    
270            * HTMLTable.pm: scope=auto algorithm fix synced with the
271            spec (HTML5 revision 2093).
272            ($process_row): Algorithm step numbers synced with the
273            spec (HTML5 revision 2092).
274    
275    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
276    
277            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
278            revision 2094).
279    
280    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
281    
282            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
283    
284    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
285    
286            * HTML.pm.src: '"' and "'" at the end of attribute
287            name (after another attribute) now raise parse error (HTML5
288            revision 2123).  Empty unquoted attribute values are no
289            longer allowed (HTML5 revision 2122).
290    
291    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
292    
293            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
294            revision 2130).
295    
296    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
297    
298            * ContentChecker.pm: |xml:lang| attribute value must be the same
299            as |lang| attribute value for HTML elements (HTML5 revision 2062
300            and so on).
301    
302    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
303    
304            * ContentChecker.pm: Error level definition for |xml_id_error|
305            was missing.
306    
307            * URIChecker.pm: The end of the URL should be marked as the
308            error location for an empty path error.  The position
309            between the userinfo and the port components should be
310            marked as the error location for an empty host error.
311    
312    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
313    
314            * URIChecker.pm: Set parameters representing where in the
315            value the error occurs for errors.  Report unknown
316            address format error in warning level, since address
317            formats are rarely added.  Path segments starting with "/.."
318            were misinterpreted as dot-segments.
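
The dot-segment fix can be illustrated with a small Python sketch: a segment counts as a dot-segment only when it is exactly "." or "..", not when it merely starts with dots. The helper names `is_dot_segment` and `dot_segments` are hypothetical and do not come from URIChecker.pm.

```python
def is_dot_segment(segment: str) -> bool:
    """A dot-segment is exactly "." or "..", nothing longer."""
    return segment in (".", "..")


def dot_segments(path: str) -> list:
    """Return the dot-segments found in a slash-separated path."""
    return [segment for segment in path.split("/") if is_dot_segment(segment)]


print(dot_segments("/a/../b"))   # contains a real dot-segment
print(dot_segments("/a/..b/c"))  # "..b" only starts with dots
```

A prefix test such as "starts with .." would wrongly flag the second path, which is the kind of misinterpretation the entry describes.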
319    
320    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
321    
322            * URIChecker.pm (check_iri_reference): Requires
323            |Message::DOM::DOMImplementation|.
324    
325    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
326    
327            * IMTChecker.pm: Updated for the new error reporting architecture.
328    
329            * ContentChecker.pm: Error levels for IMTs are added.
330    
331    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
332    
333            * H2H.pm (_shift_token): Support for unquoted HTML attribute
334            values.
335    
336    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
337    
338            * CacheManifest.pm: Support for new style of error
339            reports.
340    
341            * HTML.pm.src: Set line=1, column=1 to the document node.
342    
343    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
344    
345            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
346            and URL checkers.  Support for more error levels for bogus
347            language tag and URL "standards".
348    
349            * LangTag.pm, URIChecker.pm: Support for new style error
350            level reporting.
351    
352    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
353    
354            * ContentChecker.pm: Support for RDF/XML error levels.
355    
356            * HTMLTable.pm, RDFXML.pm: Support for new style of error level
357            specifying.  Error types are revised.
358    
359    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
360    
361            * ContentChecker.pm: All error reporting method calls are
362            renewed.
363    
364    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
365    
366            * HTML.pm.src: All error type names and "text" parameters
367            are revised.  Use new style for "level" specification.
368    
369            * mkhtmlparser.pl: Use new style for "level" specification.
370    
371    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
372    
373            * WebIDL.pm (parse_char_string): Simplified error
374            reporting process for broken ignored valuetype definition.
375            (Valuetype idl_text): Support for special "DOMString" name.
376    
377    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
378    
379            * WebIDL.pm ($get_scoped_name): Append "::::" if the last
380            terminal of the ScopedName is "DOMString", so that whether
381            the last part of the scoped name is "DOMString" or "_DOMString"
382            can be determined later.  It is necessary to determine whether a
383            |typedef| definition should be ignored or not.
384            (parse_char_string): Unescape the identifier of
385            exception members.
386            ($resolve): Return undef for builtin types and sequence<T>
387            types (we might not have to do this, however...).
388            (check): Support checking for Exceptions, Valuetypes,
389            and Typedefs.
390            ($serialize_type): Support for "DOMString::::" syntax.
391            (Typedef idl_text): Output Type as "DOMString" if it
392            is really "DOMString" (i.e. its internal representation
393            is "::DOMString::").
394    
395    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
396    
397            * WebIDL.pm ($resolve): New code, based on resolve code
398            for constant types in the |check| method.
399            (check): Support for checking of attributes, operations, and
400            arguments.
401            (Attribute/Operation idl_text): Exception names in getraises,
402            setraises, and raises clauses are serialized by |$serialize_type|
403            code.
404    
405    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
406    
407            * WebIDL.pm ($integer): Order of selections is changed to match
