/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.275 by wakaba, Sat Aug 16 07:35:22 2008 UTC revision 1.331 by wakaba, Sun Sep 21 09:45:02 2008 UTC
# Line 1  Line 1 
1    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
2    
3            * ContentChecker.pm: Prepare for |form| |name| attribute's
4            duplication checking.
5    
6    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
7    
8            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
9            unsupported charset="" parameter value (HTML5 revision 2131).
10    
11    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
12    
13            * HTML.pm.src: Reminding places where U+000B is allowed as a space
14            character is fixed (cf. HTML5 revision 1738).
15    
16            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
17            space characters (HTML5 revision 1738).
18    
19    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
20    
21            * HTML.pm.src: The "anything else" case for the "after after body"
22            insertion mode was not updated to swtich to the "in body"
23            insertion mode.  U+000B is no longer a space character for the
24            purpose of tree construction phase (HTML5 revision 1738).
25    
26    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
27    
28            * HTML.pm.src: U+000B is no longer a space character (HTML5
29            revision 1738).
30    
31    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
32    
33            * ContentType.pm: 0x0B is no longer a space character (HTML5
34            revision 1738).
35    
36            * HTML.pm.src: U+000B is no longer a space character for the
37            algorithm for extracting an encoding from a Content-Type (HTML5
38            revision 1738).
39    
40    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
41    
42            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
43    
44    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
45    
46            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
47    
48            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
49            matching.
50    
51    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
52    
53            * LangTag.pm: Warn for private use language subtags.  Error level
54            typos fixed.  Support for Suppress-Script field.
55    
56            * mklangreg.pl: Support for dumping of nested structure.
57    
58    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
59    
60            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
61            recommended case as per RFC 4646.
62    
63    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
64    
65            * LangTag.pm (check_rfc4646_langtag): New method.
66    
67    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
68    
69            * mklangreg.pl: New script.
70    
71            * Makefile: Updated for creation of the module for language subtag
72            registry.
73            
74    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
75    
76            * Makefile: WebIDL.html added.
77    
78            * WebIDL.pod: New documentation.
79    
80    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
81    
82            * WebIDL.pm: Checker's error types are redefined.
83    
84    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
85    
86            * WebIDL.pm: Parser's error types are redefined.  Some forward
87            compatible parsing bugs are fixed.  Some unreachable codes are
88            commented out.
89    
90    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
91    
92            * WebIDL.pm: Support for the reminding extended attributes are
93            added.  It does not satisfy the definition that a forward
94            interface declaration has an extended attribute.  It seems that
95            unless explicitly allowed multiple extended attributes with the
96            same name is not allowed, though it is not explicitly mentioned in
97            the spec.
98    
99    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
100    
101            * WebIDL.pm: Unescapes extended attribute names and extended
102            attribute identifiers.  Preserve whether an extended attribute has
103            an argument list of not.  Support for extended attributes:
104            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
105            NameGetter, NameSetter, and Null.
106            (has_argument_list): New attribute.
107            (idl_text): Stringifies argument lists, if any, even if it is
108            empty.
109    
110    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
111    
112            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
113            |{s_kwd}| in DATA_STATE as default.
114    
115    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
116    
117            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
118            by |{s_kwd}| in DATA_STATE.
119    
120    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
121    
122            * HTML.pm.src: Shorten keys.
123    
124    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
125    
126            * HTML.pm.src: Remove checking for control character, surrogate
127            pair, or noncharacter code points and non-Unicode code
128            points (they should be handled by Whatpm::Charset::UnicodeChecker).
129            (parse_char_stream): Support for the |$get_wrapper| argument and
130            character stream error handlers.
131    
132    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
133    
134            * ContentChecker.pm: Don't call |loda_ns_module|
135            for null-namespace elements/attributes.
136    
137            * HTML.pm.src: Fact out $disallowed_control_chars
138            as a hash.
139    
140    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
141    
142            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
143            and |{next_char}| initializations are moved to initialization
144            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
145            with |parse_char_stream|.
146    
147    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
148    
149            * HTML.pm.src (parse_char_stream): Make |set_next_char|
150            invoke |manakai_read_until|, not only |read|, where
151            possible, to decrease the number of |read| method calls.
152    
153            * mkhtmlparser.pl: Related changes to the aforementioned
154            modification.
155    
156    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
157    
158            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
159            would report character error from now.
160    
161    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
162    
163            * HTML.pm.src: White-space-leaded non-white-space character
164            tokens in "before head insertion mode" was not
165            correctly handled.
166            (set_inner_html): Reimplemented using CharString decodehandle
167            class.  Support for $get_wrapper argument.  Support
168            for |{read_until}| feature.
169    
170    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
171    
172            * HTML.pm.src: Make a "bare ero" error for unknown
173            entities point the "&" character.
174    
175    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
176    
177            * HTML.pm.src: It turns out that U+FFFD don't have to
178            be added to the list of excluded characters.
179    
180    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
181    
182            * HTML.pm.src ($char_onerror): Have character decoder's |line|
183            and |column| a higher priority than the one set by the
184            tokenizer's input handler.
185            ($self->{read_until}): Exclude U+FFFD (but this might
186            not be necessary, since now we do line/column fixup in
187            the character decode handle).
188    
189    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
190    
191            * HTML.pm.src: Use |{read_until}| where possible.
192    
193    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
194    
195            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
196            and |manakai_getc_until| to |manakai_read_until| to
197            reduce the number of string copies.
198    
199    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
200    
201            * HTML.pm.src (parse_char_string): Use newly created
202            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
203            standard feature to |open| a string as a filehandle,
204            since Perl's string filehandle seems not supporting |ungetc|
205            method correctly.
206            (parse_char_stream): Define |{getc_until}| method.
207            (DATA_STATE): Experimental support for |getc_until| feature.
208    
209    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
210    
211            * HTML.pm.src: Check points added to newly added branches.
212    
213    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
214    
215            * HTML.pm.src: Remove |{char}|, which is no longer used.
216            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
217            and replaced by |{prev_state}|.
218    
219            * mkhtmlparser.pl: Remove |{char}| feature.
220            Remove |!!!back-next-input-character;| macro.
221    
222    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
223    
224            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
225            entity related tokenizer states in favor of new states
226            implementing the consume character reference algorithm.
227    
228    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
229    
230            * HTML.pm.src: "Consume a character reference" algorithm is
231            now implemented as a tokenizer's state, rather than
232            a method, with minimum changes (more changes will
233            be made, in due course).  "Bogus comment state"'s inner
234            loop gets removed.
235    
236    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
237    
238            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
239            into their own tokenizer states.
240    
241    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
242    
243            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
244            is split into three states.
245    
246    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
247    
248            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
249            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
250            no longer does the tokenizer have to push back next input
251            characters in those states.
252    
253    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
254    
255            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
256            into four states so that no longer does the tokenizer have to push
257            back next input characters in that state.
258    
259    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
260    
261            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
262            which can be used to insert some wrapper between the character
263            stream handle and the tokenizer.  (It is currently not supported
264            for |set_inner_html| for |Element|s).
265    
266    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
267    
268            * HTML.pm.src: Ignore punctuations in charset names.
269    
270    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
271    
272            * ContentChecker.pm: Support for charset-layer error levels.
273    
274            * HTML.pm.src: Don't specify |text| argument for the
275            |chardecode:fallback| error, since it is not the encoding
276            being used alternatively.
277    
278    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
279    
280            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
281    
282    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
283    
284            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
285    
286    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
287    
288            * HTML.pm.src: Bug fix and sync with the spec with regard
289            to after after frameset insertion mode processing (HTML5
290            revision 1909).  Note that the implementation was wrong
291            per the old spec before the r1909 changes.
292    
293    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
294    
295            * HTMLTable.pm: scope=auto algorithm fix synced with the
296            spec (HTML5 revision 2093).
297            ($process_row): Algorithm step numbers synced with the
298            spec (HTML5 revision 2092).
299    
300    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
301    
302            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
303            revision 2094).
304    
305    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
306    
307            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
308    
309    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
310    
311            * HTML.pm.src: '"' and "'" at the end of attribute
312            name (after another attribute) now raise parse error (HTML5
313            revision 2123).  Empty unquoted attribute values are no
314            longer allowed (HTML5 revision 2122).
315    
316    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
317    
318            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
319            revision 2130).
320    
321    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
322    
323            * ContentChecker.pm: |xml:lang| attribute value must be same
324            as |lang| attribute value for HTML elements (HTML5 revision 2062
325            and so on).
326    
327    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
328    
329            * ContentChecker.pm: Error level definition for |xml_id_error|
330            was missing.
331    
332            * URIChecker.pm: The end of the URL should be marked as the
333            error location for an empty path error.  The position
334            between the userinfo and the port components should be
335            marked as the error location for an empty host error.
336    
337    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
338    
339            * URIChecker.pm: Set parameters representing where in the
340            value the error occurs for errors.  Report unknown
341            address format error in warning level, since address
342            formats are rarely added.  Path segments starting with "/.."
343            were misinterpreted as a dot-segment.
344    
345    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
346    
347            * URIChecker.pm (check_iri_reference): Requires
348            |Message::DOM::DOMImplementation|.
349    
350    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
351    
352            * IMTChecker.pm: Updated for the new error reporting architecture.
353    
354            * ContentChecker.pm: Error levels for IMTs are added.
355    
356    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
357    
358            * H2H.pm (_shift_token): Support for unquoted HTML attribute
359            values.
360    
361  2008-08-16  Wakaba  <wakaba@suika.fam.cx>  2008-08-16  Wakaba  <wakaba@suika.fam.cx>
362    
363          * CacheManifest.pm: Support for new style of error          * CacheManifest.pm: Support for new style of error

Legend:
Removed from v.1.275  
changed lines
  Added in v.1.331

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24