/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.276 by wakaba, Sun Aug 17 05:09:12 2008 UTC revision 1.332 by wakaba, Sun Sep 21 11:55:49 2008 UTC
# Line 1  Line 1 
1    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
2    
3            * ContentChecker.pm: |contextmenu| validness is now checked using
4            |id| and |id_type| properties, and |menu| property is removed.
5    
6    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
7    
8            * ContentChecker.pm: Prepare for |form| |name| attribute's
9            duplication checking.
10    
11    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
12    
13            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
14            unsupported charset="" parameter value (HTML5 revision 2131).
15    
16    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
17    
18            * HTML.pm.src: Reminding places where U+000B is allowed as a space
19            character is fixed (cf. HTML5 revision 1738).
20    
21            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
22            space characters (HTML5 revision 1738).
23    
24    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
25    
26            * HTML.pm.src: The "anything else" case for the "after after body"
27            insertion mode was not updated to swtich to the "in body"
28            insertion mode.  U+000B is no longer a space character for the
29            purpose of tree construction phase (HTML5 revision 1738).
30    
31    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
32    
33            * HTML.pm.src: U+000B is no longer a space character (HTML5
34            revision 1738).
35    
36    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
37    
38            * ContentType.pm: 0x0B is no longer a space character (HTML5
39            revision 1738).
40    
41            * HTML.pm.src: U+000B is no longer a space character for the
42            algorithm for extracting an encoding from a Content-Type (HTML5
43            revision 1738).
44    
45    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
46    
47            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
48    
49    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
50    
51            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
52    
53            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
54            matching.
55    
56    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
57    
58            * LangTag.pm: Warn for private use language subtags.  Error level
59            typos fixed.  Support for Suppress-Script field.
60    
61            * mklangreg.pl: Support for dumping of nested structure.
62    
63    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
64    
65            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
66            recommended case as per RFC 4646.
67    
68    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
69    
70            * LangTag.pm (check_rfc4646_langtag): New method.
71    
72    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
73    
74            * mklangreg.pl: New script.
75    
76            * Makefile: Updated for creation of the module for language subtag
77            registry.
78            
79    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
80    
81            * Makefile: WebIDL.html added.
82    
83            * WebIDL.pod: New documentation.
84    
85    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
86    
87            * WebIDL.pm: Checker's error types are redefined.
88    
89    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
90    
91            * WebIDL.pm: Parser's error types are redefined.  Some forward
92            compatible parsing bugs are fixed.  Some unreachable codes are
93            commented out.
94    
95    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
96    
97            * WebIDL.pm: Support for the reminding extended attributes are
98            added.  It does not satisfy the definition that a forward
99            interface declaration has an extended attribute.  It seems that
100            unless explicitly allowed multiple extended attributes with the
101            same name is not allowed, though it is not explicitly mentioned in
102            the spec.
103    
104    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
105    
106            * WebIDL.pm: Unescapes extended attribute names and extended
107            attribute identifiers.  Preserve whether an extended attribute has
108            an argument list of not.  Support for extended attributes:
109            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
110            NameGetter, NameSetter, and Null.
111            (has_argument_list): New attribute.
112            (idl_text): Stringifies argument lists, if any, even if it is
113            empty.
114    
115    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
116    
117            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
118            |{s_kwd}| in DATA_STATE as default.
119    
120    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
121    
122            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
123            by |{s_kwd}| in DATA_STATE.
124    
125    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
126    
127            * HTML.pm.src: Shorten keys.
128    
129    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
130    
131            * HTML.pm.src: Remove checking for control character, surrogate
132            pair, or noncharacter code points and non-Unicode code
133            points (they should be handled by Whatpm::Charset::UnicodeChecker).
134            (parse_char_stream): Support for the |$get_wrapper| argument and
135            character stream error handlers.
136    
137    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
138    
139            * ContentChecker.pm: Don't call |loda_ns_module|
140            for null-namespace elements/attributes.
141    
142            * HTML.pm.src: Fact out $disallowed_control_chars
143            as a hash.
144    
145    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
146    
147            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
148            and |{next_char}| initializations are moved to initialization
149            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
150            with |parse_char_stream|.
151    
152    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
153    
154            * HTML.pm.src (parse_char_stream): Make |set_next_char|
155            invoke |manakai_read_until|, not only |read|, where
156            possible, to decrease the number of |read| method calls.
157    
158            * mkhtmlparser.pl: Related changes to the aforementioned
159            modification.
160    
161    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
162    
163            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
164            would report character error from now.
165    
166    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
167    
168            * HTML.pm.src: White-space-leaded non-white-space character
169            tokens in "before head insertion mode" was not
170            correctly handled.
171            (set_inner_html): Reimplemented using CharString decodehandle
172            class.  Support for $get_wrapper argument.  Support
173            for |{read_until}| feature.
174    
175    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
176    
177            * HTML.pm.src: Make a "bare ero" error for unknown
178            entities point the "&" character.
179    
180    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
181    
182            * HTML.pm.src: It turns out that U+FFFD don't have to
183            be added to the list of excluded characters.
184    
185    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
186    
187            * HTML.pm.src ($char_onerror): Have character decoder's |line|
188            and |column| a higher priority than the one set by the
189            tokenizer's input handler.
190            ($self->{read_until}): Exclude U+FFFD (but this might
191            not be necessary, since now we do line/column fixup in
192            the character decode handle).
193    
194    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
195    
196            * HTML.pm.src: Use |{read_until}| where possible.
197    
198    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
199    
200            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
201            and |manakai_getc_until| to |manakai_read_until| to
202            reduce the number of string copies.
203    
204    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
205    
206            * HTML.pm.src (parse_char_string): Use newly created
207            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
208            standard feature to |open| a string as a filehandle,
209            since Perl's string filehandle seems not supporting |ungetc|
210            method correctly.
211            (parse_char_stream): Define |{getc_until}| method.
212            (DATA_STATE): Experimental support for |getc_until| feature.
213    
214    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
215    
216            * HTML.pm.src: Check points added to newly added branches.
217    
218    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
219    
220            * HTML.pm.src: Remove |{char}|, which is no longer used.
221            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
222            and replaced by |{prev_state}|.
223    
224            * mkhtmlparser.pl: Remove |{char}| feature.
225            Remove |!!!back-next-input-character;| macro.
226    
227    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
228    
229            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
230            entity related tokenizer states in favor of new states
231            implementing the consume character reference algorithm.
232    
233    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
234    
235            * HTML.pm.src: "Consume a character reference" algorithm is
236            now implemented as a tokenizer's state, rather than
237            a method, with minimum changes (more changes will
238            be made, in due course).  "Bogus comment state"'s inner
239            loop gets removed.
240    
241    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
242    
243            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
244            into their own tokenizer states.
245    
246    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
247    
248            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
249            is split into three states.
250    
251    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
252    
253            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
254            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
255            no longer does the tokenizer have to push back next input
256            characters in those states.
257    
258    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
259    
260            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
261            into four states so that no longer does the tokenizer have to push
262            back next input characters in that state.
263    
264    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
265    
266            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
267            which can be used to insert some wrapper between the character
268            stream handle and the tokenizer.  (It is currently not supported
269            for |set_inner_html| for |Element|s).
270    
271    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
272    
273            * HTML.pm.src: Ignore punctuations in charset names.
274    
275    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
276    
277            * ContentChecker.pm: Support for charset-layer error levels.
278    
279            * HTML.pm.src: Don't specify |text| argument for the
280            |chardecode:fallback| error, since it is not the encoding
281            being used alternatively.
282    
283    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
284    
285            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
286    
287    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
288    
289            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
290    
291    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
292    
293            * HTML.pm.src: Bug fix and sync with the spec with regard
294            to after after frameset insertion mode processing (HTML5
295            revision 1909).  Note that the implementation was wrong
296            per the old spec before the r1909 changes.
297    
298    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
299    
300            * HTMLTable.pm: scope=auto algorithm fix synced with the
301            spec (HTML5 revision 2093).
302            ($process_row): Algorithm step numbers synced with the
303            spec (HTML5 revision 2092).
304    
305    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
306    
307            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
308            revision 2094).
309    
310    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
311    
312            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
313    
314    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
315    
316            * HTML.pm.src: '"' and "'" at the end of attribute
317            name (after another attribute) now raise parse error (HTML5
318            revision 2123).  Empty unquoted attribute values are no
319            longer allowed (HTML5 revision 2122).
320    
321    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
322    
323            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
324            revision 2130).
325    
326    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
327    
328            * ContentChecker.pm: |xml:lang| attribute value must be same
329            as |lang| attribute value for HTML elements (HTML5 revision 2062
330            and so on).
331    
332    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
333    
334            * ContentChecker.pm: Error level definition for |xml_id_error|
335            was missing.
336    
337            * URIChecker.pm: The end of the URL should be marked as the
338            error location for an empty path error.  The position
339            between the userinfo and the port components should be
340            marked as the error location for an empty host error.
341    
342    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
343    
344            * URIChecker.pm: Set parameters representing where in the
345            value the error occurs for errors.  Report unknown
346            address format error in warning level, since address
347            formats are rarely added.  Path segments starting with "/.."
348            were misinterpreted as a dot-segment.
349    
350    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
351    
352            * URIChecker.pm (check_iri_reference): Requires
353            |Message::DOM::DOMImplementation|.
354    
355    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
356    
357            * IMTChecker.pm: Updated for the new error reporting architecture.
358    
359            * ContentChecker.pm: Error levels for IMTs are added.
360    
361  2008-08-17  Wakaba  <wakaba@suika.fam.cx>  2008-08-17  Wakaba  <wakaba@suika.fam.cx>
362    
363          * H2H.pm (_shift_token): Support for unquoted HTML attribute          * H2H.pm (_shift_token): Support for unquoted HTML attribute

Legend:
Removed from v.1.276  
changed lines
  Added in v.1.332

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24