/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.275 by wakaba, Sat Aug 16 07:35:22 2008 UTC revision 1.349 by wakaba, Mon Oct 13 06:18:30 2008 UTC
# Line 1  Line 1 
1    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src: Steps for CDATA/RCDATA elements in tree
4            construction stage synced with the spec (HTML5 revisions 2139 and
5            2302).
6    
7    2008-10-07  Wakaba  <wakaba@suika.fam.cx>
8    
9            * ContentChecker.pm: New error level "html5_fact" added, which
10            should be tentatively used until all of requirements are properly
11            specced as RFC 2119 "MUST" in HTML5.
12    
13    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
14    
15            * ContentChecker.pod: Note on internal flags is added.
16    
17    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
18    
19            * HTML.pm.src: An AAA bug fixed.
20    
21    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
22    
23            * HTML.pm.src: If another node is inserted by the parser, don't
24            reuse existing Text node to append a character (HTML5 revision
25            2124).
26    
27    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
28    
29            * HTML.pm.src: Support for <option> and <optgroup> in body (HTML5
30            revisions 1731 and 2128).
31    
32    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
33    
34            * HTML.pm.src: Make <foreignObject> scoping (HTML5 revision 1837).
35            Support for end tags of camelCase SVG elements were broken.  A
36            wrong error type text fixed.
37    
38    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
39    
40            * HTML.pm.src: Drop redundant code (HTML5 revision 1731).
41    
42    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
43    
44            * HTML.pm.src: Support for new definition of |param| and |source|
45            start tag parsing (HTML5 revision 1731).
46    
47    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
48    
49            * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
50    
51    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
52    
53            * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
54            revisions 1731 and 1831).
55    
56    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
57    
58            * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
59            revisions 1731 and 1778).  Support for the </sarcasm> end tag (HTML5
60            revision 1731).
61    
62    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
63    
64            * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
65            revision 1731).  End tags of |option| and |optgroup| elements are
66            now optional (HTML5 revision 1731).
67    
68    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
69    
70            * HTML.pm.src: New "special" elements added to the list (HTML5
71            revision 1778).  "strile" -> "strike".
72    
73    2008-10-02  Wakaba  <wakaba@suika.fam.cx>
74    
75            * ContentType.pm (get_sniffed_type): Support for the "better"
76            content sniffing (HTML5 revision 1927).  In a case the official
77            type was not returned when the method is invoked in the list
78            context.
79    
80    2008-09-22  Wakaba  <wakaba@suika.fam.cx>
81    
82            * HTML.pm.src: Character references for non-space C0 characters,
83            including U+000B VT, DEL character, noncharacter code points, are
84            now converted to the U+FFFD character (cf. HTML5 revision 2138).
85    
86    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
87    
88            * ContentChecker.pm: |form=""| check support added.
89    
90    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
91    
92            * ContentChecker.pm: |contextmenu| validness is now checked using
93            |id| and |id_type| properties, and |menu| property is removed.
94    
95    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
96    
97            * ContentChecker.pm: Prepare for |form| |name| attribute's
98            duplication checking.
99    
100    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
101    
102            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
103            unsupported charset="" parameter value (HTML5 revision 2131).
104    
105    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
106    
107            * HTML.pm.src: Reminding places where U+000B is allowed as a space
108            character is fixed (cf. HTML5 revision 1738).
109    
110            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
111            space characters (HTML5 revision 1738).
112    
113    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
114    
115            * HTML.pm.src: The "anything else" case for the "after after body"
116            insertion mode was not updated to swtich to the "in body"
117            insertion mode.  U+000B is no longer a space character for the
118            purpose of tree construction phase (HTML5 revision 1738).
119    
120    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
121    
122            * HTML.pm.src: U+000B is no longer a space character (HTML5
123            revision 1738).
124    
125    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
126    
127            * ContentType.pm: 0x0B is no longer a space character (HTML5
128            revision 1738).
129    
130            * HTML.pm.src: U+000B is no longer a space character for the
131            algorithm for extracting an encoding from a Content-Type (HTML5
132            revision 1738).
133    
134    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
135    
136            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
137    
138    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
139    
140            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
141    
142            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
143            matching.
144    
145    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
146    
147            * LangTag.pm: Warn for private use language subtags.  Error level
148            typos fixed.  Support for Suppress-Script field.
149    
150            * mklangreg.pl: Support for dumping of nested structure.
151    
152    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
153    
154            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
155            recommended case as per RFC 4646.
156    
157    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
158    
159            * LangTag.pm (check_rfc4646_langtag): New method.
160    
161    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
162    
163            * mklangreg.pl: New script.
164    
165            * Makefile: Updated for creation of the module for language subtag
166            registry.
167            
168    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
169    
170            * Makefile: WebIDL.html added.
171    
172            * WebIDL.pod: New documentation.
173    
174    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
175    
176            * WebIDL.pm: Checker's error types are redefined.
177    
178    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
179    
180            * WebIDL.pm: Parser's error types are redefined.  Some forward
181            compatible parsing bugs are fixed.  Some unreachable codes are
182            commented out.
183    
184    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
185    
186            * WebIDL.pm: Support for the reminding extended attributes are
187            added.  It does not satisfy the definition that a forward
188            interface declaration has an extended attribute.  It seems that
189            unless explicitly allowed multiple extended attributes with the
190            same name is not allowed, though it is not explicitly mentioned in
191            the spec.
192    
193    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
194    
195            * WebIDL.pm: Unescapes extended attribute names and extended
196            attribute identifiers.  Preserve whether an extended attribute has
197            an argument list of not.  Support for extended attributes:
198            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
199            NameGetter, NameSetter, and Null.
200            (has_argument_list): New attribute.
201            (idl_text): Stringifies argument lists, if any, even if it is
202            empty.
203    
204    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
205    
206            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
207            |{s_kwd}| in DATA_STATE as default.
208    
209    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
210    
211            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
212            by |{s_kwd}| in DATA_STATE.
213    
214    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
215    
216            * HTML.pm.src: Shorten keys.
217    
218    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
219    
220            * HTML.pm.src: Remove checking for control character, surrogate
221            pair, or noncharacter code points and non-Unicode code
222            points (they should be handled by Whatpm::Charset::UnicodeChecker).
223            (parse_char_stream): Support for the |$get_wrapper| argument and
224            character stream error handlers.
225    
226    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
227    
228            * ContentChecker.pm: Don't call |loda_ns_module|
229            for null-namespace elements/attributes.
230    
231            * HTML.pm.src: Fact out $disallowed_control_chars
232            as a hash.
233    
234    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
235    
236            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
237            and |{next_char}| initializations are moved to initialization
238            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
239            with |parse_char_stream|.
240    
241    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
242    
243            * HTML.pm.src (parse_char_stream): Make |set_next_char|
244            invoke |manakai_read_until|, not only |read|, where
245            possible, to decrease the number of |read| method calls.
246    
247            * mkhtmlparser.pl: Related changes to the aforementioned
248            modification.
249    
250    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
251    
252            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
253            would report character error from now.
254    
255    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
256    
257            * HTML.pm.src: White-space-leaded non-white-space character
258            tokens in "before head insertion mode" was not
259            correctly handled.
260            (set_inner_html): Reimplemented using CharString decodehandle
261            class.  Support for $get_wrapper argument.  Support
262            for |{read_until}| feature.
263    
264    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
265    
266            * HTML.pm.src: Make a "bare ero" error for unknown
267            entities point the "&" character.
268    
269    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
270    
271            * HTML.pm.src: It turns out that U+FFFD don't have to
272            be added to the list of excluded characters.
273    
274    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
275    
276            * HTML.pm.src ($char_onerror): Have character decoder's |line|
277            and |column| a higher priority than the one set by the
278            tokenizer's input handler.
279            ($self->{read_until}): Exclude U+FFFD (but this might
280            not be necessary, since now we do line/column fixup in
281            the character decode handle).
282    
283    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
284    
285            * HTML.pm.src: Use |{read_until}| where possible.
286    
287    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
288    
289            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
290            and |manakai_getc_until| to |manakai_read_until| to
291            reduce the number of string copies.
292    
293    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
294    
295            * HTML.pm.src (parse_char_string): Use newly created
296            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
297            standard feature to |open| a string as a filehandle,
298            since Perl's string filehandle seems not supporting |ungetc|
299            method correctly.
300            (parse_char_stream): Define |{getc_until}| method.
301            (DATA_STATE): Experimental support for |getc_until| feature.
302    
303    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
304    
305            * HTML.pm.src: Check points added to newly added branches.
306    
307    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
308    
309            * HTML.pm.src: Remove |{char}|, which is no longer used.
310            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
311            and replaced by |{prev_state}|.
312    
313            * mkhtmlparser.pl: Remove |{char}| feature.
314            Remove |!!!back-next-input-character;| macro.
315    
316    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
317    
318            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
319            entity related tokenizer states in favor of new states
320            implementing the consume character reference algorithm.
321    
322    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
323    
324            * HTML.pm.src: "Consume a character reference" algorithm is
325            now implemented as a tokenizer's state, rather than
326            a method, with minimum changes (more changes will
327            be made, in due course).  "Bogus comment state"'s inner
328            loop gets removed.
329    
330    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
331    
332            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
333            into their own tokenizer states.
334    
335    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
336    
337            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
338            is split into three states.
339    
340    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
341    
342            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
343            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
344            no longer does the tokenizer have to push back next input
345            characters in those states.
346    
347    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
348    
349            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
350            into four states so that no longer does the tokenizer have to push
351            back next input characters in that state.
352    
353    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
354    
355            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
356            which can be used to insert some wrapper between the character
357            stream handle and the tokenizer.  (It is currently not supported
358            for |set_inner_html| for |Element|s).
359    
360    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
361    
362            * HTML.pm.src: Ignore punctuations in charset names.
363    
364    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
365    
366            * ContentChecker.pm: Support for charset-layer error levels.
367    
368            * HTML.pm.src: Don't specify |text| argument for the
369            |chardecode:fallback| error, since it is not the encoding
370            being used alternatively.
371    
372    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
373    
374            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
375    
376    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
377    
378            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
379    
380    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
381    
382            * HTML.pm.src: Bug fix and sync with the spec with regard
383            to after after frameset insertion mode processing (HTML5
384            revision 1909).  Note that the implementation was wrong
385            per the old spec before the r1909 changes.
386    
387    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
388    
389            * HTMLTable.pm: scope=auto algorithm fix synced with the
390            spec (HTML5 revision 2093).
391            ($process_row): Algorithm step numbers synced with the
392            spec (HTML5 revision 2092).
393    
394    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
395    
396            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
397            revision 2094).
398    
399    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
400    
401            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
402    
403    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
404    
405            * HTML.pm.src: '"' and "'" at the end of attribute
406            name (after another attribute) now raise parse error (HTML5
407            revision 2123).  Empty unquoted attribute values are no
408            longer allowed (HTML5 revision 2122).
409    
410    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
411    
412            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
413            revision 2130).
414    
415    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
416    
417            * ContentChecker.pm: |xml:lang| attribute value must be same
418            as |lang| attribute value for HTML elements (HTML5 revision 2062
419            and so on).
420    
421    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
422    
423            * ContentChecker.pm: Error level definition for |xml_id_error|
424            was missing.
425    
426            * URIChecker.pm: The end of the URL should be marked as the
427            error location for an empty path error.  The position
428            between the userinfo and the port components should be
429            marked as the error location for an empty host error.
430    
431    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
432    
433            * URIChecker.pm: Set parameters representing where in the
434            value the error occurs for errors.  Report unknown
435            address format error in warning level, since address
436            formats are rarely added.  Path segments starting with "/.."
437            were misinterpreted as a dot-segment.
438    
439    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
440    
441            * URIChecker.pm (check_iri_reference): Requires
442            |Message::DOM::DOMImplementation|.
443    
444    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
445    
446            * IMTChecker.pm: Updated for the new error reporting architecture.
447    
448            * ContentChecker.pm: Error levels for IMTs are added.
449    
450    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
451    
452            * H2H.pm (_shift_token): Support for unquoted HTML attribute
453            values.
454    
455  2008-08-16  Wakaba  <wakaba@suika.fam.cx>  2008-08-16  Wakaba  <wakaba@suika.fam.cx>
456    
457          * CacheManifest.pm: Support for new style of error          * CacheManifest.pm: Support for new style of error

Legend:
Removed from v.1.275  
changed lines
  Added in v.1.349

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24