/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.275 by wakaba, Sat Aug 16 07:35:22 2008 UTC revision 1.348 by wakaba, Tue Oct 7 11:41:41 2008 UTC
# Line 1  Line 1 
1    2008-10-07  Wakaba  <wakaba@suika.fam.cx>
2    
3            * ContentChecker.pm: New error level "html5_fact" added, which
4            should be tentatively used until all of requirements are properly
5            specced as RFC 2119 "MUST" in HTML5.
6    
7    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
8    
9            * ContentChecker.pod: Note on internal flags is added.
10    
11    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
12    
13            * HTML.pm.src: An AAA bug fixed.
14    
15    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
16    
17            * HTML.pm.src: If another node is inserted by the parser, don't
18            reuse existing Text node to append a character (HTML5 revision
19            2124).
20    
21    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
22    
23            * HTML.pm.src: Support for <option> and <optgroup> in body (HTML5
24            revisions 1731 and 2128).
25    
26    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
27    
28            * HTML.pm.src: Make <foreignObject> scoping (HTML5 revision 1837).
29            Support for end tags of camelCase SVG elements were broken.  A
30            wrong error type text fixed.
31    
32    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
33    
34            * HTML.pm.src: Drop redundant code (HTML5 revision 1731).
35    
36    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
37    
38            * HTML.pm.src: Support for new definition of |param| and |source|
39            start tag parsing (HTML5 revision 1731).
40    
41    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
42    
43            * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
44    
45    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
46    
47            * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
48            revisions 1731 and 1831).
49    
50    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
51    
52            * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
53            revisions 1731 and 1778).  Support for the </sarcasm> end tag (HTML5
54            revision 1731).
55    
56    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
57    
58            * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
59            revision 1731).  End tags of |option| and |optgroup| elements are
60            now optional (HTML5 revision 1731).
61    
62    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
63    
64            * HTML.pm.src: New "special" elements added to the list (HTML5
65            revision 1778).  "strile" -> "strike".
66    
67    2008-10-02  Wakaba  <wakaba@suika.fam.cx>
68    
69            * ContentType.pm (get_sniffed_type): Support for the "better"
70            content sniffing (HTML5 revision 1927).  In a case the official
71            type was not returned when the method is invoked in the list
72            context.
73    
74    2008-09-22  Wakaba  <wakaba@suika.fam.cx>
75    
76            * HTML.pm.src: Character references for non-space C0 characters,
77            including U+000B VT, DEL character, noncharacter code points, are
78            now converted to the U+FFFD character (cf. HTML5 revision 2138).
79    
80    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
81    
82            * ContentChecker.pm: |form=""| check support added.
83    
84    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
85    
86            * ContentChecker.pm: |contextmenu| validness is now checked using
87            |id| and |id_type| properties, and |menu| property is removed.
88    
89    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
90    
91            * ContentChecker.pm: Prepare for |form| |name| attribute's
92            duplication checking.
93    
94    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
95    
96            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
97            unsupported charset="" parameter value (HTML5 revision 2131).
98    
99    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
100    
101            * HTML.pm.src: Reminding places where U+000B is allowed as a space
102            character is fixed (cf. HTML5 revision 1738).
103    
104            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
105            space characters (HTML5 revision 1738).
106    
107    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
108    
109            * HTML.pm.src: The "anything else" case for the "after after body"
110            insertion mode was not updated to swtich to the "in body"
111            insertion mode.  U+000B is no longer a space character for the
112            purpose of tree construction phase (HTML5 revision 1738).
113    
114    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
115    
116            * HTML.pm.src: U+000B is no longer a space character (HTML5
117            revision 1738).
118    
119    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
120    
121            * ContentType.pm: 0x0B is no longer a space character (HTML5
122            revision 1738).
123    
124            * HTML.pm.src: U+000B is no longer a space character for the
125            algorithm for extracting an encoding from a Content-Type (HTML5
126            revision 1738).
127    
128    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
129    
130            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
131    
132    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
133    
134            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
135    
136            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
137            matching.
138    
139    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
140    
141            * LangTag.pm: Warn for private use language subtags.  Error level
142            typos fixed.  Support for Suppress-Script field.
143    
144            * mklangreg.pl: Support for dumping of nested structure.
145    
146    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
147    
148            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
149            recommended case as per RFC 4646.
150    
151    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
152    
153            * LangTag.pm (check_rfc4646_langtag): New method.
154    
155    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
156    
157            * mklangreg.pl: New script.
158    
159            * Makefile: Updated for creation of the module for language subtag
160            registry.
161            
162    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
163    
164            * Makefile: WebIDL.html added.
165    
166            * WebIDL.pod: New documentation.
167    
168    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
169    
170            * WebIDL.pm: Checker's error types are redefined.
171    
172    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
173    
174            * WebIDL.pm: Parser's error types are redefined.  Some forward
175            compatible parsing bugs are fixed.  Some unreachable codes are
176            commented out.
177    
178    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
179    
180            * WebIDL.pm: Support for the reminding extended attributes are
181            added.  It does not satisfy the definition that a forward
182            interface declaration has an extended attribute.  It seems that
183            unless explicitly allowed multiple extended attributes with the
184            same name is not allowed, though it is not explicitly mentioned in
185            the spec.
186    
187    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
188    
189            * WebIDL.pm: Unescapes extended attribute names and extended
190            attribute identifiers.  Preserve whether an extended attribute has
191            an argument list of not.  Support for extended attributes:
192            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
193            NameGetter, NameSetter, and Null.
194            (has_argument_list): New attribute.
195            (idl_text): Stringifies argument lists, if any, even if it is
196            empty.
197    
198    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
199    
200            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
201            |{s_kwd}| in DATA_STATE as default.
202    
203    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
204    
205            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
206            by |{s_kwd}| in DATA_STATE.
207    
208    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
209    
210            * HTML.pm.src: Shorten keys.
211    
212    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
213    
214            * HTML.pm.src: Remove checking for control character, surrogate
215            pair, or noncharacter code points and non-Unicode code
216            points (they should be handled by Whatpm::Charset::UnicodeChecker).
217            (parse_char_stream): Support for the |$get_wrapper| argument and
218            character stream error handlers.
219    
220    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
221    
222            * ContentChecker.pm: Don't call |loda_ns_module|
223            for null-namespace elements/attributes.
224    
225            * HTML.pm.src: Fact out $disallowed_control_chars
226            as a hash.
227    
228    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
229    
230            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
231            and |{next_char}| initializations are moved to initialization
232            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
233            with |parse_char_stream|.
234    
235    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
236    
237            * HTML.pm.src (parse_char_stream): Make |set_next_char|
238            invoke |manakai_read_until|, not only |read|, where
239            possible, to decrease the number of |read| method calls.
240    
241            * mkhtmlparser.pl: Related changes to the aforementioned
242            modification.
243    
244    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
245    
246            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
247            would report character error from now.
248    
249    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
250    
251            * HTML.pm.src: White-space-leaded non-white-space character
252            tokens in "before head insertion mode" was not
253            correctly handled.
254            (set_inner_html): Reimplemented using CharString decodehandle
255            class.  Support for $get_wrapper argument.  Support
256            for |{read_until}| feature.
257    
258    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
259    
260            * HTML.pm.src: Make a "bare ero" error for unknown
261            entities point the "&" character.
262    
263    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
264    
265            * HTML.pm.src: It turns out that U+FFFD don't have to
266            be added to the list of excluded characters.
267    
268    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
269    
270            * HTML.pm.src ($char_onerror): Have character decoder's |line|
271            and |column| a higher priority than the one set by the
272            tokenizer's input handler.
273            ($self->{read_until}): Exclude U+FFFD (but this might
274            not be necessary, since now we do line/column fixup in
275            the character decode handle).
276    
277    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
278    
279            * HTML.pm.src: Use |{read_until}| where possible.
280    
281    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
282    
283            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
284            and |manakai_getc_until| to |manakai_read_until| to
285            reduce the number of string copies.
286    
287    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
288    
289            * HTML.pm.src (parse_char_string): Use newly created
290            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
291            standard feature to |open| a string as a filehandle,
292            since Perl's string filehandle seems not supporting |ungetc|
293            method correctly.
294            (parse_char_stream): Define |{getc_until}| method.
295            (DATA_STATE): Experimental support for |getc_until| feature.
296    
297    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
298    
299            * HTML.pm.src: Check points added to newly added branches.
300    
301    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
302    
303            * HTML.pm.src: Remove |{char}|, which is no longer used.
304            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
305            and replaced by |{prev_state}|.
306    
307            * mkhtmlparser.pl: Remove |{char}| feature.
308            Remove |!!!back-next-input-character;| macro.
309    
310    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
311    
312            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
313            entity related tokenizer states in favor of new states
314            implementing the consume character reference algorithm.
315    
316    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
317    
318            * HTML.pm.src: "Consume a character reference" algorithm is
319            now implemented as a tokenizer's state, rather than
320            a method, with minimum changes (more changes will
321            be made, in due course).  "Bogus comment state"'s inner
322            loop gets removed.
323    
324    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
325    
326            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
327            into their own tokenizer states.
328    
329    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
330    
331            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
332            is split into three states.
333    
334    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
335    
336            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
337            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
338            no longer does the tokenizer have to push back next input
339            characters in those states.
340    
341    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
342    
343            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
344            into four states so that no longer does the tokenizer have to push
345            back next input characters in that state.
346    
347    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
348    
349            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
350            which can be used to insert some wrapper between the character
351            stream handle and the tokenizer.  (It is currently not supported
352            for |set_inner_html| for |Element|s).
353    
354    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
355    
356            * HTML.pm.src: Ignore punctuations in charset names.
357    
358    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
359    
360            * ContentChecker.pm: Support for charset-layer error levels.
361    
362            * HTML.pm.src: Don't specify |text| argument for the
363            |chardecode:fallback| error, since it is not the encoding
364            being used alternatively.
365    
366    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
367    
368            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
369    
370    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
371    
372            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
373    
374    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
375    
376            * HTML.pm.src: Bug fix and sync with the spec with regard
377            to after after frameset insertion mode processing (HTML5
378            revision 1909).  Note that the implementation was wrong
379            per the old spec before the r1909 changes.
380    
381    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
382    
383            * HTMLTable.pm: scope=auto algorithm fix synced with the
384            spec (HTML5 revision 2093).
385            ($process_row): Algorithm step numbers synced with the
386            spec (HTML5 revision 2092).
387    
388    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
389    
390            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
391            revision 2094).
392    
393    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
394    
395            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
396    
397    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
398    
399            * HTML.pm.src: '"' and "'" at the end of attribute
400            name (after another attribute) now raise parse error (HTML5
401            revision 2123).  Empty unquoted attribute values are no
402            longer allowed (HTML5 revision 2122).
403    
404    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
405    
406            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
407            revision 2130).
408    
409    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
410    
411            * ContentChecker.pm: |xml:lang| attribute value must be same
412            as |lang| attribute value for HTML elements (HTML5 revision 2062
413            and so on).
414    
415    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
416    
417            * ContentChecker.pm: Error level definition for |xml_id_error|
418            was missing.
419    
420            * URIChecker.pm: The end of the URL should be marked as the
421            error location for an empty path error.  The position
422            between the userinfo and the port components should be
423            marked as the error location for an empty host error.
424    
425    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
426    
427            * URIChecker.pm: Set parameters representing where in the
428            value the error occurs for errors.  Report unknown
429            address format error in warning level, since address
430            formats are rarely added.  Path segments starting with "/.."
431            were misinterpreted as a dot-segment.
432    
433    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
434    
435            * URIChecker.pm (check_iri_reference): Requires
436            |Message::DOM::DOMImplementation|.
437    
438    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
439    
440            * IMTChecker.pm: Updated for the new error reporting architecture.
441    
442            * ContentChecker.pm: Error levels for IMTs are added.
443    
444    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
445    
446            * H2H.pm (_shift_token): Support for unquoted HTML attribute
447            values.
448    
449  2008-08-16  Wakaba  <wakaba@suika.fam.cx>  2008-08-16  Wakaba  <wakaba@suika.fam.cx>
450    
451          * CacheManifest.pm: Support for new style of error          * CacheManifest.pm: Support for new style of error

Legend:
Removed from v.1.275  
changed lines
  Added in v.1.348

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24