/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.267 by wakaba, Sat Aug 2 15:14:23 2008 UTC revision 1.350 by wakaba, Mon Oct 13 08:22:29 2008 UTC
# Line 1  Line 1 
1    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src: Element category constants redefined.
4            
5    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
6    
7            * HTML.pm.src: Steps for CDATA/RCDATA elements in tree
8            construction stage synced with the spec (HTML5 revisions 2139 and
9            2302).
10    
11    2008-10-07  Wakaba  <wakaba@suika.fam.cx>
12    
13            * ContentChecker.pm: New error level "html5_fact" added, which
14            should be tentatively used until all of requirements are properly
15            specced as RFC 2119 "MUST" in HTML5.
16    
17    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
18    
19            * ContentChecker.pod: Note on internal flags is added.
20    
21    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
22    
23            * HTML.pm.src: An AAA bug fixed.
24    
25    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
26    
27            * HTML.pm.src: If another node is inserted by the parser, don't
28            reuse existing Text node to append a character (HTML5 revision
29            2124).
30    
31    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
32    
33            * HTML.pm.src: Support for <option> and <optgroup> in body (HTML5
34            revisions 1731 and 2128).
35    
36    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
37    
38            * HTML.pm.src: Make <foreignObject> scoping (HTML5 revision 1837).
39            Support for end tags of camelCase SVG elements were broken.  A
40            wrong error type text fixed.
41    
42    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
43    
44            * HTML.pm.src: Drop redundant code (HTML5 revision 1731).
45    
46    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
47    
48            * HTML.pm.src: Support for new definition of |param| and |source|
49            start tag parsing (HTML5 revision 1731).
50    
51    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
52    
53            * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
54    
55    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
56    
57            * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
58            revisions 1731 and 1831).
59    
60    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
61    
62            * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
63            revisions 1731 and 1778).  Support for the </sarcasm> end tag (HTML5
64            revision 1731).
65    
66    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
67    
68            * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
69            revision 1731).  End tags of |option| and |optgroup| elements are
70            now optional (HTML5 revision 1731).
71    
72    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
73    
74            * HTML.pm.src: New "special" elements added to the list (HTML5
75            revision 1778).  "strile" -> "strike".
76    
77    2008-10-02  Wakaba  <wakaba@suika.fam.cx>
78    
79            * ContentType.pm (get_sniffed_type): Support for the "better"
80            content sniffing (HTML5 revision 1927).  In a case the official
81            type was not returned when the method is invoked in the list
82            context.
83    
84    2008-09-22  Wakaba  <wakaba@suika.fam.cx>
85    
86            * HTML.pm.src: Character references for non-space C0 characters,
87            including U+000B VT, DEL character, noncharacter code points, are
88            now converted to the U+FFFD character (cf. HTML5 revision 2138).
89    
90    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
91    
92            * ContentChecker.pm: |form=""| check support added.
93    
94    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
95    
96            * ContentChecker.pm: |contextmenu| validness is now checked using
97            |id| and |id_type| properties, and |menu| property is removed.
98    
99    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
100    
101            * ContentChecker.pm: Prepare for |form| |name| attribute's
102            duplication checking.
103    
104    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
105    
106            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
107            unsupported charset="" parameter value (HTML5 revision 2131).
108    
109    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
110    
111            * HTML.pm.src: Reminding places where U+000B is allowed as a space
112            character is fixed (cf. HTML5 revision 1738).
113    
114            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
115            space characters (HTML5 revision 1738).
116    
117    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
118    
119            * HTML.pm.src: The "anything else" case for the "after after body"
120            insertion mode was not updated to swtich to the "in body"
121            insertion mode.  U+000B is no longer a space character for the
122            purpose of tree construction phase (HTML5 revision 1738).
123    
124    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
125    
126            * HTML.pm.src: U+000B is no longer a space character (HTML5
127            revision 1738).
128    
129    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
130    
131            * ContentType.pm: 0x0B is no longer a space character (HTML5
132            revision 1738).
133    
134            * HTML.pm.src: U+000B is no longer a space character for the
135            algorithm for extracting an encoding from a Content-Type (HTML5
136            revision 1738).
137    
138    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
139    
140            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
141    
142    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
143    
144            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
145    
146            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
147            matching.
148    
149    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
150    
151            * LangTag.pm: Warn for private use language subtags.  Error level
152            typos fixed.  Support for Suppress-Script field.
153    
154            * mklangreg.pl: Support for dumping of nested structure.
155    
156    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
157    
158            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
159            recommended case as per RFC 4646.
160    
161    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
162    
163            * LangTag.pm (check_rfc4646_langtag): New method.
164    
165    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
166    
167            * mklangreg.pl: New script.
168    
169            * Makefile: Updated for creation of the module for language subtag
170            registry.
171            
172    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
173    
174            * Makefile: WebIDL.html added.
175    
176            * WebIDL.pod: New documentation.
177    
178    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
179    
180            * WebIDL.pm: Checker's error types are redefined.
181    
182    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
183    
184            * WebIDL.pm: Parser's error types are redefined.  Some forward
185            compatible parsing bugs are fixed.  Some unreachable codes are
186            commented out.
187    
188    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
189    
190            * WebIDL.pm: Support for the reminding extended attributes are
191            added.  It does not satisfy the definition that a forward
192            interface declaration has an extended attribute.  It seems that
193            unless explicitly allowed multiple extended attributes with the
194            same name is not allowed, though it is not explicitly mentioned in
195            the spec.
196    
197    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
198    
199            * WebIDL.pm: Unescapes extended attribute names and extended
200            attribute identifiers.  Preserve whether an extended attribute has
201            an argument list of not.  Support for extended attributes:
202            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
203            NameGetter, NameSetter, and Null.
204            (has_argument_list): New attribute.
205            (idl_text): Stringifies argument lists, if any, even if it is
206            empty.
207    
208    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
209    
210            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
211            |{s_kwd}| in DATA_STATE as default.
212    
213    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
214    
215            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
216            by |{s_kwd}| in DATA_STATE.
217    
218    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
219    
220            * HTML.pm.src: Shorten keys.
221    
222    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
223    
224            * HTML.pm.src: Remove checking for control character, surrogate
225            pair, or noncharacter code points and non-Unicode code
226            points (they should be handled by Whatpm::Charset::UnicodeChecker).
227            (parse_char_stream): Support for the |$get_wrapper| argument and
228            character stream error handlers.
229    
230    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
231    
232            * ContentChecker.pm: Don't call |loda_ns_module|
233            for null-namespace elements/attributes.
234    
235            * HTML.pm.src: Fact out $disallowed_control_chars
236            as a hash.
237    
238    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
239    
240            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
241            and |{next_char}| initializations are moved to initialization
242            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
243            with |parse_char_stream|.
244    
245    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
246    
247            * HTML.pm.src (parse_char_stream): Make |set_next_char|
248            invoke |manakai_read_until|, not only |read|, where
249            possible, to decrease the number of |read| method calls.
250    
251            * mkhtmlparser.pl: Related changes to the aforementioned
252            modification.
253    
254    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
255    
256            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
257            would report character error from now.
258    
259    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
260    
261            * HTML.pm.src: White-space-leaded non-white-space character
262            tokens in "before head insertion mode" was not
263            correctly handled.
264            (set_inner_html): Reimplemented using CharString decodehandle
265            class.  Support for $get_wrapper argument.  Support
266            for |{read_until}| feature.
267    
268    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
269    
270            * HTML.pm.src: Make a "bare ero" error for unknown
271            entities point the "&" character.
272    
273    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
274    
275            * HTML.pm.src: It turns out that U+FFFD don't have to
276            be added to the list of excluded characters.
277    
278    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
279    
280            * HTML.pm.src ($char_onerror): Have character decoder's |line|
281            and |column| a higher priority than the one set by the
282            tokenizer's input handler.
283            ($self->{read_until}): Exclude U+FFFD (but this might
284            not be necessary, since now we do line/column fixup in
285            the character decode handle).
286    
287    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
288    
289            * HTML.pm.src: Use |{read_until}| where possible.
290    
291    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
292    
293            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
294            and |manakai_getc_until| to |manakai_read_until| to
295            reduce the number of string copies.
296    
297    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
298    
299            * HTML.pm.src (parse_char_string): Use newly created
300            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
301            standard feature to |open| a string as a filehandle,
302            since Perl's string filehandle seems not supporting |ungetc|
303            method correctly.
304            (parse_char_stream): Define |{getc_until}| method.
305            (DATA_STATE): Experimental support for |getc_until| feature.
306    
307    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
308    
309            * HTML.pm.src: Check points added to newly added branches.
310    
311    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
312    
313            * HTML.pm.src: Remove |{char}|, which is no longer used.
314            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
315            and replaced by |{prev_state}|.
316    
317            * mkhtmlparser.pl: Remove |{char}| feature.
318            Remove |!!!back-next-input-character;| macro.
319    
320    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
321    
322            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
323            entity related tokenizer states in favor of new states
324            implementing the consume character reference algorithm.
325    
326    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
327    
328            * HTML.pm.src: "Consume a character reference" algorithm is
329            now implemented as a tokenizer's state, rather than
330            a method, with minimum changes (more changes will
331            be made, in due course).  "Bogus comment state"'s inner
332            loop gets removed.
333    
334    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
335    
336            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
337            into their own tokenizer states.
338    
339    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
340    
341            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
342            is split into three states.
343    
344    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
345    
346            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
347            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
348            no longer does the tokenizer have to push back next input
349            characters in those states.
350    
351    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
352    
353            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
354            into four states so that no longer does the tokenizer have to push
355            back next input characters in that state.
356    
357    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
358    
359            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
360            which can be used to insert some wrapper between the character
361            stream handle and the tokenizer.  (It is currently not supported
362            for |set_inner_html| for |Element|s).
363    
364    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
365    
366            * HTML.pm.src: Ignore punctuations in charset names.
367    
368    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
369    
370            * ContentChecker.pm: Support for charset-layer error levels.
371    
372            * HTML.pm.src: Don't specify |text| argument for the
373            |chardecode:fallback| error, since it is not the encoding
374            being used alternatively.
375    
376    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
377    
378            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
379    
380    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
381    
382            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
383    
384    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
385    
386            * HTML.pm.src: Bug fix and sync with the spec with regard
387            to after after frameset insertion mode processing (HTML5
388            revision 1909).  Note that the implementation was wrong
389            per the old spec before the r1909 changes.
390    
391    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
392    
393            * HTMLTable.pm: scope=auto algorithm fix synced with the
394            spec (HTML5 revision 2093).
395            ($process_row): Algorithm step numbers synced with the
396            spec (HTML5 revision 2092).
397    
398    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
399    
400            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
401            revision 2094).
402    
403    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
404    
405            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
406    
407    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
408    
409            * HTML.pm.src: '"' and "'" at the end of attribute
410            name (after another attribute) now raise parse error (HTML5
411            revision 2123).  Empty unquoted attribute values are no
412            longer allowed (HTML5 revision 2122).
413    
414    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
415    
416            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
417            revision 2130).
418    
419    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
420    
421            * ContentChecker.pm: |xml:lang| attribute value must be same
422            as |lang| attribute value for HTML elements (HTML5 revision 2062
423            and so on).
424    
425    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
426    
427            * ContentChecker.pm: Error level definition for |xml_id_error|
428            was missing.
429    
430            * URIChecker.pm: The end of the URL should be marked as the
431            error location for an empty path error.  The position
432            between the userinfo and the port components should be
433            marked as the error location for an empty host error.
434    
435    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
436    
437            * URIChecker.pm: Set parameters representing where in the
438            value the error occurs for errors.  Report unknown
439            address format error in warning level, since address
440            formats are rarely added.  Path segments starting with "/.."
441            were misinterpreted as a dot-segment.
442    
443    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
444    
445            * URIChecker.pm (check_iri_reference): Requires
446            |Message::DOM::DOMImplementation|.
447    
448    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
449    
450            * IMTChecker.pm: Updated for the new error reporting architecture.
451    
452            * ContentChecker.pm: Error levels for IMTs are added.
453    
454    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
455    
456            * H2H.pm (_shift_token): Support for unquoted HTML attribute
457            values.
458    
459    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
460    
461            * CacheManifest.pm: Support for new style of error
462            reports.
463    
464            * HTML.pm.src: Set line=1, column=1 to the document node.
465    
466    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
467    
468            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
469            and URL checkers.  Support for more error levels for bogus
470            langauge tag and URL "standards".
471    
472            * LangTag.pm, URIChecker.pm: Support for new style error
473            level reporting.
474    
475    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
476    
477            * ContentChecker.pm: Support for RDF/XML error levels.
478    
479            * HTMLTable.pm, RDFXML.pm: Support for new style of error level
480            specifying.  Error types are revised.
481    
482    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
483    
484            * ContentChecker.pm: All error reporting method calls are
485            renewed.
486    
487    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
488    
489            * HTML.pm.src: All error type names and "text" parameters
490            are revised.  Use new style for "level" specification.
491    
492            * mkhtmlparser.pl: Use new style for "level" specification.
493    
494    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
495    
496            * WebIDL.pm (parse_char_string): Simplified error
497            reporting process for broken ignored valuetype definition.
498            (Valuetype idl_text): Support for special "DOMString" name.
499    
500    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
501    
502            * WebIDL.pm ($get_scoped_name): Append "::::" if the last
503            terminal of the ScopedName is "DOMString", such that whether
504            the last part of the scoped name is "DOMString" or "_DOMString"
505            later.  It is necessary to determine whether a |typedef|
506            definition should be ignored or not.
507            (parse_char_string): Unescape the identifier of
508            exception members.
509            ($resolve): Return undef for builtin types and sequence<T>
510            types (we might not have to do this, however...).
511            (check): Support checking for Exceptions, Valuetypes,
512            and Typedefs.
513            ($serialize_type): Support for "DOMString::::" syntax.
514            (Typedef idl_text): Output Type as "DOMString" if it
515            is really "DOMString" (i.e. its internal representation
516            is "::DOMString::").
517    
518    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
519    
520            * WebIDL.pm ($resolve): New code, based on resolve code
521            for constant types in the |check| method.
522            (check): Support for checking of attributes, operations, and
523            arguments.
524            (Attribute/Operation idl_text): Exception names in getraises,
525            setraises, and raises clauses is serizlied by |$serialize_type|
526            code.
527    
528  2008-08-02  Wakaba  <wakaba@suika.fam.cx>  2008-08-02  Wakaba  <wakaba@suika.fam.cx>
529    
530          * WebIDL.pm ($integer): Order of selections are changed to match          * WebIDL.pm ($integer): Order of selections are changed to match

Legend:
Removed from v.1.267  
changed lines
  Added in v.1.350

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24