/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.163 by wakaba, Thu Mar 6 15:56:52 2008 UTC revision 1.333 by wakaba, Sun Sep 21 12:37:09 2008 UTC
# Line 1  Line 1 
1    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
2    
3            * ContentChecker.pm: |form=""| check support added.
4    
5    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
6    
7            * ContentChecker.pm: |contextmenu| validness is now checked using
8            |id| and |id_type| properties, and |menu| property is removed.
9    
10    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
11    
12            * ContentChecker.pm: Prepare for |form| |name| attribute's
13            duplication checking.
14    
15    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
16    
17            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
18            unsupported charset="" parameter value (HTML5 revision 2131).
19    
20    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
21    
22            * HTML.pm.src: Reminding places where U+000B is allowed as a space
23            character is fixed (cf. HTML5 revision 1738).
24    
25            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
26            space characters (HTML5 revision 1738).
27    
28    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
29    
30            * HTML.pm.src: The "anything else" case for the "after after body"
31            insertion mode was not updated to swtich to the "in body"
32            insertion mode.  U+000B is no longer a space character for the
33            purpose of tree construction phase (HTML5 revision 1738).
34    
35    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
36    
37            * HTML.pm.src: U+000B is no longer a space character (HTML5
38            revision 1738).
39    
40    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
41    
42            * ContentType.pm: 0x0B is no longer a space character (HTML5
43            revision 1738).
44    
45            * HTML.pm.src: U+000B is no longer a space character for the
46            algorithm for extracting an encoding from a Content-Type (HTML5
47            revision 1738).
48    
49    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
50    
51            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
52    
53    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
54    
55            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
56    
57            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
58            matching.
59    
60    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
61    
62            * LangTag.pm: Warn for private use language subtags.  Error level
63            typos fixed.  Support for Suppress-Script field.
64    
65            * mklangreg.pl: Support for dumping of nested structure.
66    
67    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
68    
69            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
70            recommended case as per RFC 4646.
71    
72    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
73    
74            * LangTag.pm (check_rfc4646_langtag): New method.
75    
76    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
77    
78            * mklangreg.pl: New script.
79    
80            * Makefile: Updated for creation of the module for language subtag
81            registry.
82            
83    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
84    
85            * Makefile: WebIDL.html added.
86    
87            * WebIDL.pod: New documentation.
88    
89    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
90    
91            * WebIDL.pm: Checker's error types are redefined.
92    
93    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
94    
95            * WebIDL.pm: Parser's error types are redefined.  Some forward
96            compatible parsing bugs are fixed.  Some unreachable codes are
97            commented out.
98    
99    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
100    
101            * WebIDL.pm: Support for the reminding extended attributes are
102            added.  It does not satisfy the definition that a forward
103            interface declaration has an extended attribute.  It seems that
104            unless explicitly allowed multiple extended attributes with the
105            same name is not allowed, though it is not explicitly mentioned in
106            the spec.
107    
108    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
109    
110            * WebIDL.pm: Unescapes extended attribute names and extended
111            attribute identifiers.  Preserve whether an extended attribute has
112            an argument list of not.  Support for extended attributes:
113            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
114            NameGetter, NameSetter, and Null.
115            (has_argument_list): New attribute.
116            (idl_text): Stringifies argument lists, if any, even if it is
117            empty.
118    
119    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
120    
121            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
122            |{s_kwd}| in DATA_STATE as default.
123    
124    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
125    
126            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
127            by |{s_kwd}| in DATA_STATE.
128    
129    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
130    
131            * HTML.pm.src: Shorten keys.
132    
133    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
134    
135            * HTML.pm.src: Remove checking for control character, surrogate
136            pair, or noncharacter code points and non-Unicode code
137            points (they should be handled by Whatpm::Charset::UnicodeChecker).
138            (parse_char_stream): Support for the |$get_wrapper| argument and
139            character stream error handlers.
140    
141    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
142    
143            * ContentChecker.pm: Don't call |loda_ns_module|
144            for null-namespace elements/attributes.
145    
146            * HTML.pm.src: Fact out $disallowed_control_chars
147            as a hash.
148    
149    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
150    
151            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
152            and |{next_char}| initializations are moved to initialization
153            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
154            with |parse_char_stream|.
155    
156    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
157    
158            * HTML.pm.src (parse_char_stream): Make |set_next_char|
159            invoke |manakai_read_until|, not only |read|, where
160            possible, to decrease the number of |read| method calls.
161    
162            * mkhtmlparser.pl: Related changes to the aforementioned
163            modification.
164    
165    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
166    
167            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
168            would report character error from now.
169    
170    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
171    
172            * HTML.pm.src: White-space-leaded non-white-space character
173            tokens in "before head insertion mode" was not
174            correctly handled.
175            (set_inner_html): Reimplemented using CharString decodehandle
176            class.  Support for $get_wrapper argument.  Support
177            for |{read_until}| feature.
178    
179    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
180    
181            * HTML.pm.src: Make a "bare ero" error for unknown
182            entities point the "&" character.
183    
184    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
185    
186            * HTML.pm.src: It turns out that U+FFFD don't have to
187            be added to the list of excluded characters.
188    
189    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
190    
191            * HTML.pm.src ($char_onerror): Have character decoder's |line|
192            and |column| a higher priority than the one set by the
193            tokenizer's input handler.
194            ($self->{read_until}): Exclude U+FFFD (but this might
195            not be necessary, since now we do line/column fixup in
196            the character decode handle).
197    
198    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
199    
200            * HTML.pm.src: Use |{read_until}| where possible.
201    
202    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
203    
204            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
205            and |manakai_getc_until| to |manakai_read_until| to
206            reduce the number of string copies.
207    
208    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
209    
210            * HTML.pm.src (parse_char_string): Use newly created
211            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
212            standard feature to |open| a string as a filehandle,
213            since Perl's string filehandle seems not supporting |ungetc|
214            method correctly.
215            (parse_char_stream): Define |{getc_until}| method.
216            (DATA_STATE): Experimental support for |getc_until| feature.
217    
218    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
219    
220            * HTML.pm.src: Check points added to newly added branches.
221    
222    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
223    
224            * HTML.pm.src: Remove |{char}|, which is no longer used.
225            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
226            and replaced by |{prev_state}|.
227    
228            * mkhtmlparser.pl: Remove |{char}| feature.
229            Remove |!!!back-next-input-character;| macro.
230    
231    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
232    
233            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
234            entity related tokenizer states in favor of new states
235            implementing the consume character reference algorithm.
236    
237    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
238    
239            * HTML.pm.src: "Consume a character reference" algorithm is
240            now implemented as a tokenizer's state, rather than
241            a method, with minimum changes (more changes will
242            be made, in due course).  "Bogus comment state"'s inner
243            loop gets removed.
244    
245    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
246    
247            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
248            into their own tokenizer states.
249    
250    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
251    
252            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
253            is split into three states.
254    
255    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
256    
257            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
258            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
259            no longer does the tokenizer have to push back next input
260            characters in those states.
261    
262    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
263    
264            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
265            into four states so that no longer does the tokenizer have to push
266            back next input characters in that state.
267    
268    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
269    
270            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
271            which can be used to insert some wrapper between the character
272            stream handle and the tokenizer.  (It is currently not supported
273            for |set_inner_html| for |Element|s).
274    
275    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
276    
277            * HTML.pm.src: Ignore punctuations in charset names.
278    
279    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
280    
281            * ContentChecker.pm: Support for charset-layer error levels.
282    
283            * HTML.pm.src: Don't specify |text| argument for the
284            |chardecode:fallback| error, since it is not the encoding
285            being used alternatively.
286    
287    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
288    
289            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
290    
291    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
292    
293            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
294    
295    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
296    
297            * HTML.pm.src: Bug fix and sync with the spec with regard
298            to after after frameset insertion mode processing (HTML5
299            revision 1909).  Note that the implementation was wrong
300            per the old spec before the r1909 changes.
301    
302    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
303    
304            * HTMLTable.pm: scope=auto algorithm fix synced with the
305            spec (HTML5 revision 2093).
306            ($process_row): Algorithm step numbers synced with the
307            spec (HTML5 revision 2092).
308    
309    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
310    
311            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
312            revision 2094).
313    
314    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
315    
316            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
317    
318    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
319    
320            * HTML.pm.src: '"' and "'" at the end of attribute
321            name (after another attribute) now raise parse error (HTML5
322            revision 2123).  Empty unquoted attribute values are no
323            longer allowed (HTML5 revision 2122).
324    
325    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
326    
327            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
328            revision 2130).
329    
330    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
331    
332            * ContentChecker.pm: |xml:lang| attribute value must be same
333            as |lang| attribute value for HTML elements (HTML5 revision 2062
334            and so on).
335    
336    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
337    
338            * ContentChecker.pm: Error level definition for |xml_id_error|
339            was missing.
340    
341            * URIChecker.pm: The end of the URL should be marked as the
342            error location for an empty path error.  The position
343            between the userinfo and the port components should be
344            marked as the error location for an empty host error.
345    
346    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
347    
348            * URIChecker.pm: Set parameters representing where in the
349            value the error occurs for errors.  Report unknown
350            address format error in warning level, since address
351            formats are rarely added.  Path segments starting with "/.."
352            were misinterpreted as a dot-segment.
353    
354    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
355    
356            * URIChecker.pm (check_iri_reference): Requires
357            |Message::DOM::DOMImplementation|.
358    
359    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
360    
361            * IMTChecker.pm: Updated for the new error reporting architecture.
362    
363            * ContentChecker.pm: Error levels for IMTs are added.
364    
365    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
366    
367            * H2H.pm (_shift_token): Support for unquoted HTML attribute
368            values.
369    
370    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
371    
372            * CacheManifest.pm: Support for new style of error
373            reports.
374    
375            * HTML.pm.src: Set line=1, column=1 to the document node.
376    
377    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
378    
379            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
380            and URL checkers.  Support for more error levels for bogus
381            langauge tag and URL "standards".
382    
383            * LangTag.pm, URIChecker.pm: Support for new style error
384            level reporting.
385    
386    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
387    
388            * ContentChecker.pm: Support for RDF/XML error levels.
389    
390            * HTMLTable.pm, RDFXML.pm: Support for new style of error level
391            specifying.  Error types are revised.
392    
393    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
394    
395            * ContentChecker.pm: All error reporting method calls are
396            renewed.
397    
398    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
399    
400            * HTML.pm.src: All error type names and "text" parameters
401            are revised.  Use new style for "level" specification.
402    
403            * mkhtmlparser.pl: Use new style for "level" specification.
404    
405    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
406    
407            * WebIDL.pm (parse_char_string): Simplified error
408            reporting process for broken ignored valuetype definition.
409            (Valuetype idl_text): Support for special "DOMString" name.
410    
411    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
412    
413            * WebIDL.pm ($get_scoped_name): Append "::::" if the last
414            terminal of the ScopedName is "DOMString", such that whether
415            the last part of the scoped name is "DOMString" or "_DOMString"
416            later.  It is necessary to determine whether a |typedef|
417            definition should be ignored or not.
418            (parse_char_string): Unescape the identifier of
419            exception members.
420            ($resolve): Return undef for builtin types and sequence<T>
421            types (we might not have to do this, however...).
422            (check): Support checking for Exceptions, Valuetypes,
423            and Typedefs.
424            ($serialize_type): Support for "DOMString::::" syntax.
425            (Typedef idl_text): Output Type as "DOMString" if it
426            is really "DOMString" (i.e. its internal representation
427            is "::DOMString::").
428    
429    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
430    
431            * WebIDL.pm ($resolve): New code, based on resolve code
432            for constant types in the |check| method.
433            (check): Support for checking of attributes, operations, and
434            arguments.
435            (Attribute/Operation idl_text): Exception names in getraises,
436            setraises, and raises clauses is serizlied by |$serialize_type|
437            code.
438    
439    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
440    
441            * WebIDL.pm ($integer): Order of selections are changed to match
442            hexadecimal numbers (the original pattern, taken from the spec,
443            was not work for hexadecimal numbers, because the "0" prefix
444            matches to the [0-7]* part (as an empty string) and therefore
445            it does not match with remaining "x..." part of a "0x..." integer
446            literal.
447            ($get_type): It now returns a string, not an array reference,
448            for regular types and |sequence| types (i.e. it in any case
449            returns a string).
450            ($get_next_token): The second item in the array that represents
451            a integer or float token is now a Perl number value, not the
452            original string representation of the number.
453            (check): Support for const value consistency checking.
454            No extended attribute is defined for constants.
455            (Node subclasses): Use simple strings rather than array references
456            for default data type values.
457            ($serialize_type): Type values are now simple strings.
458            (value): If the new attribute value is a false value, then
459            a FALSE value is set to the attribute.
460    
461    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
462    
463            * WebIDL.pm ($get_scoped_name): Now scoped names are stored
464            in its stringified format ("scoped name" as defined in the
465            spec).  Note that future version of this module should not use
466            array references for type values and the |type_text| attribute
467            should be made obsolete.
468            (parse_char_string): Unescape attribute names.
469            (check): Support for checking of whether inherited interfaces
470            are actually defined or not.  Support for checking of whether
471            interface member identifiers are duplicated or not.
472            ($serialize_type): Scoped names are returned as is.  A future
473            version of this code should escape identifiers other than "DOMString",
474            otherwise the idl_text would be non-conforming.
475    
476    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
477    
478            * WebIDL.pm (parse_char_string): Set line/column numbers
479            to generated nodes.  Unescape identifiers.  Extended attributes
480            for Definition's were ignored.
481            (append_child): Set |parent_node| attribute.
482            (parent_node): New attribute.
483            (check): Support interface/exception members.  Support
484            extended attributes.  Support definition identifier uniqueness
485            constraint.
486            (qualified_name): New attribute.
487            (Interface/Exception idl_text): Extended attributes were
488            not prepended to the returned text.
489    
490    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
491    
492            * WebIDL.pm (parse_char_string): Set line/column numbers
493            to interface object experimentally.  s/shift/pop/g, shift
494            would make things wrong.  Support for interface forward
495            declarations was missing.  Broken interface declarations
496            with no block were not ignored entirely.
497            (Whatpm::WebIDL::Node): New abstract class.  This class
498            makes things easier.
499            (child_nodes): New attribute.  Unlike DOM's attribute with
500            same name, this attribute returns a dead list of nodes for
501            simplicity.
502            (get_user_data, set_user_data): New methods.
503            (Module idl_text): A SPACE character should be inserted
504            before the |{| character.
505            (Interface idl_text): Support for interface forward declarations.
506            (is_forward_declaration): New attribute.
507    
508    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
509    
510            * WebIDL.pm (type_text): Better serializer.
511    
512    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
513    
514            * WebIDL.pm: Revise forward-compatible parsing so that
515            it now can handle broken extended attributes and as such.
516    
517    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
518    
519            * WebIDL.pm: Real support for extended attributes.
520            Support for extended attributes with arguments.
521    
522    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
523    
524            * WebIDL.pm: Support for |exception| syntax.
525            (Interface->idl_text): Tentative support for inheritances.
526    
527    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
528    
529            * WebIDL.pm: Hierarchical scoped name support was broken.
530            Support for raises, setraises, and getraises syntaxes.
531    
532    2008-07-18  Wakaba  <wakaba@suika.fam.cx>
533    
534            * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
535            proper support for types, extended attributes, and exceptions yet).
536            WebIDL parser, version 1 (no support for exceptions yet,
537            no proper support for extended attributes yet).
538    
539    2008-07-09  Wakaba  <wakaba@suika.fam.cx>
540    
541            * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
542    
543    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
544    
545            * WebIDL.pm: Support for valuetype and const.
546    
547    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
548            
549            * WebIDL.pm: New module.
550    
551    2008-06-15  Wakaba  <wakaba@suika.fam.cx>
552    
553            * Makefile (Entities.html): URI changed.
554    
555    2008-06-08  Wakaba  <wakaba@suika.fam.cx>
556    
557            * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
558    
559    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
560    
561            * HTML.pm.src (_get_next_token): A parse error was missing.
562    
563    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
564    
565            * mklinktypelist.pl: rel=contact is no longer part of the HTML5
566            spec (commented out). (HTML5 revision 1711).
567    
568    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
569    
570            * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
571    
572            * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
573            as UTF-16 (HTML5 revision 1701).
574    
575    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
576    
577            * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
578            1692).
579    
580    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
581    
582            * HTML.pm.src: The secondary insertion mode used when switching
583            to foreign content is the "in body" insertion mode (HTML5 revision
584            1696).
585    
586    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
587    
588            * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
589            revision 1697).
590    
591    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
592    
593            * HTML.pm.src: Support for end-of-file token in foreign content
594            insertion mode (HTML5 revision 1693).  Update SVG camelCase
595            attribute list (HTML5 revision 1700).  <textarea> closes
596            </select> (HTML5 revision 1699).  More start tags close in
597            foreign content insertion mode (HTML5 revision 1698).
598    
599    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
600    
601            * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
602    
603    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
604    
605            * HTML.pm.src: More robust charset parameter detection (HTML5
606            revision 1674).
607    
608    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
609    
610            * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
611            revision 1676).
612    
613    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
614    
615            * HTML.pm.src: Ignore language part of public identifiers for
616            quriks mode detection (HTML5 revision 1679).
617    
618    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
619    
620            * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
621            revision 1685).
622    
623    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
624    
625            * HTML.pm.src: Support for EOF in new states for tags (HTML5
626            revision 1684).
627    
628    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
629    
630            * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
631            work (HTML5 revision 1690).
632    
633    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
634    
635            * HTML.pm.src (_tree_construction_main): Change handling of
636            end tags in head insertion modes (HTML5 revision 1686).
637            (parse_char_string): Bug fix for non-utf8 character string handlings.
638            (parse_char_stream): |ungetc| does not work well for this context.
639    
640    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
641    
642            * HTML.pm.src (parse_byte_string): Redefined to invoke
643            |parse_byte_stream|.
644            (parse_byte_stream): New method.
645    
646    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
647    
648            * HTML.pm.src (parse_byte_string): Fix the column number reported
649            by encoding layer error reporter.
650    
651    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
652    
653            * HTML.pm.src (parse_byte_string): Use streaming decoder
654            rather than converting the whole byte string and then parsing.
655            Propagate errors in character encoding layer.
656            (get_next_token): Precise error reporting for |bare stago| error.
657    
658    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
659    
660            * HTML.pm.src (parse_char_stream): New method.
661            (parse_char_string): This method is now defined as an invocation
662            of the |parse_char_stream| method.
663    
664    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
665    
666            * HTML.pm.src (parse_byte_string): Report various status
667            of the sniffing as info-level errors.  Support for new
668            decoding framework in parser resestting.
669            (new): Various default error levels were not set.
670    
671    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
672    
673            * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
674            algorithm, except for the actual sniffing, is implemented
675            with new framework with Message::Charset::Info.
676    
677    2008-05-16  Wakaba  <wakaba@suika.fam.cx>
678    
679            * CacheManifest.pm (_parse): Drop fragment identifiers from
680            URIs in fallback section (HTML5 revision 1596).
681    
682    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
683    
684            * Makefile (Entities.html): URI has changed.
685    
686    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
687    
688            * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
689            1553).
690    
691    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
692    
693            * ContentChecker.pm: Noted that those returned in |table| are
694            no longer table elements, but table objects returned
695            by Whatpm::HTMLTable.
696    
697            * HTMLTable.pm (form_table): Return table element node
698            as |$table->{element}|.
699            (assign_header): Support for the |headers=""| attribute.
700    
701    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
702    
703            * HTMLTable.pm (assign_header): New function; first version
704            with no support for headers="".
705            (form_table): Include table width and height to the returned
706            table object for covenience.  Indexing in column assignement was wrong.
707            Set whether a data cell is empty or not for convenience.
708    
709    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
710    
711            * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
712    
713    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
714    
715            * HTMLTable.pm: How table model errors are detected is
716            changed (HTML5 revision 1387).
717    
718    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
719    
720            * HTMLTable.pm: The algorithm now moves |tfoot| elements
721            to the end of the table (HTML5 revision 1380).
722    
723    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
724    
725            * HTMLTable.pm: The algorithm is now 0-based indexing, instead
726            of 1-based (HTML5 revision 1376).
727    
728    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
729    
730            * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
731            and "Content-Encoding" no longer prevent sniffing (HTML5
732            revision 1288).
733    
734    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
735    
736            * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
737            1282).
738    
739    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
740    
741            * ContentChecker.pm: Support for global attributes.
742            Status of XML specs are added.
743    
744    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
745    
746            * ContentChecker.pm (check_element): Support for |template=""|
747            and |ref=""| attribute (referent element type checking).
748    
749    2008-04-29  Wakaba  <wakaba@suika.fam.cx>
750    
751            * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
752            1500) is implemented (except for IDNA part and URI-scheme-specific
753            knowledge).  Line number counting was wrong for LF-only
754            documents.
755    
756    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
757    
758            * HTML.pm.src: Raise an parse error for any disallowed
759            character (HTML5 revision 1263).
760    
761    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
762    
763            * mkentitylist.pl: Support for new HTML5 entity table format (the
764            definition for |AElig;| was missing).
765    
766    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
767    
768            * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
769            name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
770    
771    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
772    
773            * HTML.pm.src: List of element names that close foreign content
774            insertion mode is added (HTML5 revisions 1412 and 1418).
775    
776    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
777    
778            * HTML.pm.src: Support for |mglyph| and |malignmark|
779            elements (HTML5 revision 1410).
780    
781    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
782    
783            * HTML.pm.src: Support for new long MathML entities (HTML5
784            revision 1406).
785    
786    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
787    
788            * HTML.pm.src: CDATA section support for MathML and SVG
789            elements (HTML5 revision 1404 and 1420).
790    
791    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
792    
793            * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
794            elements (HTML5 revision 1404).  Unused !!!macro definitions
795            are removed.
796    
797    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
798    
799            * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
800            are raised is changed (HTML5 revision 1404).
801    
802    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
803    
804            * HTML.pm.src: Category sets that are no longer used are removed.
805    
806    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
807    
808            * HTML.pm.src: The ->[1] property of stack entries are now
809            replaced by constants representing element category.
810    
811    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
812    
813            * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
814            for error reporting.  (This is a preparation for using constant
815            value for ->[1].)
816    
817    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
818    
819            * RDFXML.pm: Typo fixed.
820    
821    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
822    
823            * ContentChecker.pm: |fact_level| is now treated
824            as same as |must_level|, i.e. level = |m|.
825            (check_element): Make list of URIs in the DOM.
826    
827    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
828    
829            * RDFXML.pm: Language accessor implemented.
830            Local (null-namespace) attribute support.
831    
832    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
833    
834            * RDFXML.pm: Factored out ID checking code.
835    
836    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
837    
838            * RDFXML.pm: TODO items noted.  Validation of ID
839            and URI attributes is implemented.  Warn if unknown
840            value is used in rdf:parseType="" attribute.
841    
842            * URIChecker.pm (check_rdf_uri_reference): New function.
843    
844    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
845    
846            * RDFXML.pm: bnodeid implemented.  Relative references
847            are now resolved.
848    
849    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
850    
851            * ContentChecker.pm: RDF reification implemented.
852    
853            * RDFXML.pm: undef vs false bug fixed.
854            Reification implemented.
855    
856    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
857    
858            * RDFXML.pm: s/id/ID/ for attribute name.
859            The |node| arguments are added for |ontriple| calls.
860            Too many "attribute not allowed" errors were raised.
861    
862            * ContentChecker.pm: Initial experimental support for rdf:RDF
863            element.
864    
865    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
866    
867            * RDFXML.pm: New module.
868    
869    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
870    
871            * HTML.pm.src (set_inner_html): Line/column number
872            code was old one yet.
873    
874    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
875    
876            * HTML.pm.src: Better line/column reporting for "duplicate attribute"
877            errors.  Line/column markings for DOCTYPE, comment, and
878            character tokens are reintroduced; otherwise,
879            error location for "not HTML5" error and errors
880            for implied elements are not attached.
881    
882    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
883    
884            * HTML.pm.src: Set line/column numbers to attributes.
885    
886            * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
887            (value): Setter implemented.
888    
889            * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
890    
891    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
892    
893            * HTML.pm.src: Unused line/column markings are removed.
894    
895    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
896    
897            * HTML.pm.src (_get_next_token): Remove |first_start_tag|
898            flag, which is no longer used.
899    
900    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
901    
902            * HTML.pm.src: Set line/column information to element nodes.
903    
904            * mkhtmlparser.pl (!!!create-element, !!!insert-element,
905            and !!!insert-element-t): Set line/column information to
906            element nodes.
907    
908    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
909    
910            * HTML.pm.src (_get_next_token): The first "<" character
911            in "<?", "<>", or "</>" should be the error point.
912    
913    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
914    
915            * HTML.pm.src: Some more fixes on error position reporting.
916    
917    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
918    
919            * HTML.pm.src: Token-level precious error reporting.
920    
921    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
922    
923            * HTML.pm.src: Preparation for more precious error point
924            reporting.
925    
926    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
927    
928            * HTML.pm.src: Error type revised.
929    
930    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
931    
932            * HTML.pm.src: Similar codes are merged together, again.
933    
934    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
935    
936            * HTML.pm.src: Similar codes are merged together.
937    
938    2008-03-10  Wakaba  <wakaba@suika.fam.cx>
939    
940            * mkhtmlparser.pl: Set "level" parameter to parse errors.
941    
942            * HTML.pm.src: Code refined.
943    
944    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
945    
946            * HTML.pm.src: |</body>| treatement has been changed (HTML5
947            revision 1348).  Note that I really don't know this makes
948            any difference in the black-box behavior of the parser.
949    
950    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
951    
952            * HTML.pm.src: New end-of-file token implementation (HTML5
953            revision 1348).
954    
955    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
956    
957            * HTML.pm.src: |applet| support (HTML5 revision 1347).
958    
959    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
960    
961            * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
962    
963    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
964    
965            * HTML.pm.src: Support for |<input>| in the "in select" insertion
966            mode and support for the "in select in table" insertion mode (HTML5
967            revision 1342).
968    
969    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
970    
971            * HTML.pm.src: No foster parenting for <script> and <script>
972            in non-tainted <table>s (HTML5 revision 1336).
973    
974    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
975    
976            * HTML.pm.src: Ignore white space characters between <html>
977            and <head> (HTML5 revision 1332).
978    
979    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
980    
981            * HTML.pm.src: Treat <input type=hidden> as if it were a
982            white space (HTML5 revision 1331).
983    
984    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
985    
986            * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
987            element (HTML5 revision 1330).
988    
989    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
990    
991            * HTML.pm.src: <title> is always appended to the current
992            element (HTML5 revision 1328).
993    
994    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
995    
996            * HTML.pm.src: White space in tainted tables are moved
997            into foster parents (HTML5 revision 1326).
998    
999    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1000    
1001            * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
1002            revision 1321).
1003    
1004    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1005    
1006            * HTML.pm.src: |</h/n/>| case code rearranged to align with
1007            the spec (HTML5 revision 1320).  Note that we finally complete
1008            all of HTML5 revision 1320 changes.
1009    
1010    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1011    
1012            * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
1013            tags (HTML5 revision 1320).
1014    
1015    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1016    
1017            * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
1018    
1019    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1020    
1021            * HTML.pm.src: A "generate implied end tags" code (t409.1)
1022            could not be reached so that it is now removed (HTML5 revision
1023            1320).
1024    
1025    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1026    
1027            * HTML.pm.src: Code for the case of |</div>| and so on are revised
1028            to align with new spec text (HTML5 revision 1320).
1029    
1030    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1031    
1032            * HTML.pm.src: Remove strange |if| condition; however, it should
1033            have had no harm in theory.
1034    
1035    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1036    
1037            * HTML.pm.src (_tree_construction_main): '</p>' in body
1038            case is split from other end tags for the preparation
1039            of implementing HTML revision 1320.
1040    
1041  2008-03-07  Wakaba  <wakaba@suika.fam.cx>  2008-03-07  Wakaba  <wakaba@suika.fam.cx>
1042    
1043          * HTML.pm.src: Simplified "generate implied end tag" (HTML5          * HTML.pm.src: Simplified "generate implied end tag" (HTML5

Legend:
Removed from v.1.163  
changed lines
  Added in v.1.333

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24