Diff of /markup/html/whatpm/Whatpm/ChangeLog

revision 1.183 by wakaba, Mon Mar 10 10:55:56 2008 UTC revision 1.337 by wakaba, Sat Oct 4 05:53:45 2008 UTC
# Line 1  Line 1 
1    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
4            revision 1731).  End tags of |option| and |optgroup| elements are
5            now optional (HTML5 revision 1731).
6    
7    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
8    
9            * HTML.pm.src: New "special" elements added to the list (HTML5
10            revision 1778).  "strile" -> "strike".
11    
12    2008-10-02  Wakaba  <wakaba@suika.fam.cx>
13    
14            * ContentType.pm (get_sniffed_type): Support for the "better"
15            content sniffing (HTML5 revision 1927).  Fix a case in which the
16            official type was not returned when the method was invoked in
17            list context.
18    
19    2008-09-22  Wakaba  <wakaba@suika.fam.cx>
20    
21            * HTML.pm.src: Character references for non-space C0 characters
22            (including U+000B VT), the DEL character, and noncharacter code points
23            are now converted to the U+FFFD character (cf. HTML5 revision 2138).
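
A minimal Python sketch of the replacement rule described in this entry (the module itself is written in Perl). The helper names and the exact set of code points are assumptions made for illustration; the tokenizer in HTML.pm.src may differ in detail.

```python
# Hypothetical helpers, not taken from HTML.pm.src.
SPACE_C0 = {0x09, 0x0A, 0x0C, 0x0D}  # C0 characters still treated as spaces


def is_noncharacter(cp: int) -> bool:
    """True for U+FDD0..U+FDEF and for the last two code points of each plane."""
    return 0xFDD0 <= cp <= 0xFDEF or (cp & 0xFFFE) == 0xFFFE


def resolve_char_ref(cp: int) -> str:
    """Return the character for a numeric character reference, or U+FFFD."""
    if cp < 0x20 and cp not in SPACE_C0:  # non-space C0, including U+000B VT
        return "\uFFFD"
    if cp == 0x7F:  # DEL
        return "\uFFFD"
    # Assumption: out-of-range values are replaced too, so chr() cannot fail.
    if cp > 0x10FFFF or is_noncharacter(cp):
        return "\uFFFD"
    return chr(cp)
```

For example, resolve_char_ref(0x0B) yields U+FFFD, while resolve_char_ref(0x0A) still yields a line feed.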
24    
25    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
26    
27            * ContentChecker.pm: |form=""| check support added.
28    
29    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
30    
31            * ContentChecker.pm: |contextmenu| validity is now checked using
32            the |id| and |id_type| properties, and the |menu| property is removed.
33    
34    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
35    
36            * ContentChecker.pm: Prepare for duplicate checking of the |name|
37            attribute of |form| elements.
38    
39    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
40    
41            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
42            unsupported charset="" parameter value (HTML5 revision 2131).
43    
44    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
45    
46            * HTML.pm.src: Remaining places where U+000B was allowed as a space
47            character are fixed (cf. HTML5 revision 1738).
48    
49            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
50            space characters (HTML5 revision 1738).
51    
52    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
53    
54            * HTML.pm.src: The "anything else" case for the "after after body"
55            insertion mode was not updated to switch to the "in body"
56            insertion mode.  U+000B is no longer a space character for the
57            purpose of tree construction phase (HTML5 revision 1738).
58    
59    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
60    
61            * HTML.pm.src: U+000B is no longer a space character (HTML5
62            revision 1738).
63    
64    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
65    
66            * ContentType.pm: 0x0B is no longer a space character (HTML5
67            revision 1738).
68    
69            * HTML.pm.src: U+000B is no longer a space character for the
70            algorithm for extracting an encoding from a Content-Type (HTML5
71            revision 1738).
72    
73    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
74    
75            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
76    
77    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
78    
79            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
80    
81            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
82            matching.
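
A minimal Python sketch of why sorting 'Prefix' values by length can ease matching: if the longest values come first, the first hit is also the most specific one. The sort direction, the function names, and the matching rule are assumptions for illustration and are not taken from mklangreg.pl or LangTag.pm.

```python
def sort_prefixes(prefixes):
    """Longest prefix first; ties are broken alphabetically."""
    return sorted(prefixes, key=lambda p: (-len(p), p))


def first_matching_prefix(tag, prefixes):
    """Return the first (longest) prefix that the tag starts with on a
    subtag boundary, or None if no prefix matches."""
    tag = tag.lower()
    for prefix in sort_prefixes(prefixes):
        p = prefix.lower()
        if tag == p or tag.startswith(p + "-"):
            return prefix
    return None
```

With the prefixes "sr", "sr-Latn", and "sr-Cyrl", the tag "sr-Latn-ekavsk" is matched against "sr-Latn" before the shorter "sr" is tried.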
83    
84    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
85    
86            * LangTag.pm: Warn for private use language subtags.  Error level
87            typos fixed.  Support for Suppress-Script field.
88    
89            * mklangreg.pl: Support for dumping of nested structure.
90    
91    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
92    
93            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
94            recommended case as per RFC 4646.
95    
96    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
97    
98            * LangTag.pm (check_rfc4646_langtag): New method.
99    
100    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
101    
102            * mklangreg.pl: New script.
103    
104            * Makefile: Updated for creation of the module for language subtag
105            registry.
106            
107    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
108    
109            * Makefile: WebIDL.html added.
110    
111            * WebIDL.pod: New documentation.
112    
113    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
114    
115            * WebIDL.pm: Checker's error types are redefined.
116    
117    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
118    
119            * WebIDL.pm: Parser's error types are redefined.  Some forward
120            compatible parsing bugs are fixed.  Some unreachable code is
121            commented out.
122    
123    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
124    
125            * WebIDL.pm: Support for the remaining extended attributes is
126            added.  A forward interface declaration that has an extended
127            attribute does not satisfy the definition.  It seems that, unless
128            explicitly allowed, multiple extended attributes with the same
129            name are not allowed, though this is not explicitly mentioned in
130            the spec.
131    
132    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
133    
134            * WebIDL.pm: Unescape extended attribute names and extended
135            attribute identifiers.  Preserve whether an extended attribute has
136            an argument list or not.  Support for extended attributes:
137            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
138            NameGetter, NameSetter, and Null.
139            (has_argument_list): New attribute.
140            (idl_text): Stringify the argument list, if any, even if it is
141            empty.
142    
143    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
144    
145            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
146            |{s_kwd}| in DATA_STATE as default.
147    
148    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
149    
150            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
151            by |{s_kwd}| in DATA_STATE.
152    
153    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
154    
155            * HTML.pm.src: Shorten keys.
156    
157    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
158    
159            * HTML.pm.src: Remove checking for control character, surrogate
160            pair, or noncharacter code points and non-Unicode code
161            points (they should be handled by Whatpm::Charset::UnicodeChecker).
162            (parse_char_stream): Support for the |$get_wrapper| argument and
163            character stream error handlers.
164    
165    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
166    
167            * ContentChecker.pm: Don't call |loda_ns_module|
168            for null-namespace elements/attributes.
169    
170            * HTML.pm.src: Factor out $disallowed_control_chars
171            as a hash.
172    
173    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
174    
175            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
176            and |{next_char}| initializations are moved to initialization
177            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
178            with |parse_char_stream|.
179    
180    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
181    
182            * HTML.pm.src (parse_char_stream): Make |set_next_char|
183            invoke |manakai_read_until|, not only |read|, where
184            possible, to decrease the number of |read| method calls.
185    
186            * mkhtmlparser.pl: Related changes to the aforementioned
187            modification.
188    
189    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
190    
191            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
192            now reports character errors.
193    
194    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
195    
196            * HTML.pm.src: Character tokens consisting of leading white space
197            followed by non-white-space characters were not correctly handled
198            in the "before head" insertion mode.
199            (set_inner_html): Reimplemented using CharString decodehandle
200            class.  Support for $get_wrapper argument.  Support
201            for |{read_until}| feature.
202    
203    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
204    
205            * HTML.pm.src: Make a "bare ero" error for unknown
206            entities point to the "&" character.
207    
208    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
209    
210            * HTML.pm.src: It turns out that U+FFFD doesn't have to
211            be added to the list of excluded characters.
212    
213    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
214    
215            * HTML.pm.src ($char_onerror): Give the character decoder's |line|
216            and |column| a higher priority than those set by the
217            tokenizer's input handler.
218            ($self->{read_until}): Exclude U+FFFD (but this might
219            not be necessary, since now we do line/column fixup in
220            the character decode handle).
221    
222    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
223    
224            * HTML.pm.src: Use |{read_until}| where possible.
225    
226    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
227    
228            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
229            and |manakai_getc_until| to |manakai_read_until| to
230            reduce the number of string copies.
231    
232    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
233    
234            * HTML.pm.src (parse_char_string): Use newly created
235            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
236            standard feature to |open| a string as a filehandle,
237            since Perl's string filehandle does not seem to support the |ungetc|
238            method correctly.
239            (parse_char_stream): Define |{getc_until}| method.
240            (DATA_STATE): Experimental support for |getc_until| feature.
241    
242    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
243    
244            * HTML.pm.src: Check points added to newly added branches.
245    
246    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
247    
248            * HTML.pm.src: Remove |{char}|, which is no longer used.
249            Replace |{entity_in_attr}| and |{last_attribute_value_state}|
250            by |{prev_state}|.
251    
252            * mkhtmlparser.pl: Remove |{char}| feature.
253            Remove |!!!back-next-input-character;| macro.
254    
255    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
256    
257            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
258            entity related tokenizer states in favor of new states
259            implementing the consume character reference algorithm.
260    
261    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
262    
263            * HTML.pm.src: "Consume a character reference" algorithm is
264            now implemented as a tokenizer's state, rather than
265            a method, with minimum changes (more changes will
266            be made, in due course).  "Bogus comment state"'s inner
267            loop gets removed.
268    
269    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
270    
271            * HTML.pm.src: Move |PUBLIC| and |SYSTEM| keyword tokenizing
272            into their own tokenizer states.
273    
274    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
275    
276            * HTML.pm.src: |CDATA_SECTION_STATE| (formerly |CDATA_BLOCK_STATE|)
277            is split into three states.
278    
279    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
280    
281            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
282            itself and a new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
283            the tokenizer no longer has to push back next input
284            characters in those states.
285    
286    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
287    
288            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| is broken
289            into four states so that the tokenizer no longer has to push
290            back next input characters in that state.
291    
292    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
293    
294            * HTML.pm.src: Methods now accept an additional parameter, $get_wrapper,
295            which can be used to insert some wrapper between the character
296            stream handle and the tokenizer.  (It is currently not supported
297            for |set_inner_html| for |Element|s).
298    
299    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
300    
301            * HTML.pm.src: Ignore punctuation in charset names.
302    
303    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
304    
305            * ContentChecker.pm: Support for charset-layer error levels.
306    
307            * HTML.pm.src: Don't specify |text| argument for the
308            |chardecode:fallback| error, since it is not the encoding
309            being used alternatively.
310    
311    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
312    
313            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
314    
315    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
316    
317            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
318    
319    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
320    
321            * HTML.pm.src: Bug fix and sync with the spec with regard
322            to after after frameset insertion mode processing (HTML5
323            revision 1909).  Note that the implementation was wrong
324            per the old spec before the r1909 changes.
325    
326    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
327    
328            * HTMLTable.pm: scope=auto algorithm fix synced with the
329            spec (HTML5 revision 2093).
330            ($process_row): Algorithm step numbers synced with the
331            spec (HTML5 revision 2092).
332    
333    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
334    
335            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
336            revision 2094).
337    
338    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
339    
340            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
341    
342    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
343    
344            * HTML.pm.src: '"' and "'" at the end of attribute
345            name (after another attribute) now raise parse error (HTML5
346            revision 2123).  Empty unquoted attribute values are no
347            longer allowed (HTML5 revision 2122).
348    
349    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
350    
351            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
352            revision 2130).
353    
354    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
355    
356            * ContentChecker.pm: |xml:lang| attribute value must be the same
357            as |lang| attribute value for HTML elements (HTML5 revision 2062
358            and so on).
359    
360    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
361    
362            * ContentChecker.pm: Error level definition for |xml_id_error|
363            was missing.
364    
365            * URIChecker.pm: The end of the URL should be marked as the
366            error location for an empty path error.  The position
367            between the userinfo and the port components should be
368            marked as the error location for an empty host error.
369    
370    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
371    
372            * URIChecker.pm: Set parameters representing where in the
373            value the error occurs for errors.  Report unknown
374            address format error in warning level, since address
375            formats are rarely added.  Path segments starting with "/.."
376            were misinterpreted as a dot-segment.
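
A minimal Python sketch of the distinction behind the dot-segment fix in this entry: in RFC 3986 a dot-segment is a complete path segment equal to "." or "..", so a segment that merely starts with ".." is not one. The function name is hypothetical and the code is not taken from URIChecker.pm.

```python
def dot_segments(path):
    """Return the segments of a URI path that are exactly "." or ".."."""
    return [seg for seg in path.split("/") if seg in (".", "..")]
```

Here dot_segments("/a/../b") finds one dot-segment, while dot_segments("/a/..b/c") finds none.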
377    
378    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
379    
380            * URIChecker.pm (check_iri_reference): Requires
381            |Message::DOM::DOMImplementation|.
382    
383    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
384    
385            * IMTChecker.pm: Updated for the new error reporting architecture.
386    
387            * ContentChecker.pm: Error levels for IMTs are added.
388    
389    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
390    
391            * H2H.pm (_shift_token): Support for unquoted HTML attribute
392            values.
393    
394    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
395    
396            * CacheManifest.pm: Support for new style of error
397            reports.
398    
399            * HTML.pm.src: Set line=1, column=1 to the document node.
400    
401    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
402    
403            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
404            and URL checkers.  Support for more error levels for bogus
405            language tag and URL "standards".
406    
407            * LangTag.pm, URIChecker.pm: Support for new style error
408            level reporting.
409    
410    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
411    
412            * ContentChecker.pm: Support for RDF/XML error levels.
413    
414            * HTMLTable.pm, RDFXML.pm: Support for new style of error level
415            specifying.  Error types are revised.
416    
417    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
418    
419            * ContentChecker.pm: All error reporting method calls are
420            renewed.
421    
422    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
423    
424            * HTML.pm.src: All error type names and "text" parameters
425            are revised.  Use new style for "level" specification.
426    
427            * mkhtmlparser.pl: Use new style for "level" specification.
428    
429    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
430    
431            * WebIDL.pm (parse_char_string): Simplified error
432            reporting process for broken ignored valuetype definition.
433            (Valuetype idl_text): Support for special "DOMString" name.
434    
435    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
436    
437            * WebIDL.pm ($get_scoped_name): Append "::::" if the last
438            terminal of the ScopedName is "DOMString", so that whether the
439            last part of the scoped name is "DOMString" or "_DOMString" can
440            be determined later.  This is necessary to determine whether a
441            |typedef| definition should be ignored or not.
442            (parse_char_string): Unescape the identifier of
443            exception members.
444            ($resolve): Return undef for builtin types and sequence<T>
445            types (we might not have to do this, however...).
446            (check): Support checking for Exceptions, Valuetypes,
447            and Typedefs.
448            ($serialize_type): Support for "DOMString::::" syntax.
449            (Typedef idl_text): Output Type as "DOMString" if it
450            is really "DOMString" (i.e. its internal representation
451            is "::DOMString::").
452    
453    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
454    
455            * WebIDL.pm ($resolve): New code, based on resolve code
456            for constant types in the |check| method.
457            (check): Support for checking of attributes, operations, and
458            arguments.
459            (Attribute/Operation idl_text): Exception names in getraises,
460            setraises, and raises clauses are serialized by the |$serialize_type|
461            code.
462    
463    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
464    
465            * WebIDL.pm ($integer): Order of alternatives is changed to match
466            hexadecimal numbers (the original pattern, taken from the spec,
467            did not work for hexadecimal numbers, because after the "0" prefix
468            the [0-7]* part matches an empty string and therefore the
469            pattern does not match the remaining "x..." part of a "0x..."
470            integer literal).
471            ($get_type): It now returns a string, not an array reference,
472            for regular types and |sequence| types (i.e. it in any case
473            returns a string).
474            ($get_next_token): The second item in the array that represents
475            an integer or float token is now a Perl number value, not the
476            original string representation of the number.
477            (check): Support for const value consistency checking.
478            No extended attribute is defined for constants.
479            (Node subclasses): Use simple strings rather than array references
480            for default data type values.
481            ($serialize_type): Type values are now simple strings.
482            (value): If the new attribute value is a false value, then
483            a FALSE value is set to the attribute.
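
A minimal Python sketch of the alternation-order problem described for $integer in this entry. Both patterns are reconstructions based on the entry's description and are assumptions for illustration; the actual pattern in WebIDL.pm may differ. Python's regex alternation, like Perl's, tries alternatives from left to right.

```python
import re

# Octal alternative first: after "0", [0-7]* succeeds with an empty match,
# so the hexadecimal alternative is never tried.
OCTAL_FIRST = re.compile(r"-?(?:0(?:[0-7]*|[Xx][0-9A-Fa-f]+)|[1-9][0-9]*)")

# Hexadecimal alternative first: "0x..." literals are matched in full.
HEX_FIRST = re.compile(r"-?(?:0(?:[Xx][0-9A-Fa-f]+|[0-7]*)|[1-9][0-9]*)")


def matched_prefix(pattern, text):
    """Return the part of text matched at its start, or None."""
    m = pattern.match(text)
    return m.group(0) if m else None
```

With the octal alternative first, only the leading "0" of "0x1F" is consumed; with the hexadecimal alternative first, the whole literal is consumed, and octal and decimal literals still match as before.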
484    
485    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
486    
487            * WebIDL.pm ($get_scoped_name): Now scoped names are stored
488            in their stringified format ("scoped name" as defined in the
489            spec).  Note that a future version of this module should not use
490            array references for type values and the |type_text| attribute
491            should be made obsolete.
492            (parse_char_string): Unescape attribute names.
493            (check): Support for checking of whether inherited interfaces
494            are actually defined or not.  Support for checking of whether
495            interface member identifiers are duplicated or not.
496            ($serialize_type): Scoped names are returned as is.  A future
497            version of this code should escape identifiers other than "DOMString",
498            otherwise the idl_text would be non-conforming.
499    
500    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
501    
502            * WebIDL.pm (parse_char_string): Set line/column numbers
503            to generated nodes.  Unescape identifiers.  Extended attributes
504            for Definition's were ignored.
505            (append_child): Set |parent_node| attribute.
506            (parent_node): New attribute.
507            (check): Support interface/exception members.  Support
508            extended attributes.  Support definition identifier uniqueness
509            constraint.
510            (qualified_name): New attribute.
511            (Interface/Exception idl_text): Extended attributes were
512            not prepended to the returned text.
513    
514    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
515    
516            * WebIDL.pm (parse_char_string): Set line/column numbers
517            to interface object experimentally.  s/shift/pop/g, shift
518            would make things wrong.  Support for interface forward
519            declarations was missing.  Broken interface declarations
520            with no block were not ignored entirely.
521            (Whatpm::WebIDL::Node): New abstract class.  This class
522            makes things easier.
523            (child_nodes): New attribute.  Unlike DOM's attribute with
524            the same name, this attribute returns a dead list of nodes for
525            simplicity.
526            (get_user_data, set_user_data): New methods.
527            (Module idl_text): A SPACE character should be inserted
528            before the |{| character.
529            (Interface idl_text): Support for interface forward declarations.
530            (is_forward_declaration): New attribute.
531    
532    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
533    
534            * WebIDL.pm (type_text): Better serializer.
535    
536    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
537    
538            * WebIDL.pm: Revise forward-compatible parsing so that
539            it now can handle broken extended attributes and the like.
540    
541    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
542    
543            * WebIDL.pm: Real support for extended attributes.
544            Support for extended attributes with arguments.
545    
546    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
547    
548            * WebIDL.pm: Support for |exception| syntax.
549            (Interface->idl_text): Tentative support for inheritances.
550    
551    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
552    
553            * WebIDL.pm: Hierarchical scoped name support was broken.
554            Support for raises, setraises, and getraises syntaxes.
555    
556    2008-07-18  Wakaba  <wakaba@suika.fam.cx>
557    
558            * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
559            proper support for types, extended attributes, and exceptions yet).
560            WebIDL parser, version 1 (no support for exceptions yet,
561            no proper support for extended attributes yet).
562    
563    2008-07-09  Wakaba  <wakaba@suika.fam.cx>
564    
565            * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
566    
567    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
568    
569            * WebIDL.pm: Support for valuetype and const.
570    
571    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
572            
573            * WebIDL.pm: New module.
574    
575    2008-06-15  Wakaba  <wakaba@suika.fam.cx>
576    
577            * Makefile (Entities.html): URI changed.
578    
579    2008-06-08  Wakaba  <wakaba@suika.fam.cx>
580    
581            * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
582    
583    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
584    
585            * HTML.pm.src (_get_next_token): A parse error was missing.
586    
587    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
588    
589            * mklinktypelist.pl: rel=contact is no longer part of the HTML5
590            spec (commented out). (HTML5 revision 1711).
591    
592    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
593    
594            * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
595    
596            * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
597            as UTF-16 (HTML5 revision 1701).
598    
599    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
600    
601            * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
602            1692).
603    
604    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
605    
606            * HTML.pm.src: The secondary insertion mode used when switching
607            to foreign content is the "in body" insertion mode (HTML5 revision
608            1696).
609    
610    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
611    
612            * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
613            revision 1697).
614    
615    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
616    
617            * HTML.pm.src: Support for end-of-file token in foreign content
618            insertion mode (HTML5 revision 1693).  Update SVG camelCase
619            attribute list (HTML5 revision 1700).  <textarea> closes
620            </select> (HTML5 revision 1699).  More start tags close in
621            foreign content insertion mode (HTML5 revision 1698).
622    
623    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
624    
625            * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
626    
627    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
628    
629            * HTML.pm.src: More robust charset parameter detection (HTML5
630            revision 1674).
631    
632    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
633    
634            * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
635            revision 1676).
636    
637    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
638    
639            * HTML.pm.src: Ignore language part of public identifiers for
640            quirks mode detection (HTML5 revision 1679).
641    
642    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
643    
644            * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
645            revision 1685).
646    
647    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
648    
649            * HTML.pm.src: Support for EOF in new states for tags (HTML5
650            revision 1684).
651    
652    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
653    
654            * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
655            work (HTML5 revision 1690).
656    
657    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
658    
659            * HTML.pm.src (_tree_construction_main): Change handling of
660            end tags in head insertion modes (HTML5 revision 1686).
661            (parse_char_string): Bug fix for non-utf8 character string handling.
662            (parse_char_stream): |ungetc| does not work well for this context.
663    
664    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
665    
666            * HTML.pm.src (parse_byte_string): Redefined to invoke
667            |parse_byte_stream|.
668            (parse_byte_stream): New method.
669    
670    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
671    
672            * HTML.pm.src (parse_byte_string): Fix the column number reported
673            by encoding layer error reporter.
674    
675    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
676    
677            * HTML.pm.src (parse_byte_string): Use streaming decoder
678            rather than converting the whole byte string and then parsing.
679            Propagate errors in character encoding layer.
680            (get_next_token): Precise error reporting for |bare stago| error.
681    
682    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
683    
684            * HTML.pm.src (parse_char_stream): New method.
685            (parse_char_string): This method is now defined as an invocation
686            of the |parse_char_stream| method.
687    
688    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
689    
690            * HTML.pm.src (parse_byte_string): Report various status
691            of the sniffing as info-level errors.  Support for new
692            decoding framework in parser resetting.
693            (new): Various default error levels were not set.
694    
695    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
696    
697            * HTML.pm.src (parse_byte_string): HTML5 encoding sniffing
698            algorithm, except for the actual sniffing, is implemented
699            with new framework with Message::Charset::Info.
700    
701    2008-05-16  Wakaba  <wakaba@suika.fam.cx>
702    
703            * CacheManifest.pm (_parse): Drop fragment identifiers from
704            URIs in fallback section (HTML5 revision 1596).
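
A minimal Python sketch of dropping a fragment identifier from a URI, as this entry describes for the fallback section. It uses Python's standard urllib.parse.urldefrag and is not how CacheManifest.pm (a Perl module) implements it; the function name is hypothetical.

```python
from urllib.parse import urldefrag


def drop_fragment(uri):
    """Return the URI without its "#fragment" part, if it has one."""
    return urldefrag(uri).url
```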
705    
706    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
707    
708            * Makefile (Entities.html): URI has changed.
709    
710    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
711    
712            * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
713            1553).
714    
715    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
716    
717            * ContentChecker.pm: Noted that those returned in |table| are
718            no longer table elements, but table objects returned
719            by Whatpm::HTMLTable.
720    
721            * HTMLTable.pm (form_table): Return table element node
722            as |$table->{element}|.
723            (assign_header): Support for the |headers=""| attribute.
724    
725    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
726    
727            * HTMLTable.pm (assign_header): New function; first version
728            with no support for headers="".
729            (form_table): Include table width and height in the returned
730            table object for convenience.  Indexing in column assignment was wrong.
731            Set whether a data cell is empty or not for convenience.
732    
733    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
734    
735            * HTMLTable.pm: More robust caption support (HTML5 revision 1393).
736    
737    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
738    
739            * HTMLTable.pm: How table model errors are detected is
740            changed (HTML5 revision 1387).
741    
742    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
743    
744            * HTMLTable.pm: The algorithm now moves |tfoot| elements
745            to the end of the table (HTML5 revision 1380).
746    
747    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
748    
749            * HTMLTable.pm: The algorithm is now 0-based indexing, instead
750            of 1-based (HTML5 revision 1376).
751    
752    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
753    
754            * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
755            and "Content-Encoding" no longer prevent sniffing (HTML5
756            revision 1288).
757    
758    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
759    
760            * ContentType.pm: Skip BOMs in feed or HTML algorithm (HTML5 revision
761            1282).
762    
763    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
764    
765            * ContentChecker.pm: Support for global attributes.
766            Status of XML specs are added.
767    
768    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
769    
770            * ContentChecker.pm (check_element): Support for |template=""|
771            and |ref=""| attributes (referent element type checking).
772    
773    2008-04-29  Wakaba  <wakaba@suika.fam.cx>
774    
775            * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
776            1500) is implemented (except for IDNA part and URI-scheme-specific
777            knowledge).  Line number counting was wrong for LF-only
778            documents.
779    
780    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
781    
782            * HTML.pm.src: Raise a parse error for any disallowed
783            character (HTML5 revision 1263).
784    
785    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
786    
787            * mkentitylist.pl: Support for new HTML5 entity table format (the
788            definition for |AElig;| was missing).
789    
790    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
791    
792            * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
793            name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
794    
795    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
796    
797            * HTML.pm.src: List of element names that close foreign content
798            insertion mode is added (HTML5 revisions 1412 and 1418).
799    
800    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
801    
802            * HTML.pm.src: Support for |mglyph| and |malignmark|
803            elements (HTML5 revision 1410).
804    
805    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
806    
807            * HTML.pm.src: Support for new long MathML entities (HTML5
808            revision 1406).
809    
810    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
811    
812            * HTML.pm.src: CDATA section support for MathML and SVG
813            elements (HTML5 revisions 1404 and 1420).
814    
815    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
816    
817            * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
818            elements (HTML5 revision 1404).  Unused !!!macro definitions
819            are removed.
820    
821    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
822    
823            * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
824            are raised is changed (HTML5 revision 1404).
825    
826    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
827    
828            * HTML.pm.src: Category sets that are no longer used are removed.
829    
830    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
831    
832            * HTML.pm.src: The ->[1] property of stack entries is now
833            replaced by constants representing element category.
834    
835    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
836    
837            * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
838            for error reporting.  (This is a preparation for using constant
839            value for ->[1].)
840    
841    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
842    
843            * RDFXML.pm: Typo fixed.
844    
845    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
846    
847            * ContentChecker.pm: |fact_level| is now treated
848            the same as |must_level|, i.e. level = |m|.
849            (check_element): Make list of URIs in the DOM.
850    
851    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
852    
853            * RDFXML.pm: Language accessor implemented.
854            Local (null-namespace) attribute support.
855    
856    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
857    
858            * RDFXML.pm: Factored out ID checking code.
859    
860    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
861    
862            * RDFXML.pm: TODO items noted.  Validation of ID
863            and URI attributes is implemented.  Warn if unknown
864            value is used in rdf:parseType="" attribute.
865    
866            * URIChecker.pm (check_rdf_uri_reference): New function.
867    
868    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
869    
870            * RDFXML.pm: bnodeid implemented.  Relative references
871            are now resolved.
872    
873    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
874    
875            * ContentChecker.pm: RDF reification implemented.
876    
877            * RDFXML.pm: undef vs false bug fixed.
878            Reification implemented.
879    
880    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
881    
882            * RDFXML.pm: s/id/ID/ for attribute name.
883            The |node| arguments are added for |ontriple| calls.
884            Too many "attribute not allowed" errors were raised.
885    
886            * ContentChecker.pm: Initial experimental support for rdf:RDF
887            element.
888    
889    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
890    
891            * RDFXML.pm: New module.
892    
893    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
894    
895            * HTML.pm.src (set_inner_html): Line/column number
896            code was still the old one.
897    
898    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
899    
900            * HTML.pm.src: Better line/column reporting for "duplicate attribute"
901            errors.  Line/column markings for DOCTYPE, comment, and
902            character tokens are reintroduced; otherwise,
903            error location for "not HTML5" error and errors
904            for implied elements are not attached.
905    
906    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
907    
908            * HTML.pm.src: Set line/column numbers to attributes.
909    
910            * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
911            (value): Setter implemented.
912    
913            * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
914    
915    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
916    
917            * HTML.pm.src: Unused line/column markings are removed.
918    
919    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
920    
921            * HTML.pm.src (_get_next_token): Remove |first_start_tag|
922            flag, which is no longer used.
923    
924    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
925    
926            * HTML.pm.src: Set line/column information to element nodes.
927    
928            * mkhtmlparser.pl (!!!create-element, !!!insert-element,
929            and !!!insert-element-t): Set line/column information to
930            element nodes.
931    
932    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
933    
934            * HTML.pm.src (_get_next_token): The first "<" character
935            in "<?", "<>", or "</>" should be the error point.
936    
937    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
938    
939            * HTML.pm.src: Some more fixes on error position reporting.
940    
941    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
942    
943            * HTML.pm.src: Token-level precise error reporting.
944    
945    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
946    
947            * HTML.pm.src: Preparation for more precise error point
948            reporting.
949    
950    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
951    
952            * HTML.pm.src: Error type revised.
953    
954    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
955    
956            * HTML.pm.src: Similar code is merged together, again.
957    
958    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
959    
960            * HTML.pm.src: Similar code is merged together.
961    
962    2008-03-10  Wakaba  <wakaba@suika.fam.cx>
963    
964            * mkhtmlparser.pl: Set "level" parameter to parse errors.
