Diff of /markup/html/whatpm/Whatpm/ChangeLog

revision 1.163 by wakaba, Thu Mar 6 15:56:52 2008 UTC revision 1.336 by wakaba, Sat Oct 4 04:06:32 2008 UTC
# Line 1  Line 1 
1    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src: New "special" elements added to the list (HTML5
4            revision 1778).  "strile" -> "strike".
5    
6    2008-10-02  Wakaba  <wakaba@suika.fam.cx>
7    
8            * ContentType.pm (get_sniffed_type): Support for the "better"
9            content sniffing (HTML5 revision 1927).  In some cases the official
10            type was not returned when the method was invoked in list
11            context.
12    
13    2008-09-22  Wakaba  <wakaba@suika.fam.cx>
14    
15            * HTML.pm.src: Character references for non-space C0 characters,
16            including U+000B VT, DEL character, noncharacter code points, are
17            now converted to the U+FFFD character (cf. HTML5 revision 2138).
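
The mapping described in the 2008-09-22 entry above can be sketched roughly as follows. This is a hypothetical Python illustration, not the code in HTML.pm.src: the names `is_noncharacter` and `resolve_char_ref` are invented here, and sending code points above U+10FFFF to U+FFFD is an assumption the entry does not state.

```python
REPLACEMENT = "\uFFFD"

# C0 controls that count as space characters and are therefore kept.
C0_SPACE = {0x09, 0x0A, 0x0C, 0x0D}


def is_noncharacter(cp):
    """True for U+FDD0..U+FDEF and for the last two code points of each plane."""
    return 0xFDD0 <= cp <= 0xFDEF or (cp & 0xFFFE) == 0xFFFE


def resolve_char_ref(cp):
    """Return the character for a numeric character reference (sketch)."""
    if cp > 0x10FFFF:
        # Assumption: out-of-range values are also replaced.
        return REPLACEMENT
    if cp in C0_SPACE:
        return chr(cp)
    if cp < 0x20 or cp == 0x7F or is_noncharacter(cp):
        # Non-space C0 (including U+000B VT), DEL, and noncharacters.
        return REPLACEMENT
    return chr(cp)
```

With this sketch, `resolve_char_ref(0x0B)` and `resolve_char_ref(0x7F)` both yield U+FFFD, while `resolve_char_ref(0x41)` yields "A".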
18    
19    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
20    
21            * ContentChecker.pm: |form=""| check support added.
22    
23    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
24    
25            * ContentChecker.pm: |contextmenu| validness is now checked using
26            |id| and |id_type| properties, and |menu| property is removed.
27    
28    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
29    
30            * ContentChecker.pm: Prepare for |form| |name| attribute's
31            duplication checking.
32    
33    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
34    
35            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
36            unsupported charset="" parameter value (HTML5 revision 2131).
37    
38    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
39    
40            * HTML.pm.src: Remaining places where U+000B was allowed as a space
41            character are fixed (cf. HTML5 revision 1738).
42    
43            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
44            space characters (HTML5 revision 1738).
45    
46    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
47    
48            * HTML.pm.src: The "anything else" case for the "after after body"
49            insertion mode was not updated to switch to the "in body"
50            insertion mode.  U+000B is no longer a space character for the
51            purpose of tree construction phase (HTML5 revision 1738).
52    
53    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
54    
55            * HTML.pm.src: U+000B is no longer a space character (HTML5
56            revision 1738).
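
For readers following the U+000B entries above, here is a minimal Python sketch of the resulting space-character set (U+0009, U+000A, U+000C, U+000D, U+0020). The names `HTML5_SPACE_CHARS`, `is_html5_space`, and `strip_html5_space` are hypothetical and do not come from Whatpm.

```python
# HTML5 space characters once U+000B (VT) is excluded.
HTML5_SPACE_CHARS = frozenset("\t\n\x0c\r ")


def is_html5_space(ch):
    """True if ch is one of the five HTML5 space characters."""
    return ch in HTML5_SPACE_CHARS


def strip_html5_space(text):
    """Strip leading and trailing HTML5 space characters only.

    Python's bare str.strip() would also remove U+000B, so the set of
    characters to strip is passed explicitly.
    """
    return text.strip("".join(HTML5_SPACE_CHARS))
```

In this sketch a leading U+000B survives stripping, which is the observable effect of the change: `strip_html5_space("\x0b a \t")` keeps the VT character and drops only the trailing space and tab.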
57    
58    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
59    
60            * ContentType.pm: 0x0B is no longer a space character (HTML5
61            revision 1738).
62    
63            * HTML.pm.src: U+000B is no longer a space character for the
64            algorithm for extracting an encoding from a Content-Type (HTML5
65            revision 1738).
66    
67    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
68    
69            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
70    
71    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
72    
73            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
74    
75            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
76            matching.
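
Why sorting 'Prefix' values by length helps matching can be sketched as below. This is a hypothetical Python illustration rather than the mklangreg.pl code: the entry does not say whether the order is ascending or descending, so longest-first is assumed here, and the function names are invented.

```python
def sort_prefixes(prefixes):
    """Order prefixes longest first (assumed direction)."""
    return sorted(prefixes, key=len, reverse=True)


def best_prefix(tag, sorted_prefixes):
    """Return the first prefix that matches the tag, or None.

    A prefix matches when the tag equals it or continues with "-",
    compared case-insensitively. Because the list is sorted longest
    first, the first hit is also the most specific one.
    """
    tag_lower = tag.lower()
    for prefix in sorted_prefixes:
        prefix_lower = prefix.lower()
        if tag_lower == prefix_lower or tag_lower.startswith(prefix_lower + "-"):
            return prefix
    return None
```

With the prefixes "sl" and "sl-rozaj", the tag "sl-rozaj-biske" matches "sl-rozaj" on the first successful comparison, so no further search for a longer match is needed.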
77    
78    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
79    
80            * LangTag.pm: Warn for private use language subtags.  Error level
81            typos fixed.  Support for Suppress-Script field.
82    
83            * mklangreg.pl: Support for dumping of nested structure.
84    
85    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
86    
87            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
88            recommended case as per RFC 4646.
89    
90    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
91    
92            * LangTag.pm (check_rfc4646_langtag): New method.
93    
94    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
95    
96            * mklangreg.pl: New script.
97    
98            * Makefile: Updated for creation of the module for language subtag
99            registry.
100            
101    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
102    
103            * Makefile: WebIDL.html added.
104    
105            * WebIDL.pod: New documentation.
106    
107    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
108    
109            * WebIDL.pm: Checker's error types are redefined.
110    
111    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
112    
113            * WebIDL.pm: Parser's error types are redefined.  Some forward
114            compatible parsing bugs are fixed.  Some unreachable code is
115            commented out.
116    
117    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
118    
119            * WebIDL.pm: Support for the remaining extended attributes is
120            added.  It does not satisfy the definition that a forward
121            interface declaration has an extended attribute.  It seems that,
122            unless explicitly allowed, multiple extended attributes with the
123            same name are not allowed, though it is not explicitly mentioned in
124            the spec.
125    
126    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
127    
128            * WebIDL.pm: Unescapes extended attribute names and extended
129            attribute identifiers.  Preserve whether an extended attribute has
130            an argument list or not.  Support for extended attributes:
131            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
132            NameGetter, NameSetter, and Null.
133            (has_argument_list): New attribute.
134            (idl_text): Stringifies argument lists, if any, even if it is
135            empty.
136    
137    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
138    
139            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
140            |{s_kwd}| in DATA_STATE as default.
141    
142    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
143    
144            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
145            by |{s_kwd}| in DATA_STATE.
146    
147    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
148    
149            * HTML.pm.src: Shorten keys.
150    
151    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
152    
153            * HTML.pm.src: Remove checking for control character, surrogate
154            pair, or noncharacter code points and non-Unicode code
155            points (they should be handled by Whatpm::Charset::UnicodeChecker).
156            (parse_char_stream): Support for the |$get_wrapper| argument and
157            character stream error handlers.
158    
159    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
160    
161            * ContentChecker.pm: Don't call |loda_ns_module|
162            for null-namespace elements/attributes.
163    
164            * HTML.pm.src: Factor out $disallowed_control_chars
165            as a hash.
166    
167    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
168    
169            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
170            and |{next_char}| initializations are moved to initialization
171            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
172            with |parse_char_stream|.
173    
174    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
175    
176            * HTML.pm.src (parse_char_stream): Make |set_next_char|
177            invoke |manakai_read_until|, not only |read|, where
178            possible, to decrease the number of |read| method calls.
179    
180            * mkhtmlparser.pl: Related changes to the aforementioned
181            modification.
182    
183    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
184    
185            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
186            now reports character errors.
187    
188    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
189    
190            * HTML.pm.src: Non-white-space character tokens with leading
191            white space in the "before head" insertion mode were not
192            correctly handled.
193            (set_inner_html): Reimplemented using CharString decodehandle
194            class.  Support for $get_wrapper argument.  Support
195            for |{read_until}| feature.
196    
197    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
198    
199            * HTML.pm.src: Make a "bare ero" error for unknown
200            entities point to the "&" character.
201    
202    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
203    
204            * HTML.pm.src: It turns out that U+FFFD does not have to
205            be added to the list of excluded characters.
206    
207    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
208    
209            * HTML.pm.src ($char_onerror): Give the character decoder's |line|
210            and |column| a higher priority than the ones set by the
211            tokenizer's input handler.
212            ($self->{read_until}): Exclude U+FFFD (but this might
213            not be necessary, since now we do line/column fixup in
214            the character decode handle).
215    
216    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
217    
218            * HTML.pm.src: Use |{read_until}| where possible.
219    
220    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
221    
222            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
223            and |manakai_getc_until| to |manakai_read_until| to
224            reduce the number of string copies.
225    
226    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
227    
228            * HTML.pm.src (parse_char_string): Use newly created
229            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
230            standard feature to |open| a string as a filehandle,
231            since Perl's string filehandle does not seem to support the |ungetc|
232            method correctly.
233            (parse_char_stream): Define |{getc_until}| method.
234            (DATA_STATE): Experimental support for |getc_until| feature.
235    
236    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
237    
238            * HTML.pm.src: Check points added to newly added branches.
239    
240    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
241    
242            * HTML.pm.src: Remove |{char}|, which is no longer used.
243            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
244            and replace them by |{prev_state}|.
245    
246            * mkhtmlparser.pl: Remove |{char}| feature.
247            Remove |!!!back-next-input-character;| macro.
248    
249    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
250    
251            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
252            entity related tokenizer states in favor of new states
253            implementing the consume character reference algorithm.
254    
255    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
256    
257            * HTML.pm.src: "Consume a character reference" algorithm is
258            now implemented as a tokenizer's state, rather than
259            a method, with minimum changes (more changes will
260            be made, in due course).  "Bogus comment state"'s inner
261            loop gets removed.
262    
263    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
264    
265            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
266            into their own tokenizer states.
267    
268    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
269    
270            * HTML.pm.src: |CDATA_SECTION_STATE| (formerly |CDATA_BLOCK_STATE|)
271            is split into three states.
272    
273    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
274    
275            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
276            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
277            no longer does the tokenizer have to push back next input
278            characters in those states.
279    
280    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
281    
282            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
283            into four states so that no longer does the tokenizer have to push
284            back next input characters in that state.
285    
286    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
287    
288            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
289            which can be used to insert some wrapper between the character
290            stream handle and the tokenizer.  (It is currently not supported
291            for |set_inner_html| for |Element|s).
292    
293    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
294    
295            * HTML.pm.src: Ignore punctuation in charset names.
296    
297    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
298    
299            * ContentChecker.pm: Support for charset-layer error levels.
300    
301            * HTML.pm.src: Don't specify |text| argument for the
302            |chardecode:fallback| error, since it is not the encoding
303            being used alternatively.
304    
305    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
306    
307            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
308    
309    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
310    
311            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
312    
313    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
314    
315            * HTML.pm.src: Bug fix and sync with the spec with regard
316            to after after frameset insertion mode processing (HTML5
317            revision 1909).  Note that the implementation was wrong
318            per the old spec before the r1909 changes.
319    
320    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
321    
322            * HTMLTable.pm: scope=auto algorithm fix synced with the
323            spec (HTML5 revision 2093).
324            ($process_row): Algorithm step numbers synced with the
325            spec (HTML5 revision 2092).
326    
327    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
328    
329            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
330            revision 2094).
331    
332    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
333    
334            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
335    
336    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
337    
338            * HTML.pm.src: '"' and "'" at the end of attribute
339            name (after another attribute) now raise parse error (HTML5
340            revision 2123).  Empty unquoted attribute values are no
341            longer allowed (HTML5 revision 2122).
342    
343    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
344    
345            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
346            revision 2130).
347    
348    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
349    
350            * ContentChecker.pm: |xml:lang| attribute value must be the same
351            as |lang| attribute value for HTML elements (HTML5 revision 2062
352            and so on).
353    
354    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
355    
356            * ContentChecker.pm: Error level definition for |xml_id_error|
357            was missing.
358    
359            * URIChecker.pm: The end of the URL should be marked as the
360            error location for an empty path error.  The position
361            between the userinfo and the port components should be
362            marked as the error location for an empty host error.
363    
364    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
365    
366            * URIChecker.pm: Set parameters representing where in the
367            value the error occurs for errors.  Report unknown
368            address format error in warning level, since address
369            formats are rarely added.  Path segments starting with "/.."
370            were misinterpreted as a dot-segment.
371    
372    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
373    
374            * URIChecker.pm (check_iri_reference): Requires
375            |Message::DOM::DOMImplementation|.
376    
377    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
378    
379            * IMTChecker.pm: Updated for the new error reporting architecture.
380    
381            * ContentChecker.pm: Error levels for IMTs are added.
382    
383    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
384    
385            * H2H.pm (_shift_token): Support for unquoted HTML attribute
386            values.
387    
388    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
389    
390            * CacheManifest.pm: Support for new style of error
391            reports.
392    
393            * HTML.pm.src: Set line=1, column=1 to the document node.
394    
395    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
396    
397            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
398            and URL checkers.  Support for more error levels for bogus
399            language tag and URL "standards".
400    
401            * LangTag.pm, URIChecker.pm: Support for new style error
402            level reporting.
403    
404    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
405    
406            * ContentChecker.pm: Support for RDF/XML error levels.
407    
408            * HTMLTable.pm, RDFXML.pm: Support for new style of error level
409            specifying.  Error types are revised.
410    
411    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
412    
413            * ContentChecker.pm: All error reporting method calls are
414            renewed.
415    
416    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
417    
418            * HTML.pm.src: All error type names and "text" parameters
419            are revised.  Use new style for "level" specification.
420    
421            * mkhtmlparser.pl: Use new style for "level" specification.
422    
423    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
424    
425            * WebIDL.pm (parse_char_string): Simplified error
426            reporting process for broken ignored valuetype definition.
427            (Valuetype idl_text): Support for special "DOMString" name.
428    
429    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
430    
431            * WebIDL.pm ($get_scoped_name): Append "::::" if the last
432            terminal of the ScopedName is "DOMString", so that whether
433            the last part of the scoped name is "DOMString" or "_DOMString"
434            can be determined later.  This is necessary to decide whether a
435            |typedef| definition should be ignored or not.
436            (parse_char_string): Unescape the identifier of
437            exception members.
438            ($resolve): Return undef for builtin types and sequence<T>
439            types (we might not have to do this, however...).
440            (check): Support checking for Exceptions, Valuetypes,
441            and Typedefs.
442            ($serialize_type): Support for "DOMString::::" syntax.
443            (Typedef idl_text): Output Type as "DOMString" if it
444            is really "DOMString" (i.e. its internal representation
445            is "::DOMString::").
446    
447    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
448    
449            * WebIDL.pm ($resolve): New code, based on resolve code
450            for constant types in the |check| method.
451            (check): Support for checking of attributes, operations, and
452            arguments.
453            (Attribute/Operation idl_text): Exception names in getraises,
454            setraises, and raises clauses are serialized by |$serialize_type|
455            code.
456    
457    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
458    
459            * WebIDL.pm ($integer): Order of alternatives is changed to match
460            hexadecimal numbers (the original pattern, taken from the spec,
461            did not work for hexadecimal numbers, because the "0" prefix
462            matches together with the [0-7]* part (as an empty string) and
463            therefore it does not match the remaining "x..." part of a "0x..."
464            integer literal).
465            ($get_type): It now returns a string, not an array reference,
466            for regular types and |sequence| types (i.e. it in any case
467            returns a string).
468            ($get_next_token): The second item in the array that represents
469            an integer or float token is now a Perl number value, not the
470            original string representation of the number.
471            (check): Support for const value consistency checking.
472            No extended attribute is defined for constants.
473            (Node subclasses): Use simple strings rather than array references
474            for default data type values.
475            ($serialize_type): Type values are now simple strings.
476            (value): If the new attribute value is a false value, then
477            a FALSE value is set to the attribute.
478    
479    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
480    
481            * WebIDL.pm ($get_scoped_name): Now scoped names are stored
482            in its stringified format ("scoped name" as defined in the
483            spec).  Note that a future version of this module should not use
484            array references for type values and the |type_text| attribute
485            should be made obsolete.
486            (parse_char_string): Unescape attribute names.
487            (check): Support for checking of whether inherited interfaces
488            are actually defined or not.  Support for checking of whether
489            interface member identifiers are duplicated or not.
490            ($serialize_type): Scoped names are returned as is.  A future
491            version of this code should escape identifiers other than "DOMString",
492            otherwise the idl_text would be non-conforming.
493    
494    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
495    
496            * WebIDL.pm (parse_char_string): Set line/column numbers
497            to generated nodes.  Unescape identifiers.  Extended attributes
498            for Definition's were ignored.
499            (append_child): Set |parent_node| attribute.
500            (parent_node): New attribute.
501            (check): Support interface/exception members.  Support
502            extended attributes.  Support definition identifier uniqueness
503            constraint.
504            (qualified_name): New attribute.
505            (Interface/Exception idl_text): Extended attributes were
506            not prepended to the returned text.
507    
508    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
509    
510            * WebIDL.pm (parse_char_string): Set line/column numbers
511            to interface object experimentally.  s/shift/pop/g, shift
512            would make things wrong.  Support for interface forward
513            declarations was missing.  Broken interface declarations
514            with no block were not ignored entirely.
515            (Whatpm::WebIDL::Node): New abstract class.  This class
516            makes things easier.
517            (child_nodes): New attribute.  Unlike DOM's attribute with
518            same name, this attribute returns a dead list of nodes for
519            simplicity.
520            (get_user_data, set_user_data): New methods.
521            (Module idl_text): A SPACE character should be inserted
522            before the |{| character.
523            (Interface idl_text): Support for interface forward declarations.
524            (is_forward_declaration): New attribute.
525    
526    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
527    
528            * WebIDL.pm (type_text): Better serializer.
529    
530    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
531    
532            * WebIDL.pm: Revise forward-compatible parsing so that
533            it can now handle broken extended attributes and the like.
534    
535    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
536    
537            * WebIDL.pm: Real support for extended attributes.
538            Support for extended attributes with arguments.
539    
540    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
541    
542            * WebIDL.pm: Support for |exception| syntax.
543            (Interface->idl_text): Tentative support for inheritances.
544    
545    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
546    
547            * WebIDL.pm: Hierarchical scoped name support was broken.
548            Support for raises, setraises, and getraises syntaxes.
549    
550    2008-07-18  Wakaba  <wakaba@suika.fam.cx>
551    
552            * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
553            proper support for types, extended attributes, and exceptions yet).
554            WebIDL parser, version 1 (no support for exceptions yet,
555            no proper support for extended attributes yet).
556    
557    2008-07-09  Wakaba  <wakaba@suika.fam.cx>
558    
559            * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
560    
561    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
562    
563            * WebIDL.pm: Support for valuetype and const.
564    
565    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
566            
567            * WebIDL.pm: New module.
568    
569    2008-06-15  Wakaba  <wakaba@suika.fam.cx>
570    
571            * Makefile (Entities.html): URI changed.
572    
573    2008-06-08  Wakaba  <wakaba@suika.fam.cx>
574    
575            * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
576    
577    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
578    
579            * HTML.pm.src (_get_next_token): A parse error was missing.
580    
581    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
582    
583            * mklinktypelist.pl: rel=contact is no longer part of the HTML5
584            spec (commented out). (HTML5 revision 1711).
585    
586    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
587    
588            * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
589    
590            * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
591            as UTF-16 (HTML5 revision 1701).
592    
593    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
594    
595            * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
596            1692).
597    
598    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
599    
600            * HTML.pm.src: The secondary insertion mode used when switching
601            to foreign content is the "in body" insertion mode (HTML5 revision
602            1696).
603    
604    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
605    
606            * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
607            revision 1697).
608    
609    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
610    
611            * HTML.pm.src: Support for end-of-file token in foreign content
612            insertion mode (HTML5 revision 1693).  Update SVG camelCase
613            attribute list (HTML5 revision 1700).  <textarea> closes
614            </select> (HTML5 revision 1699).  More start tags close in
615            foreign content insertion mode (HTML5 revision 1698).
616    
617    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
618    
619            * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
620    
621    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
622    
623            * HTML.pm.src: More robust charset parameter detection (HTML5
624            revision 1674).
625    
626    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
627    
628            * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
629            revision 1676).
630    
631    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
632    
633            * HTML.pm.src: Ignore language part of public identifiers for
634            quirks mode detection (HTML5 revision 1679).
635    
636    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
637    
638            * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
639            revision 1685).
640    
641    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
642    
643            * HTML.pm.src: Support for EOF in new states for tags (HTML5
644            revision 1684).
645    
646    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
647    
648            * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
649            work (HTML5 revision 1690).
650    
651    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
652    
653            * HTML.pm.src (_tree_construction_main): Change handling of
654            end tags in head insertion modes (HTML5 revision 1686).
655            (parse_char_string): Bug fix for non-utf8 character string handling.
656            (parse_char_stream): |ungetc| does not work well for this context.
657    
658    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
659    
660            * HTML.pm.src (parse_byte_string): Redefined to invoke
661            |parse_byte_stream|.
662            (parse_byte_stream): New method.
663    
664    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
665    
666            * HTML.pm.src (parse_byte_string): Fix the column number reported
667            by encoding layer error reporter.
668    
669    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
670    
671            * HTML.pm.src (parse_byte_string): Use streaming decoder
672            rather than converting the whole byte string and then parsing.
673            Propagate errors in character encoding layer.
674            (get_next_token): Precise error reporting for |bare stago| error.
675    
676    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
677    
678            * HTML.pm.src (parse_char_stream): New method.
679            (parse_char_string): This method is now defined as an invocation
680            of the |parse_char_stream| method.
681    
682    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
683    
684            * HTML.pm.src (parse_byte_string): Report various status
685            of the sniffing as info-level errors.  Support for new
686            decoding framework in parser resetting.
687            (new): Various default error levels were not set.
688    
689    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
690    
691            * HTML.pm.src (parse_byte_string): HTML5 encoding sniffing
692            algorithm, except for the actual sniffing, is implemented
693            with new framework with Message::Charset::Info.
694    
695    2008-05-16  Wakaba  <wakaba@suika.fam.cx>
696    
697            * CacheManifest.pm (_parse): Drop fragment identifiers from
698            URIs in fallback section (HTML5 revision 1596).
699    
700    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
701    
702            * Makefile (Entities.html): URI has changed.
703    
704    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
705    
706            * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
707            1553).
708    
709    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
710    
711            * ContentChecker.pm: Noted that those returned in |table| are
712            no longer table elements, but table objects returned
713            by Whatpm::HTMLTable.
714    
715            * HTMLTable.pm (form_table): Return table element node
716            as |$table->{element}|.
717            (assign_header): Support for the |headers=""| attribute.
718    
719    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
720    
721            * HTMLTable.pm (assign_header): New function; first version
722            with no support for headers="".
723            (form_table): Include table width and height to the returned
724            table object for convenience.  Indexing in column assignment was wrong.
725            Set whether a data cell is empty or not for convenience.
726    
727    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
728    
729            * HTMLTable.pm: More robust caption support (HTML5 revision 1393).
730    
731    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
732    
733            * HTMLTable.pm: How table model errors are detected is
734            changed (HTML5 revision 1387).
735    
736    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
737    
738            * HTMLTable.pm: The algorithm now moves |tfoot| elements
739            to the end of the table (HTML5 revision 1380).
740    
741    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
742    
743            * HTMLTable.pm: The algorithm now uses 0-based indexing, instead
744            of 1-based (HTML5 revision 1376).
745    
746    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
747    
748            * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
749            and "Content-Encoding" no longer prevent sniffing (HTML5
750            revision 1288).
751    
752    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
753    
754            * ContentType.pm: Skip BOMs in feed or HTML algorithm (HTML5 revision
755            1282).
756    
757    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
758    
759            * ContentChecker.pm: Support for global attributes.
760            Status of XML specs is added.
761    
762    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
763    
764            * ContentChecker.pm (check_element): Support for |template=""|
765            and |ref=""| attribute (referent element type checking).
766    
767    2008-04-29  Wakaba  <wakaba@suika.fam.cx>
768    
769            * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
770            1500) is implemented (except for IDNA part and URI-scheme-specific
771            knowledge).  Line number counting was wrong for LF-only
772            documents.
773    
774    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
775    
776            * HTML.pm.src: Raise a parse error for any disallowed
777            character (HTML5 revision 1263).
778    
779    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
780    
781            * mkentitylist.pl: Support for new HTML5 entity table format (the
782            definition for |AElig;| was missing).
783    
784    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
785    
786            * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
787            name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
788    
789    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
790    
791            * HTML.pm.src: List of element names that close foreign content
792            insertion mode is added (HTML5 revisions 1412 and 1418).
793    
794    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
795    
796            * HTML.pm.src: Support for |mglyph| and |malignmark|
797            elements (HTML5 revision 1410).
798    
799    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
800    
801            * HTML.pm.src: Support for new long MathML entities (HTML5
802            revision 1406).
803    
804    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
805    
806            * HTML.pm.src: CDATA section support for MathML and SVG
807            elements (HTML5 revisions 1404 and 1420).
808    
809    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
810    
811            * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
812            elements (HTML5 revision 1404).  Unused !!!macro definitions
813            are removed.
814    
815    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
816    
817            * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
818            are raised is changed (HTML5 revision 1404).
819    
820    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
821    
822            * HTML.pm.src: Category sets that are no longer used are removed.
823    
824    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
825    
826            * HTML.pm.src: The ->[1] property of stack entries is now
827            replaced by constants representing element category.
828    
829    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
830    
831            * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
832            for error reporting.  (This is a preparation for using constant
833            value for ->[1].)
834    
835    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
836    
837            * RDFXML.pm: Typo fixed.
838    
839    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
840    
841            * ContentChecker.pm: |fact_level| is now treated
842            the same as |must_level|, i.e. level = |m|.
843            (check_element): Make list of URIs in the DOM.
844    
845    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
846    
847            * RDFXML.pm: Language accessor implemented.
848            Local (null-namespace) attribute support.
849    
850    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
851    
852            * RDFXML.pm: Factored out ID checking code.
853    
854    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
855    
856            * RDFXML.pm: TODO items noted.  Validation of ID
857            and URI attributes is implemented.  Warn if unknown
858            value is used in rdf:parseType="" attribute.
859    
860            * URIChecker.pm (check_rdf_uri_reference): New function.
861    
862    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
863    
864            * RDFXML.pm: bnodeid implemented.  Relative references
865            are now resolved.
866    
867    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
868    
869            * ContentChecker.pm: RDF reification implemented.
870    
871            * RDFXML.pm: undef vs false bug fixed.
872            Reification implemented.
873    
874    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
875    
876            * RDFXML.pm: s/id/ID/ for attribute name.
877            The |node| arguments are added for |ontriple| calls.
878            Too many "attribute not allowed" errors were raised.
879    
880            * ContentChecker.pm: Initial experimental support for rdf:RDF
881            element.
882    
883    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
884    
885            * RDFXML.pm: New module.
886    
887    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
888    
889            * HTML.pm.src (set_inner_html): Line/column number
890            code was still the old one.
891    
892    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
893    
894            * HTML.pm.src: Better line/column reporting for "duplicate attribute"
895            errors.  Line/column markings for DOCTYPE, comment, and
896            character tokens are reintroduced; otherwise,
897            error location for "not HTML5" error and errors
898            for implied elements are not attached.
899    
900    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
901    
902            * HTML.pm.src: Set line/column numbers to attributes.
903    
904            * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
905            (value): Setter implemented.
906    
907            * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
908    
909    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
910    
911            * HTML.pm.src: Unused line/column markings are removed.
912    
913    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
914    
915            * HTML.pm.src (_get_next_token): Remove |first_start_tag|
916            flag, which is no longer used.
917    
918    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
919    
920            * HTML.pm.src: Set line/column information to element nodes.
921    
922            * mkhtmlparser.pl (!!!create-element, !!!insert-element,
923            and !!!insert-element-t): Set line/column information to
924            element nodes.
925    
926    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
927    
928            * HTML.pm.src (_get_next_token): The first "<" character
929            in "<?", "<>", or "</>" should be the error point.
930    
931    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
932    
933            * HTML.pm.src: Some more fixes on error position reporting.
934    
935    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
936    
937            * HTML.pm.src: Token-level precise error reporting.
938    
939    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
940    
941            * HTML.pm.src: Preparation for more precise error point
942            reporting.
943    
944    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
945    
946            * HTML.pm.src: Error type revised.
947    
948    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
949    
950            * HTML.pm.src: Similar code is merged together, again.
951    
952    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
953    
954            * HTML.pm.src: Similar code is merged together.
955    
956    2008-03-10  Wakaba  <wakaba@suika.fam.cx>
957    
958            * mkhtmlparser.pl: Set "level" parameter to parse errors.
959    
960            * HTML.pm.src: Code refined.
961    
962    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
963    
964            * HTML.pm.src: |</body>| treatment has been changed (HTML5
965            revision 1348).  Note that I really don't know whether this makes
966            any difference in the black-box behavior of the parser.
967    
968    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
969    
970            * HTML.pm.src: New end-of-file token implementation (HTML5
971            revision 1348).
972    
973    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
974    
975            * HTML.pm.src: |applet| support (HTML5 revision 1347).
976    
977    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
978    
979            * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
980    
981    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
982    
983            * HTML.pm.src: Support for |<input>| in the "in select" insertion
984            mode and support for the "in select in table" insertion mode (HTML5
985            revision 1342).
986    
987    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
988    
989            * HTML.pm.src: No foster parenting for <style> and <script>
990            in non-tainted <table>s (HTML5 revision 1336).
991    
992    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
993    
994            * HTML.pm.src: Ignore white space characters between <html>
995            and <head> (HTML5 revision 1332).
996    
997    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
998    
999            * HTML.pm.src: Treat <input type=hidden> as if it were a
1000            white space (HTML5 revision 1331).
1001    
1002    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1003    
1004            * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
1005            element (HTML5 revision 1330).
1006    
1007    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1008    
1009            * HTML.pm.src: <title> is always appended to the current
1010            element (HTML5 revision 1328).
1011    
1012    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1013    
1014            * HTML.pm.src: White space in tainted tables is moved
1015            into foster parents (HTML5 revision 1326).
1016    
1017    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1018    
1019            * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
1020            revision 1321).
1021    
1022    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1023    
1024            * HTML.pm.src: |</h/n/>| case code rearranged to align with
1025            the spec (HTML5 revision 1320).  Note that this finally completes
1026            all of the HTML5 revision 1320 changes.
1027    
1028    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1029    
1030            * HTML.pm.src: |</form>| now works similarly to |</div>| for unclosed
1031            tags (HTML5 revision 1320).
1032    
1033    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1034    
1035            * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
1036    
1037    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1038    
1039            * HTML.pm.src: A "generate implied end tags" code (t409.1)
1040            could not be reached, so it is now removed (HTML5 revision
1041            1320).
1042    
1043    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1044    
1045            * HTML.pm.src: Code for the case of |</div>| and so on is revised
1046            to align with new spec text (HTML5 revision 1320).
1047    
1048    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1049    
1050            * HTML.pm.src: Remove strange |if| condition; however, it should
1051            have done no harm in theory.
1052    
1053    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1054    
1055            * HTML.pm.src (_tree_construction_main): '</p>' in body
1056            case is split from other end tags in preparation
1057            for implementing HTML5 revision 1320.
1058    
1059    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
1060    
1061            * HTML.pm.src: Simplified "generate implied end tag" (HTML5
