/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.141 by wakaba, Sat Feb 23 15:24:49 2008 UTC revision 1.329 by wakaba, Sat Sep 20 11:25:56 2008 UTC
# Line 1  Line 1 
1    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src: Reminding places where U+000B is allowed as a space
4            character is fixed (cf. HTML5 revision 1738).
5    
6            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
7            space characters (HTML5 revision 1738).
8    
9    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
10    
11            * HTML.pm.src: The "anything else" case for the "after after body"
12            insertion mode was not updated to swtich to the "in body"
13            insertion mode.  U+000B is no longer a space character for the
14            purpose of tree construction phase (HTML5 revision 1738).
15    
16    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
17    
18            * HTML.pm.src: U+000B is no longer a space character (HTML5
19            revision 1738).
20    
21    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
22    
23            * ContentType.pm: 0x0B is no longer a space character (HTML5
24            revision 1738).
25    
26            * HTML.pm.src: U+000B is no longer a space character for the
27            algorithm for extracting an encoding from a Content-Type (HTML5
28            revision 1738).
29    
30    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
31    
32            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
33    
34    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
35    
36            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
37    
38            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
39            matching.
40    
41    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
42    
43            * LangTag.pm: Warn for private use language subtags.  Error level
44            typos fixed.  Support for Suppress-Script field.
45    
46            * mklangreg.pl: Support for dumping of nested structure.
47    
48    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
49    
50            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
51            recommended case as per RFC 4646.
52    
53    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
54    
55            * LangTag.pm (check_rfc4646_langtag): New method.
56    
57    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
58    
59            * mklangreg.pl: New script.
60    
61            * Makefile: Updated for creation of the module for language subtag
62            registry.
63            
64    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
65    
66            * Makefile: WebIDL.html added.
67    
68            * WebIDL.pod: New documentation.
69    
70    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
71    
72            * WebIDL.pm: Checker's error types are redefined.
73    
74    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
75    
76            * WebIDL.pm: Parser's error types are redefined.  Some forward
77            compatible parsing bugs are fixed.  Some unreachable codes are
78            commented out.
79    
80    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
81    
82            * WebIDL.pm: Support for the reminding extended attributes are
83            added.  It does not satisfy the definition that a forward
84            interface declaration has an extended attribute.  It seems that
85            unless explicitly allowed multiple extended attributes with the
86            same name is not allowed, though it is not explicitly mentioned in
87            the spec.
88    
89    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
90    
91            * WebIDL.pm: Unescapes extended attribute names and extended
92            attribute identifiers.  Preserve whether an extended attribute has
93            an argument list of not.  Support for extended attributes:
94            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
95            NameGetter, NameSetter, and Null.
96            (has_argument_list): New attribute.
97            (idl_text): Stringifies argument lists, if any, even if it is
98            empty.
99    
100    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
101    
102            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
103            |{s_kwd}| in DATA_STATE as default.
104    
105    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
106    
107            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
108            by |{s_kwd}| in DATA_STATE.
109    
110    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
111    
112            * HTML.pm.src: Shorten keys.
113    
114    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
115    
116            * HTML.pm.src: Remove checking for control character, surrogate
117            pair, or noncharacter code points and non-Unicode code
118            points (they should be handled by Whatpm::Charset::UnicodeChecker).
119            (parse_char_stream): Support for the |$get_wrapper| argument and
120            character stream error handlers.
121    
122    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
123    
124            * ContentChecker.pm: Don't call |loda_ns_module|
125            for null-namespace elements/attributes.
126    
127            * HTML.pm.src: Fact out $disallowed_control_chars
128            as a hash.
129    
130    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
131    
132            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
133            and |{next_char}| initializations are moved to initialization
134            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
135            with |parse_char_stream|.
136    
137    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
138    
139            * HTML.pm.src (parse_char_stream): Make |set_next_char|
140            invoke |manakai_read_until|, not only |read|, where
141            possible, to decrease the number of |read| method calls.
142    
143            * mkhtmlparser.pl: Related changes to the aforementioned
144            modification.
145    
146    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
147    
148            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
149            would report character error from now.
150    
151    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
152    
153            * HTML.pm.src: White-space-leaded non-white-space character
154            tokens in "before head insertion mode" was not
155            correctly handled.
156            (set_inner_html): Reimplemented using CharString decodehandle
157            class.  Support for $get_wrapper argument.  Support
158            for |{read_until}| feature.
159    
160    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
161    
162            * HTML.pm.src: Make a "bare ero" error for unknown
163            entities point the "&" character.
164    
165    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
166    
167            * HTML.pm.src: It turns out that U+FFFD don't have to
168            be added to the list of excluded characters.
169    
170    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
171    
172            * HTML.pm.src ($char_onerror): Have character decoder's |line|
173            and |column| a higher priority than the one set by the
174            tokenizer's input handler.
175            ($self->{read_until}): Exclude U+FFFD (but this might
176            not be necessary, since now we do line/column fixup in
177            the character decode handle).
178    
179    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
180    
181            * HTML.pm.src: Use |{read_until}| where possible.
182    
183    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
184    
185            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
186            and |manakai_getc_until| to |manakai_read_until| to
187            reduce the number of string copies.
188    
189    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
190    
191            * HTML.pm.src (parse_char_string): Use newly created
192            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
193            standard feature to |open| a string as a filehandle,
194            since Perl's string filehandle seems not supporting |ungetc|
195            method correctly.
196            (parse_char_stream): Define |{getc_until}| method.
197            (DATA_STATE): Experimental support for |getc_until| feature.
198    
199    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
200    
201            * HTML.pm.src: Check points added to newly added branches.
202    
203    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
204    
205            * HTML.pm.src: Remove |{char}|, which is no longer used.
206            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
207            and replaced by |{prev_state}|.
208    
209            * mkhtmlparser.pl: Remove |{char}| feature.
210            Remove |!!!back-next-input-character;| macro.
211    
212    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
213    
214            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
215            entity related tokenizer states in favor of new states
216            implementing the consume character reference algorithm.
217    
218    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
219    
220            * HTML.pm.src: "Consume a character reference" algorithm is
221            now implemented as a tokenizer's state, rather than
222            a method, with minimum changes (more changes will
223            be made, in due course).  "Bogus comment state"'s inner
224            loop gets removed.
225    
226    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
227    
228            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
229            into their own tokenizer states.
230    
231    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
232    
233            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
234            is split into three states.
235    
236    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
237    
238            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
239            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
240            no longer does the tokenizer have to push back next input
241            characters in those states.
242    
243    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
244    
245            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
246            into four states so that no longer does the tokenizer have to push
247            back next input characters in that state.
248    
249    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
250    
251            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
252            which can be used to insert some wrapper between the character
253            stream handle and the tokenizer.  (It is currently not supported
254            for |set_inner_html| for |Element|s).
255    
256    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
257    
258            * HTML.pm.src: Ignore punctuations in charset names.
259    
260    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
261    
262            * ContentChecker.pm: Support for charset-layer error levels.
263    
264            * HTML.pm.src: Don't specify |text| argument for the
265            |chardecode:fallback| error, since it is not the encoding
266            being used alternatively.
267    
268    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
269    
270            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
271    
272    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
273    
274            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
275    
276    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
277    
278            * HTML.pm.src: Bug fix and sync with the spec with regard
279            to after after frameset insertion mode processing (HTML5
280            revision 1909).  Note that the implementation was wrong
281            per the old spec before the r1909 changes.
282    
283    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
284    
285            * HTMLTable.pm: scope=auto algorithm fix synced with the
286            spec (HTML5 revision 2093).
287            ($process_row): Algorithm step numbers synced with the
288            spec (HTML5 revision 2092).
289    
290    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
291    
292            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
293            revision 2094).
294    
295    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
296    
297            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
298    
299    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
300    
301            * HTML.pm.src: '"' and "'" at the end of attribute
302            name (after another attribute) now raise parse error (HTML5
303            revision 2123).  Empty unquoted attribute values are no
304            longer allowed (HTML5 revision 2122).
305    
306    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
307    
308            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
309            revision 2130).
310    
311    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
312    
313            * ContentChecker.pm: |xml:lang| attribute value must be same
314            as |lang| attribute value for HTML elements (HTML5 revision 2062
315            and so on).
316    
317    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
318    
319            * ContentChecker.pm: Error level definition for |xml_id_error|
320            was missing.
321    
322            * URIChecker.pm: The end of the URL should be marked as the
323            error location for an empty path error.  The position
324            between the userinfo and the port components should be
325            marked as the error location for an empty host error.
326    
327    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
328    
329            * URIChecker.pm: Set parameters representing where in the
330            value the error occurs for errors.  Report unknown
331            address format error in warning level, since address
332            formats are rarely added.  Path segments starting with "/.."
333            were misinterpreted as a dot-segment.
334    
335    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
336    
337            * URIChecker.pm (check_iri_reference): Requires
338            |Message::DOM::DOMImplementation|.
339    
340    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
341    
342            * IMTChecker.pm: Updated for the new error reporting architecture.
343    
344            * ContentChecker.pm: Error levels for IMTs are added.
345    
346    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
347    
348            * H2H.pm (_shift_token): Support for unquoted HTML attribute
349            values.
350    
351    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
352    
353            * CacheManifest.pm: Support for new style of error
354            reports.
355    
356            * HTML.pm.src: Set line=1, column=1 to the document node.
357    
358    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
359    
360            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
361            and URL checkers.  Support for more error levels for bogus
362            langauge tag and URL "standards".
363    
364            * LangTag.pm, URIChecker.pm: Support for new style error
365            level reporting.
366    
367    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
368    
369            * ContentChecker.pm: Support for RDF/XML error levels.
370    
371            * HTMLTable.pm, RDFXML.pm: Support for new style of error level
372            specifying.  Error types are revised.
373    
374    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
375    
376            * ContentChecker.pm: All error reporting method calls are
377            renewed.
378    
379    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
380    
381            * HTML.pm.src: All error type names and "text" parameters
382            are revised.  Use new style for "level" specification.
383    
384            * mkhtmlparser.pl: Use new style for "level" specification.
385    
386    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
387    
388            * WebIDL.pm (parse_char_string): Simplified error
389            reporting process for broken ignored valuetype definition.
390            (Valuetype idl_text): Support for special "DOMString" name.
391    
392    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
393    
394            * WebIDL.pm ($get_scoped_name): Append "::::" if the last
395            terminal of the ScopedName is "DOMString", such that whether
396            the last part of the scoped name is "DOMString" or "_DOMString"
397            later.  It is necessary to determine whether a |typedef|
398            definition should be ignored or not.
399            (parse_char_string): Unescape the identifier of
400            exception members.
401            ($resolve): Return undef for builtin types and sequence<T>
402            types (we might not have to do this, however...).
403            (check): Support checking for Exceptions, Valuetypes,
404            and Typedefs.
405            ($serialize_type): Support for "DOMString::::" syntax.
406            (Typedef idl_text): Output Type as "DOMString" if it
407            is really "DOMString" (i.e. its internal representation
408            is "::DOMString::").
409    
410    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
411    
412            * WebIDL.pm ($resolve): New code, based on resolve code
413            for constant types in the |check| method.
414            (check): Support for checking of attributes, operations, and
415            arguments.
416            (Attribute/Operation idl_text): Exception names in getraises,
417            setraises, and raises clauses is serizlied by |$serialize_type|
418            code.
419    
420    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
421    
422            * WebIDL.pm ($integer): Order of selections are changed to match
423            hexadecimal numbers (the original pattern, taken from the spec,
424            was not work for hexadecimal numbers, because the "0" prefix
425            matches to the [0-7]* part (as an empty string) and therefore
426            it does not match with remaining "x..." part of a "0x..." integer
427            literal.
428            ($get_type): It now returns a string, not an array reference,
429            for regular types and |sequence| types (i.e. it in any case
430            returns a string).
431            ($get_next_token): The second item in the array that represents
432            a integer or float token is now a Perl number value, not the
433            original string representation of the number.
434            (check): Support for const value consistency checking.
435            No extended attribute is defined for constants.
436            (Node subclasses): Use simple strings rather than array references
437            for default data type values.
438            ($serialize_type): Type values are now simple strings.
439            (value): If the new attribute value is a false value, then
440            a FALSE value is set to the attribute.
441    
442    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
443    
444            * WebIDL.pm ($get_scoped_name): Now scoped names are stored
445            in its stringified format ("scoped name" as defined in the
446            spec).  Note that future version of this module should not use
447            array references for type values and the |type_text| attribute
448            should be made obsolete.
449            (parse_char_string): Unescape attribute names.
450            (check): Support for checking of whether inherited interfaces
451            are actually defined or not.  Support for checking of whether
452            interface member identifiers are duplicated or not.
453            ($serialize_type): Scoped names are returned as is.  A future
454            version of this code should escape identifiers other than "DOMString",
455            otherwise the idl_text would be non-conforming.
456    
457    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
458    
459            * WebIDL.pm (parse_char_string): Set line/column numbers
460            to generated nodes.  Unescape identifiers.  Extended attributes
461            for Definition's were ignored.
462            (append_child): Set |parent_node| attribute.
463            (parent_node): New attribute.
464            (check): Support interface/exception members.  Support
465            extended attributes.  Support definition identifier uniqueness
466            constraint.
467            (qualified_name): New attribute.
468            (Interface/Exception idl_text): Extended attributes were
469            not prepended to the returned text.
470    
471    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
472    
473            * WebIDL.pm (parse_char_string): Set line/column numbers
474            to interface object experimentally.  s/shift/pop/g, shift
475            would make things wrong.  Support for interface forward
476            declarations was missing.  Broken interface declarations
477            with no block were not ignored entirely.
478            (Whatpm::WebIDL::Node): New abstract class.  This class
479            makes things easier.
480            (child_nodes): New attribute.  Unlike DOM's attribute with
481            same name, this attribute returns a dead list of nodes for
482            simplicity.
483            (get_user_data, set_user_data): New methods.
484            (Module idl_text): A SPACE character should be inserted
485            before the |{| character.
486            (Interface idl_text): Support for interface forward declarations.
487            (is_forward_declaration): New attribute.
488    
489    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
490    
491            * WebIDL.pm (type_text): Better serializer.
492    
493    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
494    
495            * WebIDL.pm: Revise forward-compatible parsing so that
496            it now can handle broken extended attributes and as such.
497    
498    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
499    
500            * WebIDL.pm: Real support for extended attributes.
501            Support for extended attributes with arguments.
502    
503    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
504    
505            * WebIDL.pm: Support for |exception| syntax.
506            (Interface->idl_text): Tentative support for inheritances.
507    
508    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
509    
510            * WebIDL.pm: Hierarchical scoped name support was broken.
511            Support for raises, setraises, and getraises syntaxes.
512    
513    2008-07-18  Wakaba  <wakaba@suika.fam.cx>
514    
515            * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
516            proper support for types, extended attributes, and exceptions yet).
517            WebIDL parser, version 1 (no support for exceptions yet,
518            no proper support for extended attributes yet).
519    
520    2008-07-09  Wakaba  <wakaba@suika.fam.cx>
521    
522            * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
523    
524    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
525    
526            * WebIDL.pm: Support for valuetype and const.
527    
528    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
529            
530            * WebIDL.pm: New module.
531    
532    2008-06-15  Wakaba  <wakaba@suika.fam.cx>
533    
534            * Makefile (Entities.html): URI changed.
535    
536    2008-06-08  Wakaba  <wakaba@suika.fam.cx>
537    
538            * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
539    
540    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
541    
542            * HTML.pm.src (_get_next_token): A parse error was missing.
543    
544    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
545    
546            * mklinktypelist.pl: rel=contact is no longer part of the HTML5
547            spec (commented out). (HTML5 revision 1711).
548    
549    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
550    
551            * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
552    
553            * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
554            as UTF-16 (HTML5 revision 1701).
555    
556    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
557    
558            * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
559            1692).
560    
561    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
562    
563            * HTML.pm.src: The secondary insertion mode used when switching
564            to foreign content is the "in body" insertion mode (HTML5 revision
565            1696).
566    
567    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
568    
569            * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
570            revision 1697).
571    
572    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
573    
574            * HTML.pm.src: Support for end-of-file token in foreign content
575            insertion mode (HTML5 revision 1693).  Update SVG camelCase
576            attribute list (HTML5 revision 1700).  <textarea> closes
577            </select> (HTML5 revision 1699).  More start tags close in
578            foreign content insertion mode (HTML5 revision 1698).
579    
580    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
581    
582            * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
583    
584    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
585    
586            * HTML.pm.src: More robust charset parameter detection (HTML5
587            revision 1674).
588    
589    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
590    
591            * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
592            revision 1676).
593    
594    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
595    
596            * HTML.pm.src: Ignore language part of public identifiers for
597            quriks mode detection (HTML5 revision 1679).
598    
599    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
600    
601            * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
602            revision 1685).
603    
604    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
605    
606            * HTML.pm.src: Support for EOF in new states for tags (HTML5
607            revision 1684).
608    
609    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
610    
611            * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
612            work (HTML5 revision 1690).
613    
614    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
615    
616            * HTML.pm.src (_tree_construction_main): Change handling of
617            end tags in head insertion modes (HTML5 revision 1686).
618            (parse_char_string): Bug fix for non-utf8 character string handlings.
619            (parse_char_stream): |ungetc| does not work well for this context.
620    
621    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
622    
623            * HTML.pm.src (parse_byte_string): Redefined to invoke
624            |parse_byte_stream|.
625            (parse_byte_stream): New method.
626    
627    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
628    
629            * HTML.pm.src (parse_byte_string): Fix the column number reported
630            by encoding layer error reporter.
631    
632    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
633    
634            * HTML.pm.src (parse_byte_string): Use streaming decoder
635            rather than converting the whole byte string and then parsing.
636            Propagate errors in character encoding layer.
637            (get_next_token): Precise error reporting for |bare stago| error.
638    
639    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
640    
641            * HTML.pm.src (parse_char_stream): New method.
642            (parse_char_string): This method is now defined as an invocation
643            of the |parse_char_stream| method.
644    
645    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
646    
647            * HTML.pm.src (parse_byte_string): Report various status
648            of the sniffing as info-level errors.  Support for new
649            decoding framework in parser resestting.
650            (new): Various default error levels were not set.
651    
652    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
653    
654            * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
655            algorithm, except for the actual sniffing, is implemented
656            with new framework with Message::Charset::Info.
657    
658    2008-05-16  Wakaba  <wakaba@suika.fam.cx>
659    
660            * CacheManifest.pm (_parse): Drop fragment identifiers from
661            URIs in fallback section (HTML5 revision 1596).
662    
663    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
664    
665            * Makefile (Entities.html): URI has changed.
666    
667    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
668    
669            * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
670            1553).
671    
672    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
673    
674            * ContentChecker.pm: Noted that those returned in |table| are
675            no longer table elements, but table objects returned
676            by Whatpm::HTMLTable.
677    
678            * HTMLTable.pm (form_table): Return table element node
679            as |$table->{element}|.
680            (assign_header): Support for the |headers=""| attribute.
681    
682    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
683    
684            * HTMLTable.pm (assign_header): New function; first version
685            with no support for headers="".
686            (form_table): Include table width and height to the returned
687            table object for covenience.  Indexing in column assignement was wrong.
688            Set whether a data cell is empty or not for convenience.
689    
690    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
691    
692            * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
693    
694    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
695    
696            * HTMLTable.pm: How table model errors are detected is
697            changed (HTML5 revision 1387).
698    
699    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
700    
701            * HTMLTable.pm: The algorithm now moves |tfoot| elements
702            to the end of the table (HTML5 revision 1380).
703    
704    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
705    
706            * HTMLTable.pm: The algorithm is now 0-based indexing, instead
707            of 1-based (HTML5 revision 1376).
708    
709    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
710    
711            * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
712            and "Content-Encoding" no longer prevent sniffing (HTML5
713            revision 1288).
714    
715    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
716    
717            * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
718            1282).
719    
720    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
721    
722            * ContentChecker.pm: Support for global attributes.
723            Status of XML specs are added.
724    
725    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
726    
727            * ContentChecker.pm (check_element): Support for |template=""|
728            and |ref=""| attribute (referent element type checking).
729    
730    2008-04-29  Wakaba  <wakaba@suika.fam.cx>
731    
732            * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
733            1500) is implemented (except for IDNA part and URI-scheme-specific
734            knowledge).  Line number counting was wrong for LF-only
735            documents.
736    
737    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
738    
739            * HTML.pm.src: Raise an parse error for any disallowed
740            character (HTML5 revision 1263).
741    
742    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
743    
744            * mkentitylist.pl: Support for new HTML5 entity table format (the
745            definition for |AElig;| was missing).
746    
747    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
748    
749            * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
750            name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
751    
752    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
753    
754            * HTML.pm.src: List of element names that close foreign content
755            insertion mode is added (HTML5 revisions 1412 and 1418).
756    
757    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
758    
759            * HTML.pm.src: Support for |mglyph| and |malignmark|
760            elements (HTML5 revision 1410).
761    
762    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
763    
764            * HTML.pm.src: Support for new long MathML entities (HTML5
765            revision 1406).
766    
767    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
768    
769            * HTML.pm.src: CDATA section support for MathML and SVG
770            elements (HTML5 revision 1404 and 1420).
771    
772    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
773    
774            * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
775            elements (HTML5 revision 1404).  Unused !!!macro definitions
776            are removed.
777    
778    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
779    
780            * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
781            are raised is changed (HTML5 revision 1404).
782    
783    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
784    
785            * HTML.pm.src: Category sets that are no longer used are removed.
786    
787    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
788    
789            * HTML.pm.src: The ->[1] property of stack entries are now
790            replaced by constants representing element category.
791    
792    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
793    
794            * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
795            for error reporting.  (This is a preparation for using constant
796            value for ->[1].)
797    
798    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
799    
800            * RDFXML.pm: Typo fixed.
801    
802    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
803    
804            * ContentChecker.pm: |fact_level| is now treated
805            as same as |must_level|, i.e. level = |m|.
806            (check_element): Make list of URIs in the DOM.
807    
808    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
809    
810            * RDFXML.pm: Language accessor implemented.
811            Local (null-namespace) attribute support.
812    
813    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
814    
815            * RDFXML.pm: Factored out ID checking code.
816    
817    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
818    
819            * RDFXML.pm: TODO items noted.  Validation of ID
820            and URI attributes is implemented.  Warn if unknown
821            value is used in rdf:parseType="" attribute.
822    
823            * URIChecker.pm (check_rdf_uri_reference): New function.
824    
825    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
826    
827            * RDFXML.pm: bnodeid implemented.  Relative references
828            are now resolved.
829    
830    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
831    
832            * ContentChecker.pm: RDF reification implemented.
833    
834            * RDFXML.pm: undef vs false bug fixed.
835            Reification implemented.
836    
837    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
838    
839            * RDFXML.pm: s/id/ID/ for attribute name.
840            The |node| arguments are added for |ontriple| calls.
841            Too many "attribute not allowed" errors were raised.
842    
843            * ContentChecker.pm: Initial experimental support for rdf:RDF
844            element.
845    
846    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
847    
848            * RDFXML.pm: New module.
849    
850    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
851    
852            * HTML.pm.src (set_inner_html): Line/column number
853            code was old one yet.
854    
855    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
856    
857            * HTML.pm.src: Better line/column reporting for "duplicate attribute"
858            errors.  Line/column markings for DOCTYPE, comment, and
859            character tokens are reintroduced; otherwise,
860            error location for "not HTML5" error and errors
861            for implied elements are not attached.
862    
863    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
864    
865            * HTML.pm.src: Set line/column numbers to attributes.
866    
867            * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
868            (value): Setter implemented.
869    
870            * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
871    
872    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
873    
874            * HTML.pm.src: Unused line/column markings are removed.
875    
876    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
877    
878            * HTML.pm.src (_get_next_token): Remove |first_start_tag|
879            flag, which is no longer used.
880    
881    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
882    
883            * HTML.pm.src: Set line/column information to element nodes.
884    
885            * mkhtmlparser.pl (!!!create-element, !!!insert-element,
886            and !!!insert-element-t): Set line/column information to
887            element nodes.
888    
889    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
890    
891            * HTML.pm.src (_get_next_token): The first "<" character
892            in "<?", "<>", or "</>" should be the error point.
893    
894    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
895    
896            * HTML.pm.src: Some more fixes on error position reporting.
897    
898    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
899    
900            * HTML.pm.src: Token-level precious error reporting.
901    
902    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
903    
904            * HTML.pm.src: Preparation for more precious error point
905            reporting.
906    
907    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
908    
909            * HTML.pm.src: Error type revised.
910    
911    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
912    
913            * HTML.pm.src: Similar codes are merged together, again.
914    
915    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
916    
917            * HTML.pm.src: Similar codes are merged together.
918    
919    2008-03-10  Wakaba  <wakaba@suika.fam.cx>
920    
921            * mkhtmlparser.pl: Set "level" parameter to parse errors.
922    
923            * HTML.pm.src: Code refined.
924    
925    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
926    
927            * HTML.pm.src: |</body>| treatement has been changed (HTML5
928            revision 1348).  Note that I really don't know this makes
929            any difference in the black-box behavior of the parser.
930    
931    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
932    
933            * HTML.pm.src: New end-of-file token implementation (HTML5
934            revision 1348).
935    
936    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
937    
938            * HTML.pm.src: |applet| support (HTML5 revision 1347).
939    
940    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
941    
942            * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
943    
944    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
945    
946            * HTML.pm.src: Support for |<input>| in the "in select" insertion
947            mode and support for the "in select in table" insertion mode (HTML5
948            revision 1342).
949    
950    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
951    
952            * HTML.pm.src: No foster parenting for <script> and <script>
953            in non-tainted <table>s (HTML5 revision 1336).
954    
955    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
956    
957            * HTML.pm.src: Ignore white space characters between <html>
958            and <head> (HTML5 revision 1332).
959    
960    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
961    
962            * HTML.pm.src: Treat <input type=hidden> as if it were a
963            white space (HTML5 revision 1331).
964    
965    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
966    
967            * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
968            element (HTML5 revision 1330).
969    
970    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
971    
972            * HTML.pm.src: <title> is always appended to the current
973            element (HTML5 revision 1328).
974    
975    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
976    
977            * HTML.pm.src: White space in tainted tables are moved
978            into foster parents (HTML5 revision 1326).
979    
980    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
981    
982            * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
983            revision 1321).
984    
985    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
986    
987            * HTML.pm.src: |</h/n/>| case code rearranged to align with
988            the spec (HTML5 revision 1320).  Note that we finally complete
989            all of HTML5 revision 1320 changes.
990    
991    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
992    
993            * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
994            tags (HTML5 revision 1320).
995    
996    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
997    
998            * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
999    
1000    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1001    
1002            * HTML.pm.src: A "generate implied end tags" code (t409.1)
1003            could not be reached so that it is now removed (HTML5 revision
1004            1320).
1005    
1006    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1007    
1008            * HTML.pm.src: Code for the case of |</div>| and so on are revised
1009            to align with new spec text (HTML5 revision 1320).
1010    
1011    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1012    
1013            * HTML.pm.src: Remove strange |if| condition; however, it should
1014            have had no harm in theory.
1015    
1016    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1017    
1018            * HTML.pm.src (_tree_construction_main): '</p>' in body
1019            case is split from other end tags for the preparation
1020            of implementing HTML revision 1320.
1021    
1022    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
1023    
1024            * HTML.pm.src: Simplified "generate implied end tag" (HTML5
1025            revision 1320).
1026    
1027    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
1028    
1029            * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
1030            and "div" (HTML5 revision 1318).  Add comments to where
1031            |form| pointer association codes should be inserted (HTML5 revision
1032            1319).
1033    
1034    2008-03-06  Wakaba  <wakaba@suika.fam.cx>
1035    
1036            * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
1037    
1038    2008-03-05  Wakaba  <wakaba@suika.fam.cx>
1039    
1040            * HTML.pm.src: Since the case t268 should never be reached (no
1041            other token type, there are), it is replaced by a |die| statement.
1042    
1043    2008-03-05  Wakaba  <wakaba@suika.fam.cx>
1044    
1045            * HTML.pm.src: Typo fixed.
1046    
1047    2008-03-04  Wakaba  <wakaba@suika.fam.cx>
1048    
1049            * HTML.pm.src (_tree_construction_initial): Some limited quirks
1050            doctypes were not uppercased for comparison.
1051    
1052    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1053    
1054            * HTML.pm.src (tree construction and set_inner_html): Checkpoints
1055            are added.
1056    
1057    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1058    
1059            * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
1060            are set.  Cases that are unlikely reached are noted as so.
1061    
1062    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1063    
1064            * HTML.pm.src: Checkpoints for debugging are added.
1065    
1066            * mkhtmlparser.pl: Support for |!!!cp| syntax.
1067    
1068    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1069    
1070            * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
1071            for simplicity.
1072    
1073    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1074    
1075            * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
1076            revision 1307).
1077    
1078    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
1079    
1080            * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
1081            is set are changed (HTML5 revision 1305).
1082    
1083    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
1084    
1085            * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
1086            1292).  Entities are not parsed in comment-like part in RCDATA
1087            elements (HTML5 revision 1294).  Allow bare & at the end
1088            of attribute value literals (HTML5 revision 1296).  More
1089            quirks mode doctypes (HTML5 revision 1302).  Requires spaces
1090            between attributes and ban attribute names or unquoted
1091            attribute values containing single or double quotes (HTML5
1092            revision 1303).
1093    
1094    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
1095    
1096            * ContentChecker.pm: Typo fixed.  Don't raise "character encoding"
1097            and related errors unless it is an HTML document (though the spec
1098            is unclear on whether it is applied to XHTML document).
1099    
1100            * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
1101            are added.
1102    
1103    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
1104    
1105            * HTML.pm.src: s/local_name/manakai_local_name/g.
1106    
1107    2008-03-01  Wakaba  <wakaba@suika.fam.cx>
1108    
1109            * _NamedEntityList.pm: Updated (HTML5 revision 1286).
1110    
1111            * HTML.pm.src: |charset| in |content| attribute is
1112            case-insensitive (HTML5 revision 1270).
1113    
1114    2008-02-26  Wakaba  <wakaba@suika.fam.cx>
1115    
1116            * ContentChecker.pm: New status constants are added.
1117            ($ElementDefault): |status| added.
1118            (check_element): Err for non-standard or deprecated elements.
1119            (_attr_status_info): For non-standard or deprecated attributes.
1120    
1121    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1122    
1123            * ContentChecker.pm (_attr_status_info): New internal method.
1124    
1125    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1126    
1127            * ContentChecker.pm (check_element): Element standardized
1128            status information is now dispatched.
1129    
1130    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1131    
1132            * ContentChecker.pm (check_element): Fix |del|-and-significant
1133            problem by adding some more arguments.
1134    
1135    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1136    
1137            * ContentChecker.pm (check_element): Use context of
1138            container-for-the-purpose-of-content-model element (not transparent
1139            element) for |check_child_element| calling and significant
1140            text flag marking.  This reintroduces |<del>|-and-significant
1141            problem again.
1142    
1143    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
1144    
1145            * ContentChecker.pm (check_element): Make semi-transparent
1146            elements ignored for the purpose of phase changes in
1147            content model checking.
1148    
1149  2008-02-23  Wakaba  <wakaba@suika.fam.cx>  2008-02-23  Wakaba  <wakaba@suika.fam.cx>
1150    
1151          * ContentChecker.pm (check_element): In-element state          * ContentChecker.pm (check_element): In-element state

Legend:
Removed from v.1.141  
changed lines
  Added in v.1.329

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24