/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.162 by wakaba, Thu Mar 6 15:29:39 2008 UTC revision 1.332 by wakaba, Sun Sep 21 11:55:49 2008 UTC
# Line 1  Line 1 
1    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
2    
3            * ContentChecker.pm: |contextmenu| validness is now checked using
4            |id| and |id_type| properties, and |menu| property is removed.
5    
6    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
7    
8            * ContentChecker.pm: Prepare for |form| |name| attribute's
9            duplication checking.
10    
11    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
12    
13            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
14            unsupported charset="" parameter value (HTML5 revision 2131).
15    
16    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
17    
18            * HTML.pm.src: Reminding places where U+000B is allowed as a space
19            character is fixed (cf. HTML5 revision 1738).
20    
21            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
22            space characters (HTML5 revision 1738).
23    
24    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
25    
26            * HTML.pm.src: The "anything else" case for the "after after body"
27            insertion mode was not updated to swtich to the "in body"
28            insertion mode.  U+000B is no longer a space character for the
29            purpose of tree construction phase (HTML5 revision 1738).
30    
31    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
32    
33            * HTML.pm.src: U+000B is no longer a space character (HTML5
34            revision 1738).
35    
36    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
37    
38            * ContentType.pm: 0x0B is no longer a space character (HTML5
39            revision 1738).
40    
41            * HTML.pm.src: U+000B is no longer a space character for the
42            algorithm for extracting an encoding from a Content-Type (HTML5
43            revision 1738).
44    
45    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
46    
47            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
48    
49    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
50    
51            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
52    
53            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
54            matching.
55    
56    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
57    
58            * LangTag.pm: Warn for private use language subtags.  Error level
59            typos fixed.  Support for Suppress-Script field.
60    
61            * mklangreg.pl: Support for dumping of nested structure.
62    
63    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
64    
65            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
66            recommended case as per RFC 4646.
67    
68    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
69    
70            * LangTag.pm (check_rfc4646_langtag): New method.
71    
72    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
73    
74            * mklangreg.pl: New script.
75    
76            * Makefile: Updated for creation of the module for language subtag
77            registry.
78            
79    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
80    
81            * Makefile: WebIDL.html added.
82    
83            * WebIDL.pod: New documentation.
84    
85    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
86    
87            * WebIDL.pm: Checker's error types are redefined.
88    
89    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
90    
91            * WebIDL.pm: Parser's error types are redefined.  Some forward
92            compatible parsing bugs are fixed.  Some unreachable codes are
93            commented out.
94    
95    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
96    
97            * WebIDL.pm: Support for the reminding extended attributes are
98            added.  It does not satisfy the definition that a forward
99            interface declaration has an extended attribute.  It seems that
100            unless explicitly allowed multiple extended attributes with the
101            same name is not allowed, though it is not explicitly mentioned in
102            the spec.
103    
104    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
105    
106            * WebIDL.pm: Unescapes extended attribute names and extended
107            attribute identifiers.  Preserve whether an extended attribute has
108            an argument list of not.  Support for extended attributes:
109            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
110            NameGetter, NameSetter, and Null.
111            (has_argument_list): New attribute.
112            (idl_text): Stringifies argument lists, if any, even if it is
113            empty.
114    
115    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
116    
117            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
118            |{s_kwd}| in DATA_STATE as default.
119    
120    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
121    
122            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
123            by |{s_kwd}| in DATA_STATE.
124    
125    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
126    
127            * HTML.pm.src: Shorten keys.
128    
129    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
130    
131            * HTML.pm.src: Remove checking for control character, surrogate
132            pair, or noncharacter code points and non-Unicode code
133            points (they should be handled by Whatpm::Charset::UnicodeChecker).
134            (parse_char_stream): Support for the |$get_wrapper| argument and
135            character stream error handlers.
136    
137    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
138    
139            * ContentChecker.pm: Don't call |loda_ns_module|
140            for null-namespace elements/attributes.
141    
142            * HTML.pm.src: Fact out $disallowed_control_chars
143            as a hash.
144    
145    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
146    
147            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
148            and |{next_char}| initializations are moved to initialization
149            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
150            with |parse_char_stream|.
151    
152    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
153    
154            * HTML.pm.src (parse_char_stream): Make |set_next_char|
155            invoke |manakai_read_until|, not only |read|, where
156            possible, to decrease the number of |read| method calls.
157    
158            * mkhtmlparser.pl: Related changes to the aforementioned
159            modification.
160    
161    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
162    
163            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
164            would report character error from now.
165    
166    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
167    
168            * HTML.pm.src: White-space-leaded non-white-space character
169            tokens in "before head insertion mode" was not
170            correctly handled.
171            (set_inner_html): Reimplemented using CharString decodehandle
172            class.  Support for $get_wrapper argument.  Support
173            for |{read_until}| feature.
174    
175    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
176    
177            * HTML.pm.src: Make a "bare ero" error for unknown
178            entities point the "&" character.
179    
180    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
181    
182            * HTML.pm.src: It turns out that U+FFFD don't have to
183            be added to the list of excluded characters.
184    
185    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
186    
187            * HTML.pm.src ($char_onerror): Have character decoder's |line|
188            and |column| a higher priority than the one set by the
189            tokenizer's input handler.
190            ($self->{read_until}): Exclude U+FFFD (but this might
191            not be necessary, since now we do line/column fixup in
192            the character decode handle).
193    
194    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
195    
196            * HTML.pm.src: Use |{read_until}| where possible.
197    
198    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
199    
200            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
201            and |manakai_getc_until| to |manakai_read_until| to
202            reduce the number of string copies.
203    
204    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
205    
206            * HTML.pm.src (parse_char_string): Use newly created
207            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
208            standard feature to |open| a string as a filehandle,
209            since Perl's string filehandle seems not supporting |ungetc|
210            method correctly.
211            (parse_char_stream): Define |{getc_until}| method.
212            (DATA_STATE): Experimental support for |getc_until| feature.
213    
214    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
215    
216            * HTML.pm.src: Check points added to newly added branches.
217    
218    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
219    
220            * HTML.pm.src: Remove |{char}|, which is no longer used.
221            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
222            and replaced by |{prev_state}|.
223    
224            * mkhtmlparser.pl: Remove |{char}| feature.
225            Remove |!!!back-next-input-character;| macro.
226    
227    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
228    
229            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
230            entity related tokenizer states in favor of new states
231            implementing the consume character reference algorithm.
232    
233    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
234    
235            * HTML.pm.src: "Consume a character reference" algorithm is
236            now implemented as a tokenizer's state, rather than
237            a method, with minimum changes (more changes will
238            be made, in due course).  "Bogus comment state"'s inner
239            loop gets removed.
240    
241    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
242    
243            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
244            into their own tokenizer states.
245    
246    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
247    
248            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
249            is split into three states.
250    
251    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
252    
253            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
254            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
255            no longer does the tokenizer have to push back next input
256            characters in those states.
257    
258    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
259    
260            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
261            into four states so that no longer does the tokenizer have to push
262            back next input characters in that state.
263    
264    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
265    
266            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
267            which can be used to insert some wrapper between the character
268            stream handle and the tokenizer.  (It is currently not supported
269            for |set_inner_html| for |Element|s).
270    
271    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
272    
273            * HTML.pm.src: Ignore punctuations in charset names.
274    
275    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
276    
277            * ContentChecker.pm: Support for charset-layer error levels.
278    
279            * HTML.pm.src: Don't specify |text| argument for the
280            |chardecode:fallback| error, since it is not the encoding
281            being used alternatively.
282    
283    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
284    
285            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
286    
287    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
288    
289            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
290    
291    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
292    
293            * HTML.pm.src: Bug fix and sync with the spec with regard
294            to after after frameset insertion mode processing (HTML5
295            revision 1909).  Note that the implementation was wrong
296            per the old spec before the r1909 changes.
297    
298    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
299    
300            * HTMLTable.pm: scope=auto algorithm fix synced with the
301            spec (HTML5 revision 2093).
302            ($process_row): Algorithm step numbers synced with the
303            spec (HTML5 revision 2092).
304    
305    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
306    
307            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
308            revision 2094).
309    
310    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
311    
312            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
313    
314    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
315    
316            * HTML.pm.src: '"' and "'" at the end of attribute
317            name (after another attribute) now raise parse error (HTML5
318            revision 2123).  Empty unquoted attribute values are no
319            longer allowed (HTML5 revision 2122).
320    
321    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
322    
323            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
324            revision 2130).
325    
326    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
327    
328            * ContentChecker.pm: |xml:lang| attribute value must be same
329            as |lang| attribute value for HTML elements (HTML5 revision 2062
330            and so on).
331    
332    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
333    
334            * ContentChecker.pm: Error level definition for |xml_id_error|
335            was missing.
336    
337            * URIChecker.pm: The end of the URL should be marked as the
338            error location for an empty path error.  The position
339            between the userinfo and the port components should be
340            marked as the error location for an empty host error.
341    
342    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
343    
344            * URIChecker.pm: Set parameters representing where in the
345            value the error occurs for errors.  Report unknown
346            address format error in warning level, since address
347            formats are rarely added.  Path segments starting with "/.."
348            were misinterpreted as a dot-segment.
349    
350    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
351    
352            * URIChecker.pm (check_iri_reference): Requires
353            |Message::DOM::DOMImplementation|.
354    
355    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
356    
357            * IMTChecker.pm: Updated for the new error reporting architecture.
358    
359            * ContentChecker.pm: Error levels for IMTs are added.
360    
361    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
362    
363            * H2H.pm (_shift_token): Support for unquoted HTML attribute
364            values.
365    
366    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
367    
368            * CacheManifest.pm: Support for new style of error
369            reports.
370    
371            * HTML.pm.src: Set line=1, column=1 to the document node.
372    
373    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
374    
375            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
376            and URL checkers.  Support for more error levels for bogus
377            langauge tag and URL "standards".
378    
379            * LangTag.pm, URIChecker.pm: Support for new style error
380            level reporting.
381    
382    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
383    
384            * ContentChecker.pm: Support for RDF/XML error levels.
385    
386            * HTMLTable.pm, RDFXML.pm: Support for new style of error level
387            specifying.  Error types are revised.
388    
389    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
390    
391            * ContentChecker.pm: All error reporting method calls are
392            renewed.
393    
394    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
395    
396            * HTML.pm.src: All error type names and "text" parameters
397            are revised.  Use new style for "level" specification.
398    
399            * mkhtmlparser.pl: Use new style for "level" specification.
400    
401    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
402    
403            * WebIDL.pm (parse_char_string): Simplified error
404            reporting process for broken ignored valuetype definition.
405            (Valuetype idl_text): Support for special "DOMString" name.
406    
407    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
408    
409            * WebIDL.pm ($get_scoped_name): Append "::::" if the last
410            terminal of the ScopedName is "DOMString", such that whether
411            the last part of the scoped name is "DOMString" or "_DOMString"
412            later.  It is necessary to determine whether a |typedef|
413            definition should be ignored or not.
414            (parse_char_string): Unescape the identifier of
415            exception members.
416            ($resolve): Return undef for builtin types and sequence<T>
417            types (we might not have to do this, however...).
418            (check): Support checking for Exceptions, Valuetypes,
419            and Typedefs.
420            ($serialize_type): Support for "DOMString::::" syntax.
421            (Typedef idl_text): Output Type as "DOMString" if it
422            is really "DOMString" (i.e. its internal representation
423            is "::DOMString::").
424    
425    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
426    
427            * WebIDL.pm ($resolve): New code, based on resolve code
428            for constant types in the |check| method.
429            (check): Support for checking of attributes, operations, and
430            arguments.
431            (Attribute/Operation idl_text): Exception names in getraises,
432            setraises, and raises clauses is serizlied by |$serialize_type|
433            code.
434    
435    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
436    
437            * WebIDL.pm ($integer): Order of selections are changed to match
438            hexadecimal numbers (the original pattern, taken from the spec,
439            was not work for hexadecimal numbers, because the "0" prefix
440            matches to the [0-7]* part (as an empty string) and therefore
441            it does not match with remaining "x..." part of a "0x..." integer
442            literal.
443            ($get_type): It now returns a string, not an array reference,
444            for regular types and |sequence| types (i.e. it in any case
445            returns a string).
446            ($get_next_token): The second item in the array that represents
447            a integer or float token is now a Perl number value, not the
448            original string representation of the number.
449            (check): Support for const value consistency checking.
450            No extended attribute is defined for constants.
451            (Node subclasses): Use simple strings rather than array references
452            for default data type values.
453            ($serialize_type): Type values are now simple strings.
454            (value): If the new attribute value is a false value, then
455            a FALSE value is set to the attribute.
456    
457    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
458    
459            * WebIDL.pm ($get_scoped_name): Now scoped names are stored
460            in its stringified format ("scoped name" as defined in the
461            spec).  Note that future version of this module should not use
462            array references for type values and the |type_text| attribute
463            should be made obsolete.
464            (parse_char_string): Unescape attribute names.
465            (check): Support for checking of whether inherited interfaces
466            are actually defined or not.  Support for checking of whether
467            interface member identifiers are duplicated or not.
468            ($serialize_type): Scoped names are returned as is.  A future
469            version of this code should escape identifiers other than "DOMString",
470            otherwise the idl_text would be non-conforming.
471    
472    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
473    
474            * WebIDL.pm (parse_char_string): Set line/column numbers
475            to generated nodes.  Unescape identifiers.  Extended attributes
476            for Definition's were ignored.
477            (append_child): Set |parent_node| attribute.
478            (parent_node): New attribute.
479            (check): Support interface/exception members.  Support
480            extended attributes.  Support definition identifier uniqueness
481            constraint.
482            (qualified_name): New attribute.
483            (Interface/Exception idl_text): Extended attributes were
484            not prepended to the returned text.
485    
486    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
487    
488            * WebIDL.pm (parse_char_string): Set line/column numbers
489            to interface object experimentally.  s/shift/pop/g, shift
490            would make things wrong.  Support for interface forward
491            declarations was missing.  Broken interface declarations
492            with no block were not ignored entirely.
493            (Whatpm::WebIDL::Node): New abstract class.  This class
494            makes things easier.
495            (child_nodes): New attribute.  Unlike DOM's attribute with
496            same name, this attribute returns a dead list of nodes for
497            simplicity.
498            (get_user_data, set_user_data): New methods.
499            (Module idl_text): A SPACE character should be inserted
500            before the |{| character.
501            (Interface idl_text): Support for interface forward declarations.
502            (is_forward_declaration): New attribute.
503    
504    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
505    
506            * WebIDL.pm (type_text): Better serializer.
507    
508    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
509    
510            * WebIDL.pm: Revise forward-compatible parsing so that
511            it now can handle broken extended attributes and as such.
512    
513    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
514    
515            * WebIDL.pm: Real support for extended attributes.
516            Support for extended attributes with arguments.
517    
518    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
519    
520            * WebIDL.pm: Support for |exception| syntax.
521            (Interface->idl_text): Tentative support for inheritances.
522    
523    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
524    
525            * WebIDL.pm: Hierarchical scoped name support was broken.
526            Support for raises, setraises, and getraises syntaxes.
527    
528    2008-07-18  Wakaba  <wakaba@suika.fam.cx>
529    
530            * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
531            proper support for types, extended attributes, and exceptions yet).
532            WebIDL parser, version 1 (no support for exceptions yet,
533            no proper support for extended attributes yet).
534    
535    2008-07-09  Wakaba  <wakaba@suika.fam.cx>
536    
537            * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
538    
539    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
540    
541            * WebIDL.pm: Support for valuetype and const.
542    
543    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
544            
545            * WebIDL.pm: New module.
546    
547    2008-06-15  Wakaba  <wakaba@suika.fam.cx>
548    
549            * Makefile (Entities.html): URI changed.
550    
551    2008-06-08  Wakaba  <wakaba@suika.fam.cx>
552    
553            * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
554    
555    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
556    
557            * HTML.pm.src (_get_next_token): A parse error was missing.
558    
559    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
560    
561            * mklinktypelist.pl: rel=contact is no longer part of the HTML5
562            spec (commented out). (HTML5 revision 1711).
563    
564    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
565    
566            * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
567    
568            * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
569            as UTF-16 (HTML5 revision 1701).
570    
571    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
572    
573            * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
574            1692).
575    
576    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
577    
578            * HTML.pm.src: The secondary insertion mode used when switching
579            to foreign content is the "in body" insertion mode (HTML5 revision
580            1696).
581    
582    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
583    
584            * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
585            revision 1697).
586    
587    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
588    
589            * HTML.pm.src: Support for end-of-file token in foreign content
590            insertion mode (HTML5 revision 1693).  Update SVG camelCase
591            attribute list (HTML5 revision 1700).  <textarea> closes
592            </select> (HTML5 revision 1699).  More start tags close in
593            foreign content insertion mode (HTML5 revision 1698).
594    
595    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
596    
597            * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
598    
599    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
600    
601            * HTML.pm.src: More robust charset parameter detection (HTML5
602            revision 1674).
603    
604    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
605    
606            * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
607            revision 1676).
608    
609    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
610    
611            * HTML.pm.src: Ignore language part of public identifiers for
612            quriks mode detection (HTML5 revision 1679).
613    
614    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
615    
616            * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
617            revision 1685).
618    
619    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
620    
621            * HTML.pm.src: Support for EOF in new states for tags (HTML5
622            revision 1684).
623    
624    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
625    
626            * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
627            work (HTML5 revision 1690).
628    
629    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
630    
631            * HTML.pm.src (_tree_construction_main): Change handling of
632            end tags in head insertion modes (HTML5 revision 1686).
633            (parse_char_string): Bug fix for non-utf8 character string handlings.
634            (parse_char_stream): |ungetc| does not work well for this context.
635    
636    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
637    
638            * HTML.pm.src (parse_byte_string): Redefined to invoke
639            |parse_byte_stream|.
640            (parse_byte_stream): New method.
641    
642    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
643    
644            * HTML.pm.src (parse_byte_string): Fix the column number reported
645            by encoding layer error reporter.
646    
647    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
648    
649            * HTML.pm.src (parse_byte_string): Use streaming decoder
650            rather than converting the whole byte string and then parsing.
651            Propagate errors in character encoding layer.
652            (get_next_token): Precise error reporting for |bare stago| error.
653    
654    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
655    
656            * HTML.pm.src (parse_char_stream): New method.
657            (parse_char_string): This method is now defined as an invocation
658            of the |parse_char_stream| method.
659    
660    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
661    
662            * HTML.pm.src (parse_byte_string): Report various status
663            of the sniffing as info-level errors.  Support for new
664            decoding framework in parser resestting.
665            (new): Various default error levels were not set.
666    
667    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
668    
669            * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
670            algorithm, except for the actual sniffing, is implemented
671            with new framework with Message::Charset::Info.
672    
673    2008-05-16  Wakaba  <wakaba@suika.fam.cx>
674    
675            * CacheManifest.pm (_parse): Drop fragment identifiers from
676            URIs in fallback section (HTML5 revision 1596).
677    
678    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
679    
680            * Makefile (Entities.html): URI has changed.
681    
682    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
683    
684            * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
685            1553).
686    
687    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
688    
689            * ContentChecker.pm: Noted that those returned in |table| are
690            no longer table elements, but table objects returned
691            by Whatpm::HTMLTable.
692    
693            * HTMLTable.pm (form_table): Return table element node
694            as |$table->{element}|.
695            (assign_header): Support for the |headers=""| attribute.
696    
697    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
698    
699            * HTMLTable.pm (assign_header): New function; first version
700            with no support for headers="".
701            (form_table): Include table width and height to the returned
702            table object for covenience.  Indexing in column assignement was wrong.
703            Set whether a data cell is empty or not for convenience.
704    
705    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
706    
707            * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
708    
709    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
710    
711            * HTMLTable.pm: How table model errors are detected is
712            changed (HTML5 revision 1387).
713    
714    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
715    
716            * HTMLTable.pm: The algorithm now moves |tfoot| elements
717            to the end of the table (HTML5 revision 1380).
718    
719    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
720    
721            * HTMLTable.pm: The algorithm is now 0-based indexing, instead
722            of 1-based (HTML5 revision 1376).
723    
724    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
725    
726            * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
727            and "Content-Encoding" no longer prevent sniffing (HTML5
728            revision 1288).
729    
730    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
731    
732            * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
733            1282).
734    
735    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
736    
737            * ContentChecker.pm: Support for global attributes.
738            Status of XML specs are added.
739    
740    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
741    
742            * ContentChecker.pm (check_element): Support for |template=""|
743            and |ref=""| attribute (referent element type checking).
744    
745    2008-04-29  Wakaba  <wakaba@suika.fam.cx>
746    
747            * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
748            1500) is implemented (except for IDNA part and URI-scheme-specific
749            knowledge).  Line number counting was wrong for LF-only
750            documents.
751    
752    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
753    
754            * HTML.pm.src: Raise an parse error for any disallowed
755            character (HTML5 revision 1263).
756    
757    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
758    
759            * mkentitylist.pl: Support for new HTML5 entity table format (the
760            definition for |AElig;| was missing).
761    
762    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
763    
764            * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
765            name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
766    
767    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
768    
769            * HTML.pm.src: List of element names that close foreign content
770            insertion mode is added (HTML5 revisions 1412 and 1418).
771    
772    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
773    
774            * HTML.pm.src: Support for |mglyph| and |malignmark|
775            elements (HTML5 revision 1410).
776    
777    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
778    
779            * HTML.pm.src: Support for new long MathML entities (HTML5
780            revision 1406).
781    
782    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
783    
784            * HTML.pm.src: CDATA section support for MathML and SVG
785            elements (HTML5 revision 1404 and 1420).
786    
787    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
788    
789            * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
790            elements (HTML5 revision 1404).  Unused !!!macro definitions
791            are removed.
792    
793    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
794    
795            * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
796            are raised is changed (HTML5 revision 1404).
797    
798    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
799    
800            * HTML.pm.src: Category sets that are no longer used are removed.
801    
802    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
803    
804            * HTML.pm.src: The ->[1] property of stack entries are now
805            replaced by constants representing element category.
806    
807    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
808    
809            * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
810            for error reporting.  (This is a preparation for using constant
811            value for ->[1].)
812    
813    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
814    
815            * RDFXML.pm: Typo fixed.
816    
817    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
818    
819            * ContentChecker.pm: |fact_level| is now treated
820            as same as |must_level|, i.e. level = |m|.
821            (check_element): Make list of URIs in the DOM.
822    
823    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
824    
825            * RDFXML.pm: Language accessor implemented.
826            Local (null-namespace) attribute support.
827    
828    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
829    
830            * RDFXML.pm: Factored out ID checking code.
831    
832    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
833    
834            * RDFXML.pm: TODO items noted.  Validation of ID
835            and URI attributes is implemented.  Warn if unknown
836            value is used in rdf:parseType="" attribute.
837    
838            * URIChecker.pm (check_rdf_uri_reference): New function.
839    
840    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
841    
842            * RDFXML.pm: bnodeid implemented.  Relative references
843            are now resolved.
844    
845    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
846    
847            * ContentChecker.pm: RDF reification implemented.
848    
849            * RDFXML.pm: undef vs false bug fixed.
850            Reification implemented.
851    
852    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
853    
854            * RDFXML.pm: s/id/ID/ for attribute name.
855            The |node| arguments are added for |ontriple| calls.
856            Too many "attribute not allowed" errors were raised.
857    
858            * ContentChecker.pm: Initial experimental support for rdf:RDF
859            element.
860    
861    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
862    
863            * RDFXML.pm: New module.
864    
865    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
866    
867            * HTML.pm.src (set_inner_html): Line/column number
868            code was old one yet.
869    
870    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
871    
872            * HTML.pm.src: Better line/column reporting for "duplicate attribute"
873            errors.  Line/column markings for DOCTYPE, comment, and
874            character tokens are reintroduced; otherwise,
875            error location for "not HTML5" error and errors
876            for implied elements are not attached.
877    
878    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
879    
880            * HTML.pm.src: Set line/column numbers to attributes.
881    
882            * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
883            (value): Setter implemented.
884    
885            * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
886    
887    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
888    
889            * HTML.pm.src: Unused line/column markings are removed.
890    
891    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
892    
893            * HTML.pm.src (_get_next_token): Remove |first_start_tag|
894            flag, which is no longer used.
895    
896    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
897    
898            * HTML.pm.src: Set line/column information to element nodes.
899    
900            * mkhtmlparser.pl (!!!create-element, !!!insert-element,
901            and !!!insert-element-t): Set line/column information to
902            element nodes.
903    
904    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
905    
906            * HTML.pm.src (_get_next_token): The first "<" character
907            in "<?", "<>", or "</>" should be the error point.
908    
909    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
910    
911            * HTML.pm.src: Some more fixes on error position reporting.
912    
913    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
914    
915            * HTML.pm.src: Token-level precious error reporting.
916    
917    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
918    
919            * HTML.pm.src: Preparation for more precious error point
920            reporting.
921    
922    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
923    
924            * HTML.pm.src: Error type revised.
925    
926    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
927    
928            * HTML.pm.src: Similar codes are merged together, again.
929    
930    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
931    
932            * HTML.pm.src: Similar codes are merged together.
933    
934    2008-03-10  Wakaba  <wakaba@suika.fam.cx>
935    
936            * mkhtmlparser.pl: Set "level" parameter to parse errors.
937    
938            * HTML.pm.src: Code refined.
939    
940    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
941    
942            * HTML.pm.src: |</body>| treatement has been changed (HTML5
943            revision 1348).  Note that I really don't know this makes
944            any difference in the black-box behavior of the parser.
945    
946    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
947    
948            * HTML.pm.src: New end-of-file token implementation (HTML5
949            revision 1348).
950    
951    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
952    
953            * HTML.pm.src: |applet| support (HTML5 revision 1347).
954    
955    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
956    
957            * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
958    
959    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
960    
961            * HTML.pm.src: Support for |<input>| in the "in select" insertion
962            mode and support for the "in select in table" insertion mode (HTML5
963            revision 1342).
964    
965    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
966    
967            * HTML.pm.src: No foster parenting for <script> and <script>
968            in non-tainted <table>s (HTML5 revision 1336).
969    
970    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
971    
972            * HTML.pm.src: Ignore white space characters between <html>
973            and <head> (HTML5 revision 1332).
974    
975    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
976    
977            * HTML.pm.src: Treat <input type=hidden> as if it were a
978            white space (HTML5 revision 1331).
979    
980    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
981    
982            * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
983            element (HTML5 revision 1330).
984    
985    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
986    
987            * HTML.pm.src: <title> is always appended to the current
988            element (HTML5 revision 1328).
989    
990    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
991    
992            * HTML.pm.src: White space in tainted tables are moved
993            into foster parents (HTML5 revision 1326).
994    
995    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
996    
997            * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
998            revision 1321).
999    
1000    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1001    
1002            * HTML.pm.src: |</h/n/>| case code rearranged to align with
1003            the spec (HTML5 revision 1320).  Note that we finally complete
1004            all of HTML5 revision 1320 changes.
1005    
1006    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1007    
1008            * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
1009            tags (HTML5 revision 1320).
1010    
1011    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1012    
1013            * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
1014    
1015    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1016    
1017            * HTML.pm.src: A "generate implied end tags" code (t409.1)
1018            could not be reached so that it is now removed (HTML5 revision
1019            1320).
1020    
1021    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1022    
1023            * HTML.pm.src: Code for the case of |</div>| and so on are revised
1024            to align with new spec text (HTML5 revision 1320).
1025    
1026    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1027    
1028            * HTML.pm.src: Remove strange |if| condition; however, it should
1029            have had no harm in theory.
1030    
1031    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1032    
1033            * HTML.pm.src (_tree_construction_main): '</p>' in body
1034            case is split from other end tags for the preparation
1035            of implementing HTML revision 1320.
1036    
1037    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
1038    
1039            * HTML.pm.src: Simplified "generate implied end tag" (HTML5
1040            revision 1320).
1041    
1042  2008-03-07  Wakaba  <wakaba@suika.fam.cx>  2008-03-07  Wakaba  <wakaba@suika.fam.cx>
1043    
1044          * HTML.pm.src (_tree_construction_main): Merge rules for "h1"          * HTML.pm.src (_tree_construction_main): Merge rules for "h1"

Legend:
Removed from v.1.162  
changed lines
  Added in v.1.332

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24