Diff of /markup/html/whatpm/Whatpm/ChangeLog

revision 1.163 by wakaba, Thu Mar 6 15:56:52 2008 UTC
revision 1.344 by wakaba, Sat Oct 4 12:20:35 2008 UTC
# Line 1  Line 1 
1    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src: Support for <option> and <optgroup> in body (HTML5
4            revisions 1731 and 2128).
5    
6    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
7    
8            * HTML.pm.src: Make <foreignObject> scoping (HTML5 revision 1837).
9            Support for end tags of camelCase SVG elements was broken.  A
10            wrong error type text is fixed.
11    
12    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
13    
14            * HTML.pm.src: Drop redundant code (HTML5 revision 1731).
15    
16    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
17    
18            * HTML.pm.src: Support for new definition of |param| and |source|
19            start tag parsing (HTML5 revision 1731).
20    
21    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
22    
23            * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
24    
25    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
26    
27            * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
28            revisions 1731 and 1831).
29    
30    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
31    
32            * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
33            revisions 1731 and 1778).  Support for the </sarcasm> end tag (HTML5
34            revision 1731).
35    
36    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
37    
38            * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
39            revision 1731).  End tags of |option| and |optgroup| elements are
40            now optional (HTML5 revision 1731).
41    
42    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
43    
44            * HTML.pm.src: New "special" elements added to the list (HTML5
45            revision 1778).  "strile" -> "strike".
46    
47    2008-10-02  Wakaba  <wakaba@suika.fam.cx>
48    
49            * ContentType.pm (get_sniffed_type): Support for the "better"
50            content sniffing (HTML5 revision 1927).  In one case the official
51            type was not returned when the method was invoked in list
52            context.
53    
54    2008-09-22  Wakaba  <wakaba@suika.fam.cx>
55    
56            * HTML.pm.src: Character references for non-space C0 characters,
57            including U+000B VT, DEL character, noncharacter code points, are
58            now converted to the U+FFFD character (cf. HTML5 revision 2138).
59    
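The replacement rule in the 2008-09-22 entry above can be sketched as follows.
This is a minimal illustration in Python, not the code in HTML.pm.src: the
function names are hypothetical, and the exact set of replaced code points
(for example how U+0000 or values above U+10FFFF are treated) is an assumption
based only on the wording of the entry.

```python
REPLACEMENT_CHARACTER = 0xFFFD

# Space characters that stay as they are: TAB, LF, FF, CR.
# U+000B VT is deliberately absent, as the entry says it is replaced.
SPACE_CONTROLS = {0x09, 0x0A, 0x0C, 0x0D}


def is_noncharacter(code_point):
    """True for U+FDD0..U+FDEF and for any U+xxFFFE / U+xxFFFF."""
    return 0xFDD0 <= code_point <= 0xFDEF or (code_point & 0xFFFE) == 0xFFFE


def resolve_numeric_reference(code_point):
    """Map the number in a reference such as &#x0B; to the code point to emit."""
    if code_point < 0x20 and code_point not in SPACE_CONTROLS:
        return REPLACEMENT_CHARACTER  # non-space C0 control, including VT
    if code_point == 0x7F:
        return REPLACEMENT_CHARACTER  # DEL
    if is_noncharacter(code_point):
        return REPLACEMENT_CHARACTER
    return code_point


if __name__ == "__main__":
    for cp in (0x0A, 0x0B, 0x41, 0x7F, 0xFDD0, 0x1FFFF):
        print("U+%04X -> U+%04X" % (cp, resolve_numeric_reference(cp)))
```

The sketch covers only the cases named in the entry; other special cases in
the character reference algorithm are left out.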
60    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
61    
62            * ContentChecker.pm: |form=""| check support added.
63    
64    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
65    
66            * ContentChecker.pm: |contextmenu| validness is now checked using
67            |id| and |id_type| properties, and |menu| property is removed.
68    
69    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
70    
71            * ContentChecker.pm: Prepare for |form| |name| attribute's
72            duplication checking.
73    
74    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
75    
76            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
77            unsupported charset="" parameter value (HTML5 revision 2131).
78    
79    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
80    
81            * HTML.pm.src: Remaining places where U+000B is allowed as a space
82            character are fixed (cf. HTML5 revision 1738).
83    
84            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
85            space characters (HTML5 revision 1738).
86    
87    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
88    
89            * HTML.pm.src: The "anything else" case for the "after after body"
90            insertion mode was not updated to switch to the "in body"
91            insertion mode.  U+000B is no longer a space character for the
92            purpose of tree construction phase (HTML5 revision 1738).
93    
94    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
95    
96            * HTML.pm.src: U+000B is no longer a space character (HTML5
97            revision 1738).
98    
99    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
100    
101            * ContentType.pm: 0x0B is no longer a space character (HTML5
102            revision 1738).
103    
104            * HTML.pm.src: U+000B is no longer a space character for the
105            algorithm for extracting an encoding from a Content-Type (HTML5
106            revision 1738).
107    
108    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
109    
110            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
111    
112    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
113    
114            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
115    
116            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
117            matching.
118    
119    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
120    
121            * LangTag.pm: Warn for private use language subtags.  Error level
122            typos fixed.  Support for Suppress-Script field.
123    
124            * mklangreg.pl: Support for dumping of nested structure.
125    
126    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
127    
128            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
129            recommended case as per RFC 4646.
130    
131    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
132    
133            * LangTag.pm (check_rfc4646_langtag): New method.
134    
135    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
136    
137            * mklangreg.pl: New script.
138    
139            * Makefile: Updated for creation of the module for language subtag
140            registry.
141            
142    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
143    
144            * Makefile: WebIDL.html added.
145    
146            * WebIDL.pod: New documentation.
147    
148    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
149    
150            * WebIDL.pm: Checker's error types are redefined.
151    
152    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
153    
154            * WebIDL.pm: Parser's error types are redefined.  Some forward
155            compatible parsing bugs are fixed.  Some unreachable code is
156            commented out.
157    
158    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
159    
160            * WebIDL.pm: Support for the remaining extended attributes is
161            added.  It does not satisfy the definition that a forward
162            interface declaration has an extended attribute.  It seems that
163            unless explicitly allowed, multiple extended attributes with the
164            same name are not allowed, though it is not explicitly mentioned in
165            the spec.
166    
167    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
168    
169            * WebIDL.pm: Unescapes extended attribute names and extended
170            attribute identifiers.  Preserve whether an extended attribute has
171            an argument list or not.  Support for extended attributes:
172            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
173            NameGetter, NameSetter, and Null.
174            (has_argument_list): New attribute.
175            (idl_text): Stringify the argument list, if any, even if it is
176            empty.
177    
178    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
179    
180            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
181            |{s_kwd}| in DATA_STATE as default.
182    
183    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
184    
185            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
186            by |{s_kwd}| in DATA_STATE.
187    
188    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
189    
190            * HTML.pm.src: Shorten keys.
191    
192    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
193    
194            * HTML.pm.src: Remove checking for control character, surrogate
195            pair, or noncharacter code points and non-Unicode code
196            points (they should be handled by Whatpm::Charset::UnicodeChecker).
197            (parse_char_stream): Support for the |$get_wrapper| argument and
198            character stream error handlers.
199    
200    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
201    
202            * ContentChecker.pm: Don't call |load_ns_module|
203            for null-namespace elements/attributes.
204    
205            * HTML.pm.src: Factor out $disallowed_control_chars
206            as a hash.
207    
208    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
209    
210            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
211            and |{next_char}| initializations are moved to initialization
212            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
213            with |parse_char_stream|.
214    
215    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
216    
217            * HTML.pm.src (parse_char_stream): Make |set_next_char|
218            invoke |manakai_read_until|, not only |read|, where
219            possible, to decrease the number of |read| method calls.
220    
221            * mkhtmlparser.pl: Related changes to the aforementioned
222            modification.
223    
224    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
225    
226            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
227            now reports character errors.
228    
229    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
230    
231            * HTML.pm.src: Non-white-space character tokens with leading
232            white space were not correctly handled in the "before head"
233            insertion mode.
234            (set_inner_html): Reimplemented using CharString decodehandle
235            class.  Support for $get_wrapper argument.  Support
236            for |{read_until}| feature.
237    
238    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
239    
240            * HTML.pm.src: Make a "bare ero" error for unknown
241            entities point to the "&" character.
242    
243    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
244    
245            * HTML.pm.src: It turns out that U+FFFD doesn't have to
246            be added to the list of excluded characters.
247    
248    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
249    
250            * HTML.pm.src ($char_onerror): Give the character decoder's |line|
251            and |column| a higher priority than the ones set by the
252            tokenizer's input handler.
253            ($self->{read_until}): Exclude U+FFFD (but this might
254            not be necessary, since now we do line/column fixup in
255            the character decode handle).
256    
257    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
258    
259            * HTML.pm.src: Use |{read_until}| where possible.
260    
261    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
262    
263            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
264            and |manakai_getc_until| to |manakai_read_until| to
265            reduce the number of string copies.
266    
267    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
268    
269            * HTML.pm.src (parse_char_string): Use newly created
270            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
271            standard feature to |open| a string as a filehandle,
272            since Perl's string filehandle does not seem to support the |ungetc|
273            method correctly.
274            (parse_char_stream): Define |{getc_until}| method.
275            (DATA_STATE): Experimental support for |getc_until| feature.
276    
277    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
278    
279            * HTML.pm.src: Check points added to newly added branches.
280    
281    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
282    
283            * HTML.pm.src: Remove |{char}|, which is no longer used.
284            Replace |{entity_in_attr}| and |{last_attribute_value_state}|
285            with |{prev_state}|.
286    
287            * mkhtmlparser.pl: Remove |{char}| feature.
288            Remove |!!!back-next-input-character;| macro.
289    
290    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
291    
292            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
293            entity related tokenizer states in favor of new states
294            implementing the consume character reference algorithm.
295    
296    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
297    
298            * HTML.pm.src: "Consume a character reference" algorithm is
299            now implemented as a tokenizer's state, rather than
300            a method, with minimum changes (more changes will
301            be made, in due course).  "Bogus comment state"'s inner
302            loop gets removed.
303    
304    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
305    
306            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
307            into their own tokenizer states.
308    
309    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
310    
311            * HTML.pm.src: |CDATA_SECTION_STATE| (formerly |CDATA_BLOCK_STATE|)
312            is split into three states.
313    
314    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
315    
316            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
317            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
318            no longer does the tokenizer have to push back next input
319            characters in those states.
320    
321    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
322    
323            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
324            into four states so that no longer does the tokenizer have to push
325            back next input characters in that state.
326    
327    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
328    
329            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
330            which can be used to insert some wrapper between the character
331            stream handle and the tokenizer.  (It is currently not supported
332            for |set_inner_html| for |Element|s).
333    
334    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
335    
336            * HTML.pm.src: Ignore punctuation in charset names.
337    
338    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
339    
340            * ContentChecker.pm: Support for charset-layer error levels.
341    
342            * HTML.pm.src: Don't specify |text| argument for the
343            |chardecode:fallback| error, since it is not the encoding
344            being used alternatively.
345    
346    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
347    
348            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
349    
350    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
351    
352            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
353    
354    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
355    
356            * HTML.pm.src: Bug fix and sync with the spec with regard
357            to after after frameset insertion mode processing (HTML5
358            revision 1909).  Note that the implementation was wrong
359            per the old spec before the r1909 changes.
360    
361    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
362    
363            * HTMLTable.pm: scope=auto algorithm fix synced with the
364            spec (HTML5 revision 2093).
365            ($process_row): Algorithm step numbers synced with the
366            spec (HTML5 revision 2092).
367    
368    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
369    
370            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
371            revision 2094).
372    
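The difference between the two character classes named in the 2008-08-30
HTMLTable.pm entry above can be illustrated with a small Python sketch.  The
White_Space code point list below is written out by hand from the current
Unicode property data and may differ slightly from the Unicode version in use
in 2008; the helper names are hypothetical and nothing here is taken from
HTMLTable.pm itself.

```python
import unicodedata

# Code points with the Unicode White_Space property (hand-written list,
# see the note above about Unicode versions).
WHITE_SPACE = frozenset([
    *range(0x0009, 0x000E),   # TAB, LF, VT, FF, CR
    0x0020, 0x0085, 0x00A0, 0x1680,
    *range(0x2000, 0x200B),   # EN QUAD .. HAIR SPACE
    0x2028, 0x2029, 0x202F, 0x205F, 0x3000,
])


def is_zs(char):
    """True if the character's general category is Zs (space separator)."""
    return unicodedata.category(char) == "Zs"


def is_white_space(char):
    """True if the character has the White_Space property."""
    return ord(char) in WHITE_SPACE


if __name__ == "__main__":
    for char in ("\t", "\n", " ", "\u2028", "\u3000"):
        print("U+%04X Zs=%s White_Space=%s"
              % (ord(char), is_zs(char), is_white_space(char)))
```

TAB and LF are the practical difference: both have the White_Space property,
but neither is in the Zs category.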
373    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
374    
375            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
376    
377    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
378    
379            * HTML.pm.src: '"' and "'" at the end of attribute
380            name (after another attribute) now raise parse error (HTML5
381            revision 2123).  Empty unquoted attribute values are no
382            longer allowed (HTML5 revision 2122).
383    
384    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
385    
386            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
387            revision 2130).
388    
389    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
390    
391            * ContentChecker.pm: |xml:lang| attribute value must be same
392            as |lang| attribute value for HTML elements (HTML5 revision 2062
393            and so on).
394    
395    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
396    
397            * ContentChecker.pm: Error level definition for |xml_id_error|
398            was missing.
399    
400            * URIChecker.pm: The end of the URL should be marked as the
401            error location for an empty path error.  The position
402            between the userinfo and the port components should be
403            marked as the error location for an empty host error.
404    
405    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
406    
407            * URIChecker.pm: Set parameters representing where in the
408            value the error occurs for errors.  Report unknown
409            address format error in warning level, since address
410            formats are rarely added.  Path segments starting with "/.."
411            were misinterpreted as a dot-segment.
412    
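The dot-segment fix in the 2008-08-30 URIChecker.pm entry above can be
sketched in Python.  The buggy variant is a guess at the kind of check that
produces the described symptom, not the actual earlier code, and both function
names are hypothetical.

```python
def has_dot_segment_buggy(path):
    """Hypothetical faulty check: any segment that merely starts with '..'."""
    return "/.." in path or "/." in path


def has_dot_segment(path):
    """A segment is a dot-segment only if it is exactly '.' or '..'."""
    return any(segment in (".", "..") for segment in path.split("/"))


if __name__ == "__main__":
    for path in ("/a/../b", "/a/..b/c", "/a/.hidden", "/a/./b"):
        print(path, has_dot_segment_buggy(path), has_dot_segment(path))
```

A path such as "/a/..b/c" contains the segment "..b", which starts with ".."
but is an ordinary segment; only the exact comparison treats it correctly.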
413    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
414    
415            * URIChecker.pm (check_iri_reference): Requires
416            |Message::DOM::DOMImplementation|.
417    
418    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
419    
420            * IMTChecker.pm: Updated for the new error reporting architecture.
421    
422            * ContentChecker.pm: Error levels for IMTs are added.
423    
424    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
425    
426            * H2H.pm (_shift_token): Support for unquoted HTML attribute
427            values.
428    
429    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
430    
431            * CacheManifest.pm: Support for new style of error
432            reports.
433    
434            * HTML.pm.src: Set line=1, column=1 to the document node.
435    
436    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
437    
438            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
439            and URL checkers.  Support for more error levels for bogus
440            language tag and URL "standards".
441    
442            * LangTag.pm, URIChecker.pm: Support for new style error
443            level reporting.
444    
445    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
446    
447            * ContentChecker.pm: Support for RDF/XML error levels.
448    
449            * HTMLTable.pm, RDFXML.pm: Support for new style of error level
450            specifying.  Error types are revised.
451    
452    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
453    
454            * ContentChecker.pm: All error reporting method calls are
455            renewed.
456    
457    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
458    
459            * HTML.pm.src: All error type names and "text" parameters
460            are revised.  Use new style for "level" specification.
461    
462            * mkhtmlparser.pl: Use new style for "level" specification.
463    
464    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
465    
466            * WebIDL.pm (parse_char_string): Simplified error
467            reporting process for broken ignored valuetype definition.
468            (Valuetype idl_text): Support for special "DOMString" name.
469    
470    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
471    
472            * WebIDL.pm ($get_scoped_name): Append "::::" if the last
473            terminal of the ScopedName is "DOMString", so that whether
474            the last part of the scoped name is "DOMString" or "_DOMString"
475            can be determined later.  It is necessary to determine whether a |typedef|
476            definition should be ignored or not.
477            (parse_char_string): Unescape the identifier of
478            exception members.
479            ($resolve): Return undef for builtin types and sequence<T>
480            types (we might not have to do this, however...).
481            (check): Support checking for Exceptions, Valuetypes,
482            and Typedefs.
483            ($serialize_type): Support for "DOMString::::" syntax.
484            (Typedef idl_text): Output Type as "DOMString" if it
485            is really "DOMString" (i.e. its internal representation
486            is "::DOMString::").
487    
488    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
489    
490            * WebIDL.pm ($resolve): New code, based on resolve code
491            for constant types in the |check| method.
492            (check): Support for checking of attributes, operations, and
493            arguments.
494            (Attribute/Operation idl_text): Exception names in getraises,
495            setraises, and raises clauses are serialized by |$serialize_type|
496            code.
497    
498    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
499    
500            * WebIDL.pm ($integer): The order of alternatives is changed to match
501            hexadecimal numbers (the original pattern, taken from the spec,
502            did not work for hexadecimal numbers, because after the "0" prefix
503            the [0-7]* part matches (as an empty string) and therefore
504            the pattern does not match the remaining "x..." part of a "0x..."
505            integer literal).
506            ($get_type): It now returns a string, not an array reference,
507            for regular types and |sequence| types (i.e. it in any case
508            returns a string).
509            ($get_next_token): The second item in the array that represents
510            an integer or float token is now a Perl number value, not the
511            original string representation of the number.
512            (check): Support for const value consistency checking.
513            No extended attribute is defined for constants.
514            (Node subclasses): Use simple strings rather than array references
515            for default data type values.
516            ($serialize_type): Type values are now simple strings.
517            (value): If the new attribute value is a false value, then
518            a FALSE value is set to the attribute.
519    
520    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
521    
522            * WebIDL.pm ($get_scoped_name): Now scoped names are stored
523            in its stringified format ("scoped name" as defined in the
524            spec).  Note that future version of this module should not use
525            array references for type values and the |type_text| attribute
526            should be made obsolete.
527            (parse_char_string): Unescape attribute names.
528            (check): Support for checking of whether inherited interfaces
529            are actually defined or not.  Support for checking of whether
530            interface member identifiers are duplicated or not.
531            ($serialize_type): Scoped names are returned as is.  A future
532            version of this code should escape identifiers other than "DOMString",
533            otherwise the idl_text would be non-conforming.
534    
535    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
536    
537            * WebIDL.pm (parse_char_string): Set line/column numbers
538            to generated nodes.  Unescape identifiers.  Extended attributes
539            for Definition's were ignored.
540            (append_child): Set |parent_node| attribute.
541            (parent_node): New attribute.
542            (check): Support interface/exception members.  Support
543            extended attributes.  Support definition identifier uniqueness
544            constraint.
545            (qualified_name): New attribute.
546            (Interface/Exception idl_text): Extended attributes were
547            not prepended to the returned text.
548    
549    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
550    
551            * WebIDL.pm (parse_char_string): Set line/column numbers
552            to interface object experimentally.  s/shift/pop/g, shift
553            would make things wrong.  Support for interface forward
554            declarations was missing.  Broken interface declarations
555            with no block were not ignored entirely.
556            (Whatpm::WebIDL::Node): New abstract class.  This class
557            makes things easier.
558            (child_nodes): New attribute.  Unlike DOM's attribute with
559            same name, this attribute returns a dead list of nodes for
560            simplicity.
561            (get_user_data, set_user_data): New methods.
562            (Module idl_text): A SPACE character should be inserted
563            before the |{| character.
564            (Interface idl_text): Support for interface forward declarations.
565            (is_forward_declaration): New attribute.
566    
567    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
568    
569            * WebIDL.pm (type_text): Better serializer.
570    
571    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
572    
573            * WebIDL.pm: Revise forward-compatible parsing so that
574            it now can handle broken extended attributes and the like.
575    
576    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
577    
578            * WebIDL.pm: Real support for extended attributes.
579            Support for extended attributes with arguments.
580    
581    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
582    
583            * WebIDL.pm: Support for |exception| syntax.
584            (Interface->idl_text): Tentative support for inheritances.
585    
586    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
587    
588            * WebIDL.pm: Hierarchical scoped name support was broken.
589            Support for raises, setraises, and getraises syntaxes.
590    
591    2008-07-18  Wakaba  <wakaba@suika.fam.cx>
592    
593            * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
594            proper support for types, extended attributes, and exceptions yet).
595            WebIDL parser, version 1 (no support for exceptions yet,
596            no proper support for extended attributes yet).
597    
598    2008-07-09  Wakaba  <wakaba@suika.fam.cx>
599    
600            * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
601    
602    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
603    
604            * WebIDL.pm: Support for valuetype and const.
605    
606    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
607            
608            * WebIDL.pm: New module.
609    
610    2008-06-15  Wakaba  <wakaba@suika.fam.cx>
611    
612            * Makefile (Entities.html): URI changed.
613    
614    2008-06-08  Wakaba  <wakaba@suika.fam.cx>
615    
616            * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
617    
618    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
619    
620            * HTML.pm.src (_get_next_token): A parse error was missing.
621    
622    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
623    
624            * mklinktypelist.pl: rel=contact is no longer part of the HTML5
625            spec (commented out). (HTML5 revision 1711).
626    
627    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
628    
629            * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
630    
631            * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
632            as UTF-16 (HTML5 revision 1701).
633    
634    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
635    
636            * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
637            1692).
638    
639    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
640    
641            * HTML.pm.src: The secondary insertion mode used when switching
642            to foreign content is the "in body" insertion mode (HTML5 revision
643            1696).
644    
645    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
646    
647            * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
648            revision 1697).
649    
650    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
651    
652            * HTML.pm.src: Support for end-of-file token in foreign content
653            insertion mode (HTML5 revision 1693).  Update SVG camelCase
654            attribute list (HTML5 revision 1700).  <textarea> closes
655            </select> (HTML5 revision 1699).  More start tags close in
656            foreign content insertion mode (HTML5 revision 1698).
657    
658    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
659    
660            * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
661    
662    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
663    
664            * HTML.pm.src: More robust charset parameter detection (HTML5
665            revision 1674).
666    
667    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
668    
669            * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
670            revision 1676).
671    
672    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
673    
674            * HTML.pm.src: Ignore language part of public identifiers for
675            quirks mode detection (HTML5 revision 1679).
676    
677    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
678    
679            * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
680            revision 1685).
681    
682    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
683    
684            * HTML.pm.src: Support for EOF in new states for tags (HTML5
685            revision 1684).
686    
687    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
688    
689            * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
690            work (HTML5 revision 1690).
691    
692    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
693    
694            * HTML.pm.src (_tree_construction_main): Change handling of
695            end tags in head insertion modes (HTML5 revision 1686).
696            (parse_char_string): Bug fix for non-utf8 character string handling.
697            (parse_char_stream): |ungetc| does not work well for this context.
698    
699    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
700    
701            * HTML.pm.src (parse_byte_string): Redefined to invoke
702            |parse_byte_stream|.
703            (parse_byte_stream): New method.
704    
705    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
706    
707            * HTML.pm.src (parse_byte_string): Fix the column number reported
708            by encoding layer error reporter.
709    
710    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
711    
712            * HTML.pm.src (parse_byte_string): Use streaming decoder
713            rather than converting the whole byte string and then parsing.
714            Propagate errors in character encoding layer.
715            (get_next_token): Precise error reporting for |bare stago| error.
716    
717    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
718    
719            * HTML.pm.src (parse_char_stream): New method.
720            (parse_char_string): This method is now defined as an invocation
721            of the |parse_char_stream| method.
722    
723    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
724    
725            * HTML.pm.src (parse_byte_string): Report various status
726            of the sniffing as info-level errors.  Support for new
727            decoding framework in parser resetting.
728            (new): Various default error levels were not set.
729    
730    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
731    
732            * HTML.pm.src (parse_byte_string): HTML5 encoding sniffing
733            algorithm, except for the actual sniffing, is implemented
734            with new framework with Message::Charset::Info.
735    
736    2008-05-16  Wakaba  <wakaba@suika.fam.cx>
737    
738            * CacheManifest.pm (_parse): Drop fragment identifiers from
739            URIs in fallback section (HTML5 revision 1596).
740    
741    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
742    
743            * Makefile (Entities.html): URI has changed.
744    
745    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
746    
747            * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
748            1553).
749    
750    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
751    
752            * ContentChecker.pm: Noted that those returned in |table| are
753            no longer table elements, but table objects returned
754            by Whatpm::HTMLTable.
755    
756            * HTMLTable.pm (form_table): Return table element node
757            as |$table->{element}|.
758            (assign_header): Support for the |headers=""| attribute.
759    
760    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
761    
762            * HTMLTable.pm (assign_header): New function; first version
763            with no support for headers="".
764            (form_table): Include table width and height in the returned
765            table object for convenience.  Indexing in column assignment was wrong.
766            Set whether a data cell is empty or not for convenience.
767    
768    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
769    
770            * HTMLTable.pm: More robust caption support (HTML5 revision 1393).
771    
772    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
773    
774            * HTMLTable.pm: How table model errors are detected is
775            changed (HTML5 revision 1387).
776    
777    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
778    
779            * HTMLTable.pm: The algorithm now moves |tfoot| elements
780            to the end of the table (HTML5 revision 1380).
781    
782    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
783    
784            * HTMLTable.pm: The algorithm is now 0-based indexing, instead
785            of 1-based (HTML5 revision 1376).
786    
787    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
788    
789            * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
790            and "Content-Encoding" no longer prevent sniffing (HTML5
791            revision 1288).
792    
793    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
794    
795            * ContentType.pm: Skip BOMs in feed or HTML algorithm (HTML5 revision
796            1282).
797    
798    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
799    
800            * ContentChecker.pm: Support for global attributes.
801            Status of XML specs are added.
802    
803    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
804    
805            * ContentChecker.pm (check_element): Support for |template=""|
806            and |ref=""| attribute (referent element type checking).
807    
808    2008-04-29  Wakaba  <wakaba@suika.fam.cx>
809    
810            * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
811            1500) is implemented (except for IDNA part and URI-scheme-specific
812            knowledge).  Line number counting was wrong for LF-only
813            documents.
814    
815    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
816    
817            * HTML.pm.src: Raise a parse error for any disallowed
818            character (HTML5 revision 1263).
819    
820    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
821    
822            * mkentitylist.pl: Support for new HTML5 entity table format (the
823            definition for |AElig;| was missing).
824    
825    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
826    
827            * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
828            name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
829    
830    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
831    
832            * HTML.pm.src: List of element names that close foreign content
833            insertion mode is added (HTML5 revisions 1412 and 1418).
834    
835    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
836    
837            * HTML.pm.src: Support for |mglyph| and |malignmark|
838            elements (HTML5 revision 1410).
839    
840    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
841    
842            * HTML.pm.src: Support for new long MathML entities (HTML5
843            revision 1406).
844    
845    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
846    
847            * HTML.pm.src: CDATA section support for MathML and SVG
848            elements (HTML5 revisions 1404 and 1420).
849    
850    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
851    
852            * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
853            elements (HTML5 revision 1404).  Unused !!!macro definitions
854            are removed.
855    
856    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
857    
858            * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
859            are raised is changed (HTML5 revision 1404).
860    
861    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
862    
863            * HTML.pm.src: Category sets that are no longer used are removed.
864    
865    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
866    
867            * HTML.pm.src: The ->[1] property of stack entries is now
868            replaced by constants representing element category.
869    
870    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
871    
872            * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
873            for error reporting.  (This is a preparation for using constant
874            value for ->[1].)
875    
876    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
877    
878            * RDFXML.pm: Typo fixed.
879    
880    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
881    
882            * ContentChecker.pm: |fact_level| is now treated
883            the same as |must_level|, i.e. level = |m|.
884            (check_element): Make list of URIs in the DOM.
885    
886    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
887    
888            * RDFXML.pm: Language accessor implemented.
889            Local (null-namespace) attribute support.
890    
891    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
892    
893            * RDFXML.pm: Factored out ID checking code.
894    
895    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
896    
897            * RDFXML.pm: TODO items noted.  Validation of ID
898            and URI attributes is implemented.  Warn if unknown
899            value is used in rdf:parseType="" attribute.
900    
901            * URIChecker.pm (check_rdf_uri_reference): New function.
902    
903    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
904    
905            * RDFXML.pm: bnodeid implemented.  Relative references
906            are now resolved.
907    
908    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
909    
910            * ContentChecker.pm: RDF reification implemented.
911    
912            * RDFXML.pm: undef vs false bug fixed.
913            Reification implemented.
914    
915    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
916    
917            * RDFXML.pm: s/id/ID/ for attribute name.
918            The |node| arguments are added for |ontriple| calls.
919            Too many "attribute not allowed" errors were raised.
920    
921            * ContentChecker.pm: Initial experimental support for rdf:RDF
922            element.
923    
924    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
925    
926            * RDFXML.pm: New module.
927    
928    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
929    
930            * HTML.pm.src (set_inner_html): Line/column number
931            code was still the old one.
932    
933    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
934    
935            * HTML.pm.src: Better line/column reporting for "duplicate attribute"
936            errors.  Line/column markings for DOCTYPE, comment, and
937            character tokens are reintroduced; otherwise,
938            error location for "not HTML5" error and errors
939            for implied elements are not attached.
940    
941    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
942    
943            * HTML.pm.src: Set line/column numbers to attributes.
944    
945            * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
946            (value): Setter implemented.
947    
948            * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
949    
950    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
951    
952            * HTML.pm.src: Unused line/column markings are removed.
953    
954    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
955    
956            * HTML.pm.src (_get_next_token): Remove |first_start_tag|
957            flag, which is no longer used.
958    
959    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
960    
961            * HTML.pm.src: Set line/column information to element nodes.
962    
963            * mkhtmlparser.pl (!!!create-element, !!!insert-element,
964            and !!!insert-element-t): Set line/column information to
965            element nodes.
966    
967    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
968    
969            * HTML.pm.src (_get_next_token): The first "<" character
970            in "<?", "<>", or "</>" should be the error point.
971    
972    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
973    
974            * HTML.pm.src: Some more fixes on error position reporting.
975    
976    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
977    
978            * HTML.pm.src: Token-level precise error reporting.
979    
980    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
981    
982            * HTML.pm.src: Preparation for more precise error point
983            reporting.
984    
985    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
986    
987            * HTML.pm.src: Error type revised.
988    
989    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
990    
991            * HTML.pm.src: Similar code is merged together, again.
992    
993    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
994    
995            * HTML.pm.src: Similar code is merged together.
996    
997    2008-03-10  Wakaba  <wakaba@suika.fam.cx>
998    
999            * mkhtmlparser.pl: Set "level" parameter to parse errors.
1000    
1001            * HTML.pm.src: Code refined.
1002    
1003    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1004    
1005            * HTML.pm.src: |</body>| treatment has been changed (HTML5
1006            revision 1348).  Note that I really don't know whether this makes
1007            any difference in the black-box behavior of the parser.
1008    
1009    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1010    
1011            * HTML.pm.src: New end-of-file token implementation (HTML5
1012            revision 1348).
1013    
1014    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1015    
1016            * HTML.pm.src: |applet| support (HTML5 revision 1347).
1017    
1018    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1019    
1020            * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
1021    
1022    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1023    
1024            * HTML.pm.src: Support for |<input>| in the "in select" insertion
1025            mode and support for the "in select in table" insertion mode (HTML5
1026            revision 1342).
1027    
1028    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1029    
1030            * HTML.pm.src: No foster parenting for <style> and <script>
1031            in non-tainted <table>s (HTML5 revision 1336).
1032    
1033    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1034    
1035            * HTML.pm.src: Ignore white space characters between <html>
1036            and <head> (HTML5 revision 1332).
1037    
1038    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1039    
1040            * HTML.pm.src: Treat <input type=hidden> as if it were a
1041            white space (HTML5 revision 1331).
1042    
1043    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1044    
1045            * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
1046            element (HTML5 revision 1330).
1047    
1048    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1049    
1050            * HTML.pm.src: <title> is always appended to the current
1051            element (HTML5 revision 1328).
1052    
1053    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1054    
1055            * HTML.pm.src: White space in tainted tables is moved
1056            into foster parents (HTML5 revision 1326).
1057    
1058    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1059    
1060            * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
1061            revision 1321).
1062    
1063    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1064    
1065            * HTML.pm.src: |</h/n/>| case code rearranged to align with
1066            the spec (HTML5 revision 1320).  Note that this finally completes
1067            all of the HTML5 revision 1320 changes.
1068    
1069    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1070    
1071            * HTML.pm.src: |</form>| now works similarly to |</div>| for unclosed
1072            tags (HTML5 revision 1320).
1073    
1074    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1075    
1076            * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
1077    
1078    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1079    
1080            * HTML.pm.src: A "generate implied end tags" code (t409.1)
1081            could not be reached, so it is now removed (HTML5 revision
1082            1320).
1083    
1084    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1085    
1086            * HTML.pm.src: Code for the case of |</div>| and so on is revised
1087            to align with new spec text (HTML5 revision 1320).
1088    
1089    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1090    
1091            * HTML.pm.src: Remove strange |if| condition; however, it should
1092            have done no harm in theory.
1093    
1094    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
1095    
1096            * HTML.pm.src (_tree_construction_main): '</p>' in body
1097            case is split from other end tags in preparation
1098            for implementing HTML5 revision 1320.
1099    
1100  2008-03-07  Wakaba  <wakaba@suika.fam.cx>  2008-03-07  Wakaba  <wakaba@suika.fam.cx>
1101    
1102          * HTML.pm.src: Simplified "generate implied end tag" (HTML5          * HTML.pm.src: Simplified "generate implied end tag" (HTML5

