Diff of /markup/html/whatpm/Whatpm/ChangeLog

revision 1.178 by wakaba, Sun Mar 9 05:12:04 2008 UTC
revision 1.353 by wakaba, Tue Oct 14 09:00:57 2008 UTC
1    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
2    
3            * NanoDOM.pm (dom_config): New attribute (do nothing), for
4            Whatpm::XML::Parser support.
5    
6    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
7    
8            * Makefile: New rule to make HTML/Tokenizer.pm is added.
9    
10            * HTML.pm.src: Tokenizer part moved to another file.
11    
12    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
13    
14            * HTML.pm.src: Merge |DT_EL| and |DD_EL| as |DTDD_EL|.
15    
16    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
17    
18            * HTML.pm.src: Element category constants redefined.
19            
20    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
21    
22            * HTML.pm.src: Steps for CDATA/RCDATA elements in tree
23            construction stage synced with the spec (HTML5 revisions 2139 and
24            2302).
25    
26    2008-10-07  Wakaba  <wakaba@suika.fam.cx>
27    
28            * ContentChecker.pm: New error level "html5_fact" added, which
29            should be tentatively used until all of the requirements are properly
30            specced as RFC 2119 "MUST" in HTML5.
31    
32    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
33    
34            * ContentChecker.pod: Note on internal flags is added.
35    
36    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
37    
38            * HTML.pm.src: An AAA bug fixed.
39    
40    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
41    
42            * HTML.pm.src: If another node is inserted by the parser, don't
43            reuse existing Text node to append a character (HTML5 revision
44            2124).
45    
46    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
47    
48            * HTML.pm.src: Support for <option> and <optgroup> in body (HTML5
49            revisions 1731 and 2128).
50    
51    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
52    
53            * HTML.pm.src: Make <foreignObject> scoping (HTML5 revision 1837).
54            Support for end tags of camelCase SVG elements was broken.  A
55            wrong error type text fixed.
56    
57    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
58    
59            * HTML.pm.src: Drop redundant code (HTML5 revision 1731).
60    
61    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
62    
63            * HTML.pm.src: Support for new definition of |param| and |source|
64            start tag parsing (HTML5 revision 1731).
65    
66    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
67    
68            * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
69    
70    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
71    
72            * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
73            revisions 1731 and 1831).
74    
75    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
76    
77            * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
78            revisions 1731 and 1778).  Support for the </sarcasm> end tag (HTML5
79            revision 1731).
80    
81    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
82    
83            * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
84            revision 1731).  End tags of |option| and |optgroup| elements are
85            now optional (HTML5 revision 1731).
86    
87    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
88    
89            * HTML.pm.src: New "special" elements added to the list (HTML5
90            revision 1778).  "strile" -> "strike".
91    
92    2008-10-02  Wakaba  <wakaba@suika.fam.cx>
93    
94            * ContentType.pm (get_sniffed_type): Support for the "better"
95            content sniffing (HTML5 revision 1927).  In one case the official
96            type was not returned when the method was invoked in list
97            context.
98    
99    2008-09-22  Wakaba  <wakaba@suika.fam.cx>
100    
101            * HTML.pm.src: Character references for non-space C0 characters
102            (including U+000B VT), the DEL character, and noncharacter code
103            points are now converted to the U+FFFD character (cf. HTML5 revision 2138).
104    
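The replacement rule in the entry above can be sketched as follows.  This is
a minimal illustration in Python, not the Perl code in HTML.pm.src: the
function names and the assumed set of C0 space characters (U+0009, U+000A,
U+000C, U+000D) are hypothetical, and other parts of character reference
handling are left out.

```python
REPLACEMENT = "\uFFFD"

# Assumption: C0 characters still treated as space characters.
# U+000B is deliberately absent.
C0_SPACES = {0x09, 0x0A, 0x0C, 0x0D}


def is_noncharacter(cp):
    """True for U+FDD0..U+FDEF and for code points ending in FFFE or FFFF."""
    return 0xFDD0 <= cp <= 0xFDEF or (cp & 0xFFFE) == 0xFFFE


def resolve_numeric_reference(cp):
    """Map a referenced code point (0..0x10FFFF) to the character to emit."""
    if cp <= 0x1F and cp not in C0_SPACES:
        return REPLACEMENT  # non-space C0 control, including U+000B VT
    if cp == 0x7F:
        return REPLACEMENT  # DEL
    if is_noncharacter(cp):
        return REPLACEMENT
    return chr(cp)
```

With these assumptions, a reference to U+000B yields U+FFFD, while a
reference to U+000A still yields a line feed.
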
105    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
106    
107            * ContentChecker.pm: |form=""| check support added.
108    
109    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
110    
111            * ContentChecker.pm: |contextmenu| validness is now checked using
112            |id| and |id_type| properties, and |menu| property is removed.
113    
114    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
115    
116            * ContentChecker.pm: Prepare for |form| |name| attribute's
117            duplication checking.
118    
119    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
120    
121            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
122            unsupported charset="" parameter value (HTML5 revision 2131).
123    
124    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
125    
126            * HTML.pm.src: Remaining places where U+000B was allowed as a space
127            character are fixed (cf. HTML5 revision 1738).
128    
129            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
130            space characters (HTML5 revision 1738).
131    
132    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
133    
134            * HTML.pm.src: The "anything else" case for the "after after body"
135            insertion mode was not updated to switch to the "in body"
136            insertion mode.  U+000B is no longer a space character for the
137            purpose of tree construction phase (HTML5 revision 1738).
138    
139    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
140    
141            * HTML.pm.src: U+000B is no longer a space character (HTML5
142            revision 1738).
143    
144    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
145    
146            * ContentType.pm: 0x0B is no longer a space character (HTML5
147            revision 1738).
148    
149            * HTML.pm.src: U+000B is no longer a space character for the
150            algorithm for extracting an encoding from a Content-Type (HTML5
151            revision 1738).
152    
153    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
154    
155            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
156    
157    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
158    
159            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
160    
161            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
162            matching.
163    
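Why sorting 'Prefix' values by length can ease matching is sketched below.
This is a hypothetical Python illustration, not the logic of mklangreg.pl or
LangTag.pm: it assumes the values are ordered longest first so that the
first hit is the most specific one, and both function names are invented.

```python
def sort_prefixes(prefixes):
    """Order prefixes longest first (assumed direction); ties keep input order."""
    return sorted(prefixes, key=len, reverse=True)


def first_matching_prefix(tag, sorted_prefixes):
    """Return the first prefix that equals the tag or is followed by '-' in it."""
    tag_lower = tag.lower()
    for prefix in sorted_prefixes:
        prefix_lower = prefix.lower()
        if tag_lower == prefix_lower or tag_lower.startswith(prefix_lower + "-"):
            return prefix
    return None
```

Because the list is already ordered, a single linear scan finds the most
specific prefix without comparing candidates against each other.
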
164    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
165    
166            * LangTag.pm: Warn for private use language subtags.  Error level
167            typos fixed.  Support for Suppress-Script field.
168    
169            * mklangreg.pl: Support for dumping of nested structure.
170    
171    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
172    
173            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
174            recommended case as per RFC 4646.
175    
176    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
177    
178            * LangTag.pm (check_rfc4646_langtag): New method.
179    
180    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
181    
182            * mklangreg.pl: New script.
183    
184            * Makefile: Updated for creation of the module for language subtag
185            registry.
186            
187    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
188    
189            * Makefile: WebIDL.html added.
190    
191            * WebIDL.pod: New documentation.
192    
193    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
194    
195            * WebIDL.pm: Checker's error types are redefined.
196    
197    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
198    
199            * WebIDL.pm: Parser's error types are redefined.  Some forward
200            compatible parsing bugs are fixed.  Some unreachable code is
201            commented out.
202    
203    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
204    
205            * WebIDL.pm: Support for the remaining extended attributes is
206            added.  A forward interface declaration that has an extended
207            attribute does not satisfy the definition.  It seems that,
208            unless explicitly allowed, multiple extended attributes with the
209            same name are not allowed, though this is not explicitly mentioned in
210            the spec.
211    
212    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
213    
214            * WebIDL.pm: Unescape extended attribute names and extended
215            attribute identifiers.  Preserve whether an extended attribute has
216            an argument list or not.  Support for extended attributes:
217            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
218            NameGetter, NameSetter, and Null.
219            (has_argument_list): New attribute.
220            (idl_text): Stringify the argument list, if any, even if it is
221            empty.
222    
223    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
224    
225            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
226            |{s_kwd}| in DATA_STATE as default.
227    
228    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
229    
230            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
231            by |{s_kwd}| in DATA_STATE.
232    
233    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
234    
235            * HTML.pm.src: Shorten keys.
236    
237    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
238    
239            * HTML.pm.src: Remove checking for control characters, surrogate
240            pairs, noncharacter code points, and non-Unicode code
241            points (they should be handled by Whatpm::Charset::UnicodeChecker).
242            (parse_char_stream): Support for the |$get_wrapper| argument and
243            character stream error handlers.
244    
245    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
246    
247            * ContentChecker.pm: Don't call |load_ns_module|
248            for null-namespace elements/attributes.
249    
250            * HTML.pm.src: Factor out $disallowed_control_chars
251            as a hash.
252    
253    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
254    
255            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
256            and |{next_char}| initializations are moved to initialization
257            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
258            with |parse_char_stream|.
259    
260    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
261    
262            * HTML.pm.src (parse_char_stream): Make |set_next_char|
263            invoke |manakai_read_until|, not only |read|, where
264            possible, to decrease the number of |read| method calls.
265    
266            * mkhtmlparser.pl: Related changes to the aforementioned
267            modification.
268    
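The idea of reading a whole run of characters in one call, instead of one
call per character, can be sketched as below.  This is a hypothetical Python
illustration only: the class, its methods, and the call counter are invented
and do not reproduce the interface of |manakai_read_until|.

```python
class CharSource:
    """Hypothetical character source holding decoded text and a cursor."""

    def __init__(self, text):
        self.text = text
        self.pos = 0
        self.calls = 0  # number of read-style calls made so far

    def read(self):
        """Return the next character, or '' at end of input."""
        self.calls += 1
        ch = self.text[self.pos:self.pos + 1]
        self.pos += len(ch)
        return ch

    def read_until(self, stop_chars):
        """Return the run of characters before the first stop character."""
        self.calls += 1
        start = self.pos
        while self.pos < len(self.text) and self.text[self.pos] not in stop_chars:
            self.pos += 1
        return self.text[start:self.pos]
```

For the input "hello <b>", one |read_until| call with the stop characters
"<" and "&" returns "hello ", where character-at-a-time reading would need
six calls for the same text.
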
269    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
270    
271            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
272            reports character errors from now on.
273    
274    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
275    
276            * HTML.pm.src: Character tokens with leading white space followed
277            by non-white-space characters were not correctly handled in the
278            "before head" insertion mode.
279            (set_inner_html): Reimplemented using CharString decodehandle
280            class.  Support for $get_wrapper argument.  Support
281            for |{read_until}| feature.
282    
283    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
284    
285            * HTML.pm.src: Make a "bare ero" error for unknown
286            entities point to the "&" character.
287    
288    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
289    
290            * HTML.pm.src: It turns out that U+FFFD doesn't have to
291            be added to the list of excluded characters.
292    
293    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
294    
295            * HTML.pm.src ($char_onerror): Give the character decoder's |line|
296            and |column| a higher priority than those set by the
297            tokenizer's input handler.
298            ($self->{read_until}): Exclude U+FFFD (but this might
299            not be necessary, since now we do line/column fixup in
300            the character decode handle).
301    
302    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
303    
304            * HTML.pm.src: Use |{read_until}| where possible.
305    
306    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
307    
308            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
309            and |manakai_getc_until| to |manakai_read_until| to
310            reduce the number of string copies.
311    
312    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
313    
314            * HTML.pm.src (parse_char_string): Use newly created
315            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
316            standard feature to |open| a string as a filehandle,
317            since Perl's string filehandle does not seem to support the |ungetc|
318            method correctly.
319            (parse_char_stream): Define |{getc_until}| method.
320            (DATA_STATE): Experimental support for |getc_until| feature.
321    
322    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
323    
324            * HTML.pm.src: Check points added to newly added branches.
325    
326    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
327    
328            * HTML.pm.src: Remove |{char}|, which is no longer used.
329            Replace |{entity_in_attr}| and |{last_attribute_value_state}|
330            with |{prev_state}|.
331    
332            * mkhtmlparser.pl: Remove |{char}| feature.
333            Remove |!!!back-next-input-character;| macro.
334    
335    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
336    
337            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
338            entity related tokenizer states in favor of new states
339            implementing the consume character reference algorithm.
340    
341    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
342    
343            * HTML.pm.src: "Consume a character reference" algorithm is
344            now implemented as a tokenizer's state, rather than
345            a method, with minimum changes (more changes will
346            be made, in due course).  "Bogus comment state"'s inner
347            loop gets removed.
348    
349    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
350    
351            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
352            into their own tokenizer states.
353    
354    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
355    
356            * HTML.pm.src: |CDATA_SECTION_STATE| (formerly |CDATA_BLOCK_STATE|)
357            is split into three states.
358    
359    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
360    
361            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
362            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
363            the tokenizer no longer has to push back next input
364            characters in those states.
365    
366    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
367    
368            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
369            into four states so that the tokenizer no longer has to push
370            back next input characters in that state.
371    
372    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
373    
374            * HTML.pm.src: Methods now accept an additional parameter, $get_wrapper,
375            which can be used to insert some wrapper between the character
376            stream handle and the tokenizer.  (It is currently not supported
377            for |set_inner_html| for |Element|s).
378    
379    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
380    
381            * HTML.pm.src: Ignore punctuation in charset names.
382    
383    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
384    
385            * ContentChecker.pm: Support for charset-layer error levels.
386    
387            * HTML.pm.src: Don't specify |text| argument for the
388            |chardecode:fallback| error, since it is not the encoding
389            being used alternatively.
390    
391    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
392    
393            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
394    
395    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
396    
397            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
398    
399    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
400    
401            * HTML.pm.src: Bug fix and sync with the spec with regard
402            to after after frameset insertion mode processing (HTML5
403            revision 1909).  Note that the implementation was wrong
404            per the old spec before the r1909 changes.
405    
406    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
407    
408            * HTMLTable.pm: scope=auto algorithm fix synced with the
409            spec (HTML5 revision 2093).
410            ($process_row): Algorithm step numbers synced with the
411            spec (HTML5 revision 2092).
412    
413    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
414    
415            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
416            revision 2094).
417    
418    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
419    
420            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
421    
422    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
423    
424            * HTML.pm.src: '"' and "'" at the end of attribute
425            name (after another attribute) now raise parse error (HTML5
426            revision 2123).  Empty unquoted attribute values are no
427            longer allowed (HTML5 revision 2122).
428    
429    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
430    
431            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
432            revision 2130).
433    
434    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
435    
436            * ContentChecker.pm: |xml:lang| attribute value must be the same
437            as |lang| attribute value for HTML elements (HTML5 revision 2062
438            and so on).
439    
440    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
441    
442            * ContentChecker.pm: Error level definition for |xml_id_error|
443            was missing.
444    
445            * URIChecker.pm: The end of the URL should be marked as the
446            error location for an empty path error.  The position
447            between the userinfo and the port components should be
448            marked as the error location for an empty host error.
449    
450    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
451    
452            * URIChecker.pm: Set parameters indicating where in the
453            value each error occurs.  Report the unknown
454            address format error at the warning level, since address
455            formats are rarely added.  Path segments starting with "/.."
456            were misinterpreted as dot-segments.
457    
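The dot-segment fix in the entry above concerns the difference between a
segment that merely starts with ".." and one that is exactly "..".  RFC 3986
defines dot-segments as the complete path segments "." and "..".  A minimal
Python sketch of that test follows; the function name is hypothetical and
this is not the URIChecker.pm code.

```python
def dot_segments(path):
    """Return the segments of a path that are exactly '.' or '..'."""
    return [segment for segment in path.split("/") if segment in (".", "..")]
```

Comparing whole segments, rather than testing for a "/.." substring, keeps a
path such as "/a/..b/c" from being reported as containing a dot-segment.
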
458    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
459    
460            * URIChecker.pm (check_iri_reference): Requires
461            |Message::DOM::DOMImplementation|.
462    
463    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
464    
465            * IMTChecker.pm: Updated for the new error reporting architecture.
466    
467            * ContentChecker.pm: Error levels for IMTs are added.
468    
469    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
470    
471            * H2H.pm (_shift_token): Support for unquoted HTML attribute
472            values.
473    
474    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
475    
476            * CacheManifest.pm: Support for new style of error
477            reports.
478    
479            * HTML.pm.src: Set line=1, column=1 to the document node.
480    
481    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
482    
483            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
484            and URL checkers.  Support for more error levels for bogus
485            language tag and URL "standards".
486    
487            * LangTag.pm, URIChecker.pm: Support for new style error
488            level reporting.
489    
490    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
491    
492            * ContentChecker.pm: Support for RDF/XML error levels.
493    
494            * HTMLTable.pm, RDFXML.pm: Support for new style of error level
495            specifying.  Error types are revised.
496    
497    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
498    
499            * ContentChecker.pm: All error reporting method calls are
500            renewed.
501    
502    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
503    
504            * HTML.pm.src: All error type names and "text" parameters
505            are revised.  Use new style for "level" specification.
506    
507            * mkhtmlparser.pl: Use new style for "level" specification.
508    
509    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
510    
511            * WebIDL.pm (parse_char_string): Simplified error
512            reporting process for broken ignored valuetype definition.
513            (Valuetype idl_text): Support for special "DOMString" name.
514    
515    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
516    
517            * WebIDL.pm ($get_scoped_name): Append "::::" if the last
518            terminal of the ScopedName is "DOMString", so that whether
519            the last part of the scoped name is "DOMString" or "_DOMString"
520            can be determined later.  It is necessary to determine whether a |typedef|
521            definition should be ignored or not.
522            (parse_char_string): Unescape the identifier of
523            exception members.
524            ($resolve): Return undef for builtin types and sequence<T>
525            types (we might not have to do this, however...).
526            (check): Support checking for Exceptions, Valuetypes,
527            and Typedefs.
528            ($serialize_type): Support for "DOMString::::" syntax.
529            (Typedef idl_text): Output Type as "DOMString" if it
530            is really "DOMString" (i.e. its internal representation
531            is "::DOMString::").
532    
533    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
534    
535            * WebIDL.pm ($resolve): New code, based on resolve code
536            for constant types in the |check| method.
537            (check): Support for checking of attributes, operations, and
538            arguments.
539            (Attribute/Operation idl_text): Exception names in getraises,
540            setraises, and raises clauses are serialized by |$serialize_type|
541            code.
542    
543    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
544    
545            * WebIDL.pm ($integer): Order of alternatives is changed to match
546            hexadecimal numbers (the original pattern, taken from the spec,
547            did not work for hexadecimal numbers, because after the "0" prefix
548            the [0-7]* part matches (as an empty string), and therefore
549            the pattern does not match the remaining "x..." part of a "0x..."
550            integer literal).
551            ($get_type): It now returns a string, not an array reference,
552            for regular types and |sequence| types (i.e. it in any case
553            returns a string).
554            ($get_next_token): The second item in the array that represents
555            an integer or float token is now a Perl number value, not the
556            original string representation of the number.
557            (check): Support for const value consistency checking.
558            No extended attribute is defined for constants.
559            (Node subclasses): Use simple strings rather than array references
560            for default data type values.
561            ($serialize_type): Type values are now simple strings.
562            (value): If the new attribute value is a false value, then
563            a FALSE value is set to the attribute.
564    
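The ($integer) ordering problem in the entry above can be reproduced with any
regular expression engine that tries alternatives from left to right.  The
Python sketch below uses simplified, hypothetical patterns rather than the
exact ones from WebIDL.pm or the WebIDL spec; only the order of the octal and
hexadecimal alternatives differs between them.

```python
import re

# Octal alternative first: after "0", [0-7]* succeeds on the empty string,
# so the hexadecimal alternative is never tried for "0x..." input.
OCTAL_FIRST = re.compile(r"-?0(?:[0-7]*|[Xx][0-9A-Fa-f]+)|-?[1-9][0-9]*")

# Hexadecimal alternative first: "0x..." is consumed as a whole, and octal
# literals still match through the second alternative.
HEX_FIRST = re.compile(r"-?0(?:[Xx][0-9A-Fa-f]+|[0-7]*)|-?[1-9][0-9]*")


def leading_integer(pattern, text):
    """Return the integer literal matched at the start of text, or None."""
    match = pattern.match(text)
    return match.group(0) if match else None
```

With the octal alternative first, the literal "0x1F" is tokenized as "0" and
the "x1F" part is left over; with the hexadecimal alternative first, the
whole literal is matched.
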
565    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
566    
567            * WebIDL.pm ($get_scoped_name): Now scoped names are stored
568            in their stringified format ("scoped name" as defined in the
569            spec).  Note that a future version of this module should not use
570            array references for type values and the |type_text| attribute
571            should be made obsolete.
572            (parse_char_string): Unescape attribute names.
573            (check): Support for checking of whether inherited interfaces
574            are actually defined or not.  Support for checking of whether
575            interface member identifiers are duplicated or not.
576            ($serialize_type): Scoped names are returned as is.  A future
577            version of this code should escape identifiers other than "DOMString",
578            otherwise the idl_text would be non-conforming.
579    
580    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
581    
582            * WebIDL.pm (parse_char_string): Set line/column numbers
583            to generated nodes.  Unescape identifiers.  Extended attributes
584            for Definition's were ignored.
585            (append_child): Set |parent_node| attribute.
586            (parent_node): New attribute.
587            (check): Support interface/exception members.  Support
588            extended attributes.  Support definition identifier uniqueness
589            constraint.
590            (qualified_name): New attribute.
591            (Interface/Exception idl_text): Extended attributes were
592            not prepended to the returned text.
593    
594    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
595    
596            * WebIDL.pm (parse_char_string): Set line/column numbers
597            to interface object experimentally.  s/shift/pop/g, shift
598            would make things wrong.  Support for interface forward
599            declarations was missing.  Broken interface declarations
600            with no block were not ignored entirely.
601            (Whatpm::WebIDL::Node): New abstract class.  This class
602            makes things easier.
603            (child_nodes): New attribute.  Unlike DOM's attribute with
604            the same name, this attribute returns a dead list of nodes for
605            simplicity.
606            (get_user_data, set_user_data): New methods.
607            (Module idl_text): A SPACE character should be inserted
608            before the |{| character.
609            (Interface idl_text): Support for interface forward declarations.
610            (is_forward_declaration): New attribute.
611    
612    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
613    
614            * WebIDL.pm (type_text): Better serializer.
615    
616    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
617    
618            * WebIDL.pm: Revise forward-compatible parsing so that
619            it can now handle broken extended attributes and the like.
620    
621    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
622    
623            * WebIDL.pm: Real support for extended attributes.
624            Support for extended attributes with arguments.
625    
626    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
627    
628            * WebIDL.pm: Support for |exception| syntax.
629            (Interface->idl_text): Tentative support for inheritances.
630    
631    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
632    
633            * WebIDL.pm: Hierarchical scoped name support was broken.
634            Support for raises, setraises, and getraises syntaxes.
635    
636    2008-07-18  Wakaba  <wakaba@suika.fam.cx>
637    
638            * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
639            proper support for types, extended attributes, and exceptions yet).
640            WebIDL parser, version 1 (no support for exceptions yet,
641            no proper support for extended attributes yet).
642    
643    2008-07-09  Wakaba  <wakaba@suika.fam.cx>
644    
645            * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
646    
647    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
648    
649            * WebIDL.pm: Support for valuetype and const.
650    
651    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
652            
653            * WebIDL.pm: New module.
654    
655    2008-06-15  Wakaba  <wakaba@suika.fam.cx>
656    
657            * Makefile (Entities.html): URI changed.
658    
659    2008-06-08  Wakaba  <wakaba@suika.fam.cx>
660    
661            * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
662    
663    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
664    
665            * HTML.pm.src (_get_next_token): A parse error was missing.
666    
667    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
668    
669            * mklinktypelist.pl: rel=contact is no longer part of the HTML5
670            spec (commented out). (HTML5 revision 1711).
671    
672    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
673    
674            * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
675    
676            * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
677            as UTF-16 (HTML5 revision 1701).
678    
679    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
680    
681            * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
682            1692).
683    
684    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
685    
686            * HTML.pm.src: The secondary insertion mode used when switching
687            to foreign content is the "in body" insertion mode (HTML5 revision
688            1696).
689    
690    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
691    
692            * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
693            revision 1697).
694    
695    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
696    
697            * HTML.pm.src: Support for end-of-file token in foreign content
698            insertion mode (HTML5 revision 1693).  Update SVG camelCase
699            attribute list (HTML5 revision 1700).  <textarea> closes
700            </select> (HTML5 revision 1699).  More start tags close in
701            foreign content insertion mode (HTML5 revision 1698).
702    
703    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
704    
705            * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
706    
707    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
708    
709            * HTML.pm.src: More robust charset parameter detection (HTML5
710            revision 1674).
711    
712    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
713    
714            * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
715            revision 1676).
716    
717    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
718    
719            * HTML.pm.src: Ignore language part of public identifiers for
720            quirks mode detection (HTML5 revision 1679).
721    
722    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
723    
724            * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
725            revision 1685).
726    
727    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
728    
729            * HTML.pm.src: Support for EOF in new states for tags (HTML5
730            revision 1684).
731    
732    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
733    
734            * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
735            work (HTML5 revision 1690).
736    
737    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
738    
739            * HTML.pm.src (_tree_construction_main): Change handling of
740            end tags in head insertion modes (HTML5 revision 1686).
741            (parse_char_string): Bug fix for non-utf8 character string handling.
742            (parse_char_stream): |ungetc| does not work well for this context.
743    
744    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
745    
746            * HTML.pm.src (parse_byte_string): Redefined to invoke
747            |parse_byte_stream|.
748            (parse_byte_stream): New method.
749    
750    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
751    
752            * HTML.pm.src (parse_byte_string): Fix the column number reported
753            by encoding layer error reporter.
754    
755    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
756    
757            * HTML.pm.src (parse_byte_string): Use streaming decoder
758            rather than converting the whole byte string and then parsing.
759            Propagate errors in character encoding layer.
760            (get_next_token): Precise error reporting for |bare stago| error.
761    
762    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
763    
764            * HTML.pm.src (parse_char_stream): New method.
765            (parse_char_string): This method is now defined as an invocation
766            of the |parse_char_stream| method.
767    
768    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
769    
770            * HTML.pm.src (parse_byte_string): Report various status
771            of the sniffing as info-level errors.  Support for new
772            decoding framework in parser resetting.
773            (new): Various default error levels were not set.
774    
775    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
776    
777            * HTML.pm.src (parse_byte_string): HTML5 encoding sniffing
778            algorithm, except for the actual sniffing, is implemented
779            with new framework with Message::Charset::Info.
780    
781    2008-05-16  Wakaba  <wakaba@suika.fam.cx>
782    
783            * CacheManifest.pm (_parse): Drop fragment identifiers from
784            URIs in fallback section (HTML5 revision 1596).
785    
786    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
787    
788            * Makefile (Entities.html): URI has changed.
789    
790    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
791    
792            * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
793            1553).
794    
795    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
796    
797            * ContentChecker.pm: Noted that those returned in |table| are
798            no longer table elements, but table objects returned
799            by Whatpm::HTMLTable.
800    
801            * HTMLTable.pm (form_table): Return table element node
802            as |$table->{element}|.
803            (assign_header): Support for the |headers=""| attribute.
804    
805    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
806    
807            * HTMLTable.pm (assign_header): New function; first version
808            with no support for headers="".
809            (form_table): Include table width and height to the returned
810            table object for convenience.  Indexing in column assignment was wrong.
811            Set whether a data cell is empty or not for convenience.
812    
813    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
814    
815            * HTMLTable.pm: More robust caption support (HTML5 revision 1393).
816    
817    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
818    
819            * HTMLTable.pm: How table model errors are detected is
820            changed (HTML5 revision 1387).
821    
822    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
823    
824            * HTMLTable.pm: The algorithm now moves |tfoot| elements
825            to the end of the table (HTML5 revision 1380).
826    
827    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
828    
829            * HTMLTable.pm: The algorithm now uses 0-based indexing, instead
830            of 1-based (HTML5 revision 1376).
831    
832    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
833    
834            * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
835            and "Content-Encoding" no longer prevent sniffing (HTML5
836            revision 1288).
837    
838    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
839    
840            * ContentType.pm: Skip BOMs in feed or HTML algorithm (HTML5 revision
841            1282).
842    
843    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
844    
845            * ContentChecker.pm: Support for global attributes.
846            Status of XML specs are added.
847    
848    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
849    
850            * ContentChecker.pm (check_element): Support for |template=""|
851            and |ref=""| attribute (referent element type checking).
852    
853    2008-04-29  Wakaba  <wakaba@suika.fam.cx>
854    
855            * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
856            1500) is implemented (except for IDNA part and URI-scheme-specific
857            knowledge).  Line number counting was wrong for LF-only
858            documents.
859    
860    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
861    
862            * HTML.pm.src: Raise a parse error for any disallowed
863            character (HTML5 revision 1263).
864    
865    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
866    
867            * mkentitylist.pl: Support for new HTML5 entity table format (the
868            definition for |AElig;| was missing).
869    
870    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
871    
872            * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
873            name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
874    
875    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
876    
877            * HTML.pm.src: List of element names that close foreign content
878            insertion mode is added (HTML5 revisions 1412 and 1418).
879    
880    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
881    
882            * HTML.pm.src: Support for |mglyph| and |malignmark|
883            elements (HTML5 revision 1410).
884    
885    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
886    
887            * HTML.pm.src: Support for new long MathML entities (HTML5
888            revision 1406).
889    
890    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
891    
892            * HTML.pm.src: CDATA section support for MathML and SVG
893            elements (HTML5 revisions 1404 and 1420).
894    
895    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
896    
897            * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
898            elements (HTML5 revision 1404).  Unused !!!macro definitions
899            are removed.
900    
901    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
902    
903            * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
904            are raised is changed (HTML5 revision 1404).
905    
906    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
907    
908            * HTML.pm.src: Category sets that are no longer used are removed.
909    
910    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
911    
912            * HTML.pm.src: The ->[1] property of stack entries is now
913            replaced by constants representing element category.
914    
915    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
916    
917            * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
918            for error reporting.  (This is a preparation for using constant
919            value for ->[1].)
920    
921    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
922    
923            * RDFXML.pm: Typo fixed.
924    
925    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
926    
927            * ContentChecker.pm: |fact_level| is now treated
928            the same as |must_level|, i.e. level = |m|.
929            (check_element): Make list of URIs in the DOM.
930    
931    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
932    
933            * RDFXML.pm: Language accessor implemented.
934            Local (null-namespace) attribute support.
935    
936    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
937    
938            * RDFXML.pm: Factored out ID checking code.
939    
940    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
941    
942            * RDFXML.pm: TODO items noted.  Validation of ID
943            and URI attributes is implemented.  Warn if unknown
944            value is used in rdf:parseType="" attribute.
945    
946            * URIChecker.pm (check_rdf_uri_reference): New function.
947    
948    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
949    
950            * RDFXML.pm: bnodeid implemented.  Relative references
951            are now resolved.
952    
953    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
954    
955            * ContentChecker.pm: RDF reification implemented.
956    
957            * RDFXML.pm: undef vs false bug fixed.
958            Reification implemented.
959    
960    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
961    
962            * RDFXML.pm: s/id/ID/ for attribute name.
963            The |node| arguments are added for |ontriple| calls.
964            Too many "attribute not allowed" errors were raised.
965    
966            * ContentChecker.pm: Initial experimental support for rdf:RDF
967            element.
968    
969    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
970    
971            * RDFXML.pm: New module.
972    
973    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
974    
975            * HTML.pm.src (set_inner_html): Line/column number
976            code was still the old one.
977    
978    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
979    
980            * HTML.pm.src: Better line/column reporting for "duplicate attribute"
981            errors.  Line/column markings for DOCTYPE, comment, and
982            character tokens are reintroduced; otherwise,
983            error location for "not HTML5" error and errors
984            for implied elements are not attached.
985    
986    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
987    
988            * HTML.pm.src: Set line/column numbers to attributes.
989    
990            * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
991            (value): Setter implemented.
992    
993            * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
994    
995    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
996    
997            * HTML.pm.src: Unused line/column markings are removed.
998    
999    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
1000    
1001            * HTML.pm.src (_get_next_token): Remove |first_start_tag|
1002            flag, which is no longer used.
1003    
1004    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
1005    
1006            * HTML.pm.src: Set line/column information to element nodes.
1007    
1008            * mkhtmlparser.pl (!!!create-element, !!!insert-element,
1009            and !!!insert-element-t): Set line/column information to
1010            element nodes.
1011    
1012    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
1013    
1014            * HTML.pm.src (_get_next_token): The first "<" character
1015            in "<?", "<>", or "</>" should be the error point.
1016    
1017    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
1018    
1019            * HTML.pm.src: Some more fixes on error position reporting.
1020    
1021    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
1022    
1023            * HTML.pm.src: Token-level precise error reporting.
1024    
1025    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
1026    
1027            * HTML.pm.src: Preparation for more precise error point
1028            reporting.
1029    
1030    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
1031    
1032            * HTML.pm.src: Error type revised.
1033    
1034    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
1035    
1036            * HTML.pm.src: Similar codes are merged together, again.
1037    
1038    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
1039    
1040            * HTML.pm.src: Similar codes are merged together.
1041    
1042    2008-03-10  Wakaba  <wakaba@suika.fam.cx>
1043    
1044            * mkhtmlparser.pl: Set "level" parameter to parse errors.
1045    
1046            * HTML.pm.src: Code refined.
1047    
1048    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1049    
1050            * HTML.pm.src: |</body>| treatment has been changed (HTML5
1051            revision 1348).  Note that I really don't know whether this makes
1052            any difference in the black-box behavior of the parser.
1053    
1054    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1055    
1056            * HTML.pm.src: New end-of-file token implementation (HTML5
1057            revision 1348).
1058    
1059    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1060    
1061            * HTML.pm.src: |applet| support (HTML5 revision 1347).
1062    
1063    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1064    
1065            * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
1066    
1067  2008-03-09  Wakaba  <wakaba@suika.fam.cx>
1068    
1069          * HTML.pm.src: Support for |<input>| in the "in select" insertion
