Diff of /markup/html/whatpm/Whatpm/ChangeLog

revision 1.178 by wakaba, Sun Mar 9 05:12:04 2008 UTC revision 1.325 by wakaba, Sat Sep 20 06:10:17 2008 UTC
1    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
2    
3            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
4    
5    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
6    
7            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
8    
9            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
10            matching.
11    
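A minimal sketch of why sorting 'Prefix' values by length can ease matching, written in Python for illustration only (the script itself is Perl and is not shown in this log).  It assumes a longest-first order so that the most specific prefix is tried first; the function names are hypothetical.

```python
def sort_prefixes(prefixes):
    """Order prefixes longest first (ties broken alphabetically)."""
    return sorted(prefixes, key=lambda p: (-len(p), p))


def longest_matching_prefix(tag, prefixes):
    """Return the most specific prefix that the tag starts with, or None.

    A prefix matches when it equals the tag or is followed by "-" in the
    tag; comparison is case-insensitive, as language tags are.
    """
    tag_l = tag.lower()
    for prefix in sort_prefixes(prefixes):
        p = prefix.lower()
        if tag_l == p or tag_l.startswith(p + "-"):
            return prefix
    return None
```

With the prefixes already ordered, the first hit in the loop is the most specific one, so no further comparison of candidates is needed.
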
12    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
13    
14            * LangTag.pm: Warn for private use language subtags.  Error level
15            typos fixed.  Support for Suppress-Script field.
16    
17            * mklangreg.pl: Support for dumping of nested structure.
18    
19    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
20    
21            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
22            recommended case as per RFC 4646.
23    
24    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
25    
26            * LangTag.pm (check_rfc4646_langtag): New method.
27    
28    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
29    
30            * mklangreg.pl: New script.
31    
32            * Makefile: Updated for creation of the module for language subtag
33            registry.
34            
35    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
36    
37            * Makefile: WebIDL.html added.
38    
39            * WebIDL.pod: New documentation.
40    
41    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
42    
43            * WebIDL.pm: Checker's error types are redefined.
44    
45    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
46    
47            * WebIDL.pm: Parser's error types are redefined.  Some
48            forward-compatible parsing bugs are fixed.  Some unreachable code
49            is commented out.
50    
51    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
52    
53            * WebIDL.pm: Support for the remaining extended attributes is
54            added.  It does not satisfy the definition that a forward
55            interface declaration has an extended attribute.  It seems that,
56            unless explicitly allowed, multiple extended attributes with the
57            same name are not allowed, though it is not explicitly mentioned in
58            the spec.
59    
60    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
61    
62            * WebIDL.pm: Unescape extended attribute names and extended
63            attribute identifiers.  Preserve whether an extended attribute has
64            an argument list or not.  Support for extended attributes:
65            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
66            NameGetter, NameSetter, and Null.
67            (has_argument_list): New attribute.
68            (idl_text): Stringify the argument list, if any, even if it is
69            empty.
70    
71    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
72    
73            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
74            |{s_kwd}| in DATA_STATE as default.
75    
76    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
77    
78            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
79            by |{s_kwd}| in DATA_STATE.
80    
81    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
82    
83            * HTML.pm.src: Shorten keys.
84    
85    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
86    
87            * HTML.pm.src: Remove checking for control character, surrogate
88            pair, or noncharacter code points and non-Unicode code
89            points (they should be handled by Whatpm::Charset::UnicodeChecker).
90            (parse_char_stream): Support for the |$get_wrapper| argument and
91            character stream error handlers.
92    
93    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
94    
95            * ContentChecker.pm: Don't call |load_ns_module|
96            for null-namespace elements/attributes.
97    
98            * HTML.pm.src: Factor out $disallowed_control_chars
99            as a hash.
100    
101    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
102    
103            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
104            and |{next_char}| initializations are moved to initialization
105            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
106            with |parse_char_stream|.
107    
108    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
109    
110            * HTML.pm.src (parse_char_stream): Make |set_next_char|
111            invoke |manakai_read_until|, not only |read|, where
112            possible, to decrease the number of |read| method calls.
113    
114            * mkhtmlparser.pl: Related changes to the aforementioned
115            modification.
116    
117    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
118    
119            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
120            now reports character errors.
121    
122    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
123    
124            * HTML.pm.src: Non-white-space character tokens with leading
125            white space in "before head insertion mode" were not
126            correctly handled.
127            (set_inner_html): Reimplemented using CharString decodehandle
128            class.  Support for $get_wrapper argument.  Support
129            for |{read_until}| feature.
130    
131    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
132    
133            * HTML.pm.src: Make a "bare ero" error for unknown
134            entities point to the "&" character.
135    
136    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
137    
138            * HTML.pm.src: It turns out that U+FFFD doesn't have to
139            be added to the list of excluded characters.
140    
141    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
142    
143            * HTML.pm.src ($char_onerror): Give the character decoder's |line|
144            and |column| a higher priority than the ones set by the
145            tokenizer's input handler.
146            ($self->{read_until}): Exclude U+FFFD (but this might
147            not be necessary, since now we do line/column fixup in
148            the character decode handle).
149    
150    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
151    
152            * HTML.pm.src: Use |{read_until}| where possible.
153    
154    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
155    
156            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
157            and |manakai_getc_until| to |manakai_read_until| to
158            reduce the number of string copies.
159    
160    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
161    
162            * HTML.pm.src (parse_char_string): Use newly created
163            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
164            standard feature to |open| a string as a filehandle,
165            since Perl's string filehandle does not seem to support the
166            |ungetc| method correctly.
167            (parse_char_stream): Define |{getc_until}| method.
168            (DATA_STATE): Experimental support for |getc_until| feature.
169    
170    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
171    
172            * HTML.pm.src: Check points added to newly added branches.
173    
174    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
175    
176            * HTML.pm.src: Remove |{char}|, which is no longer used.
177            Replace |{entity_in_attr}| and |{last_attribute_value_state}|
178            with |{prev_state}|.
179    
180            * mkhtmlparser.pl: Remove |{char}| feature.
181            Remove |!!!back-next-input-character;| macro.
182    
183    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
184    
185            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
186            entity related tokenizer states in favor of new states
187            implementing the consume character reference algorithm.
188    
189    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
190    
191            * HTML.pm.src: "Consume a character reference" algorithm is
192            now implemented as a tokenizer's state, rather than
193            a method, with minimum changes (more changes will
194            be made, in due course).  "Bogus comment state"'s inner
195            loop gets removed.
196    
197    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
198    
199            * HTML.pm.src: Move |PUBLIC| and |SYSTEM| keyword tokenizing
200            into their own tokenizer states.
201    
202    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
203    
204            * HTML.pm.src: |CDATA_SECTION_STATE| (formerly |CDATA_BLOCK_STATE|)
205            is split into three states.
206    
207    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
208    
209            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
210            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
211            no longer does the tokenizer have to push back next input
212            characters in those states.
213    
214    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
215    
216            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
217            into four states so that no longer does the tokenizer have to push
218            back next input characters in that state.
219    
220    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
221    
222            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
223            which can be used to insert some wrapper between the character
224            stream handle and the tokenizer.  (It is currently not supported
225            for |set_inner_html| for |Element|s).
226    
227    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
228    
229            * HTML.pm.src: Ignore punctuation in charset names.
230    
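A rough sketch of comparing charset names while ignoring punctuation, in Python for illustration only.  The exact set of ignored characters used by the parser is not stated in this log; ASCII punctuation and white space are assumed here, and the function names are hypothetical.

```python
import string

# Assumption: ASCII punctuation and white space are the ignored characters.
_IGNORED = set(string.punctuation) | set(string.whitespace)


def normalize_charset_name(name):
    """Lowercase the name and drop every ignored character."""
    return "".join(c for c in name.lower() if c not in _IGNORED)


def same_charset_name(a, b):
    """Compare two charset labels, ignoring case and punctuation."""
    return normalize_charset_name(a) == normalize_charset_name(b)
```

Under these assumptions, labels such as "UTF-8" and "utf8" compare equal, while "utf-8" and "utf-16" still differ.
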
231    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
232    
233            * ContentChecker.pm: Support for charset-layer error levels.
234    
235            * HTML.pm.src: Don't specify |text| argument for the
236            |chardecode:fallback| error, since it is not the encoding
237            being used alternatively.
238    
239    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
240    
241            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
242    
243    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
244    
245            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
246    
247    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
248    
249            * HTML.pm.src: Bug fix and sync with the spec with regard
250            to after after frameset insertion mode processing (HTML5
251            revision 1909).  Note that the implementation was wrong
252            per the old spec before the r1909 changes.
253    
254    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
255    
256            * HTMLTable.pm: scope=auto algorithm fix synced with the
257            spec (HTML5 revision 2093).
258            ($process_row): Algorithm step numbers synced with the
259            spec (HTML5 revision 2092).
260    
261    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
262    
263            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
264            revision 2094).
265    
266    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
267    
268            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
269    
270    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
271    
272            * HTML.pm.src: '"' and "'" at the end of attribute
273            name (after another attribute) now raise parse error (HTML5
274            revision 2123).  Empty unquoted attribute values are no
275            longer allowed (HTML5 revision 2122).
276    
277    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
278    
279            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
280            revision 2130).
281    
282    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
283    
284            * ContentChecker.pm: |xml:lang| attribute value must be the same
285            as |lang| attribute value for HTML elements (HTML5 revision 2062
286            and so on).
287    
288    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
289    
290            * ContentChecker.pm: Error level definition for |xml_id_error|
291            was missing.
292    
293            * URIChecker.pm: The end of the URL should be marked as the
294            error location for an empty path error.  The position
295            between the userinfo and the port components should be
296            marked as the error location for an empty host error.
297    
298    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
299    
300            * URIChecker.pm: Set parameters representing where in the
301            value the error occurs for errors.  Report unknown
302            address format error in warning level, since address
303            formats are rarely added.  Path segments starting with "/.."
304            were misinterpreted as a dot-segment.
305    
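A small sketch of the distinction behind the dot-segment fix, in Python for illustration only.  A dot-segment is a path segment that is exactly "." or ".."; a segment that merely starts with ".." (as in "/..b") is an ordinary segment.  The function name is hypothetical and the checker's real code is not shown in this log.

```python
def dot_segments(path):
    """Return (index, segment) pairs for segments that are exactly "." or ".."."""
    return [
        (index, segment)
        for index, segment in enumerate(path.split("/"))
        if segment in (".", "..")
    ]
```

Comparing whole segments, rather than testing whether the path contains or starts with "/..", avoids the misinterpretation described in the entry above.
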
306    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
307    
308            * URIChecker.pm (check_iri_reference): Requires
309            |Message::DOM::DOMImplementation|.
310    
311    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
312    
313            * IMTChecker.pm: Updated for the new error reporting architecture.
314    
315            * ContentChecker.pm: Error levels for IMTs are added.
316    
317    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
318    
319            * H2H.pm (_shift_token): Support for unquoted HTML attribute
320            values.
321    
322    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
323    
324            * CacheManifest.pm: Support for new style of error
325            reports.
326    
327            * HTML.pm.src: Set line=1, column=1 to the document node.
328    
329    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
330    
331            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
332            and URL checkers.  Support for more error levels for bogus
333            language tag and URL "standards".
334    
335            * LangTag.pm, URIChecker.pm: Support for new style error
336            level reporting.
337    
338    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
339    
340            * ContentChecker.pm: Support for RDF/XML error levels.
341    
342            * HTMLTable.pm, RDFXML.pm: Support for new style of error level
343            specifying.  Error types are revised.
344    
345    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
346    
347            * ContentChecker.pm: All error reporting method calls are
348            renewed.
349    
350    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
351    
352            * HTML.pm.src: All error type names and "text" parameters
353            are revised.  Use new style for "level" specification.
354    
355            * mkhtmlparser.pl: Use new style for "level" specification.
356    
357    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
358    
359            * WebIDL.pm (parse_char_string): Simplified error
360            reporting process for broken ignored valuetype definition.
361            (Valuetype idl_text): Support for special "DOMString" name.
362    
363    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
364    
365            * WebIDL.pm ($get_scoped_name): Append "::::" if the last
366            terminal of the ScopedName is "DOMString", such that whether
367            the last part of the scoped name is "DOMString" or "_DOMString"
368            can be determined later.  It is necessary to determine whether
369            a |typedef| definition should be ignored or not.
370            (parse_char_string): Unescape the identifier of
371            exception members.
372            ($resolve): Return undef for builtin types and sequence<T>
373            types (we might not have to do this, however...).
374            (check): Support checking for Exceptions, Valuetypes,
375            and Typedefs.
376            ($serialize_type): Support for "DOMString::::" syntax.
377            (Typedef idl_text): Output Type as "DOMString" if it
378            is really "DOMString" (i.e. its internal representation
379            is "::DOMString::").
380    
381    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
382    
383            * WebIDL.pm ($resolve): New code, based on resolve code
384            for constant types in the |check| method.
385            (check): Support for checking of attributes, operations, and
386            arguments.
387            (Attribute/Operation idl_text): Exception names in getraises,
388            setraises, and raises clauses are serialized by the
389            |$serialize_type| code.
390    
391    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
392    
393            * WebIDL.pm ($integer): Order of alternatives is changed to match
394            hexadecimal numbers (the original pattern, taken from the spec,
395            did not work for hexadecimal numbers, because after the "0" prefix
396            the [0-7]* part matches as an empty string and therefore
397            the remaining "x..." part of a "0x..." integer literal is
398            left unmatched).
399            ($get_type): It now returns a string, not an array reference,
400            for regular types and |sequence| types (i.e. it in any case
401            returns a string).
402            ($get_next_token): The second item in the array that represents
403            an integer or float token is now a Perl number value, not the
404            original string representation of the number.
405            (check): Support for const value consistency checking.
406            No extended attribute is defined for constants.
407            (Node subclasses): Use simple strings rather than array references
408            for default data type values.
409            ($serialize_type): Type values are now simple strings.
410            (value): If the new attribute value is a false value, then
411            a FALSE value is set to the attribute.
412    
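A minimal sketch of the alternation-order problem described for |$integer| above, in Python for illustration only.  Both patterns are hypothetical reconstructions, not the module's actual regexp or the spec's exact text; Python's |re| tries alternatives left to right in the same way as Perl.

```python
import re

# Hypothetical pattern with the octal alternative first: after "0",
# [0-7]* succeeds on the empty string, so "0x1F" only yields "0".
OCTAL_FIRST = re.compile(r"-?0(?:[0-7]*|[Xx][0-9A-Fa-f]+)|-?[1-9][0-9]*")

# Same alternatives with the hexadecimal branch tried first.
HEX_FIRST = re.compile(r"-?0(?:[Xx][0-9A-Fa-f]+|[0-7]*)|-?[1-9][0-9]*")


def leading_integer(pattern, text):
    """Return the integer literal matched at the start of text, or None."""
    match = pattern.match(text)
    return match.group(0) if match else None
```

With the octal branch first, the tokenizer would see "0" followed by a stray "x1F"; with the hexadecimal branch first, the whole "0x1F" literal is consumed, and octal and decimal literals still match as before.
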
413    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
414    
415            * WebIDL.pm ($get_scoped_name): Now scoped names are stored
416            in their stringified format ("scoped name" as defined in the
417            spec).  Note that a future version of this module should not use
418            array references for type values and the |type_text| attribute
419            should be made obsolete.
420            (parse_char_string): Unescape attribute names.
421            (check): Support for checking of whether inherited interfaces
422            are actually defined or not.  Support for checking of whether
423            interface member identifiers are duplicated or not.
424            ($serialize_type): Scoped names are returned as is.  A future
425            version of this code should escape identifiers other than "DOMString",
426            otherwise the idl_text would be non-conforming.
427    
428    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
429    
430            * WebIDL.pm (parse_char_string): Set line/column numbers
431            to generated nodes.  Unescape identifiers.  Extended attributes
432            for Definition's were ignored.
433            (append_child): Set |parent_node| attribute.
434            (parent_node): New attribute.
435            (check): Support interface/exception members.  Support
436            extended attributes.  Support definition identifier uniqueness
437            constraint.
438            (qualified_name): New attribute.
439            (Interface/Exception idl_text): Extended attributes were
440            not prepended to the returned text.
441    
442    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
443    
444            * WebIDL.pm (parse_char_string): Set line/column numbers
445            to interface object experimentally.  s/shift/pop/g, shift
446            would make things wrong.  Support for interface forward
447            declarations was missing.  Broken interface declarations
448            with no block were not ignored entirely.
449            (Whatpm::WebIDL::Node): New abstract class.  This class
450            makes things easier.
451            (child_nodes): New attribute.  Unlike DOM's attribute with the
452            same name, this attribute returns a dead (non-live) list of
453            nodes for simplicity.
454            (get_user_data, set_user_data): New methods.
455            (Module idl_text): A SPACE character should be inserted
456            before the |{| character.
457            (Interface idl_text): Support for interface forward declarations.
458            (is_forward_declaration): New attribute.
459    
460    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
461    
462            * WebIDL.pm (type_text): Better serializer.
463    
464    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
465    
466            * WebIDL.pm: Revise forward-compatible parsing so that
467            it now can handle broken extended attributes and as such.
468    
469    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
470    
471            * WebIDL.pm: Real support for extended attributes.
472            Support for extended attributes with arguments.
473    
474    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
475    
476            * WebIDL.pm: Support for |exception| syntax.
477            (Interface->idl_text): Tentative support for inheritances.
478    
479    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
480    
481            * WebIDL.pm: Hierarchical scoped name support was broken.
482            Support for raises, setraises, and getraises syntaxes.
483    
484    2008-07-18  Wakaba  <wakaba@suika.fam.cx>
485    
486            * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
487            proper support for types, extended attributes, and exceptions yet).
488            WebIDL parser, version 1 (no support for exceptions yet,
489            no proper support for extended attributes yet).
490    
491    2008-07-09  Wakaba  <wakaba@suika.fam.cx>
492    
493            * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
494    
495    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
496    
497            * WebIDL.pm: Support for valuetype and const.
498    
499    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
500            
501            * WebIDL.pm: New module.
502    
503    2008-06-15  Wakaba  <wakaba@suika.fam.cx>
504    
505            * Makefile (Entities.html): URI changed.
506    
507    2008-06-08  Wakaba  <wakaba@suika.fam.cx>
508    
509            * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
510    
511    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
512    
513            * HTML.pm.src (_get_next_token): A parse error was missing.
514    
515    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
516    
517            * mklinktypelist.pl: rel=contact is no longer part of the HTML5
518            spec (commented out). (HTML5 revision 1711).
519    
520    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
521    
522            * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
523    
524            * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
525            as UTF-16 (HTML5 revision 1701).
526    
527    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
528    
529            * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
530            1692).
531    
532    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
533    
534            * HTML.pm.src: The secondary insertion mode used when switching
535            to foreign content is the "in body" insertion mode (HTML5 revision
536            1696).
537    
538    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
539    
540            * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
541            revision 1697).
542    
543    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
544    
545            * HTML.pm.src: Support for end-of-file token in foreign content
546            insertion mode (HTML5 revision 1693).  Update SVG camelCase
547            attribute list (HTML5 revision 1700).  <textarea> closes
548            </select> (HTML5 revision 1699).  More start tags close in
549            foreign content insertion mode (HTML5 revision 1698).
550    
551    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
552    
553            * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
554    
555    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
556    
557            * HTML.pm.src: More robust charset parameter detection (HTML5
558            revision 1674).
559    
560    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
561    
562            * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
563            revision 1676).
564    
565    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
566    
567            * HTML.pm.src: Ignore language part of public identifiers for
568            quirks mode detection (HTML5 revision 1679).
569    
570    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
571    
572            * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
573            revision 1685).
574    
575    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
576    
577            * HTML.pm.src: Support for EOF in new states for tags (HTML5
578            revision 1684).
579    
580    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
581    
582            * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
583            work (HTML5 revision 1690).
584    
585    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
586    
587            * HTML.pm.src (_tree_construction_main): Change handling of
588            end tags in head insertion modes (HTML5 revision 1686).
589            (parse_char_string): Bug fix for non-utf8 character string handling.
590            (parse_char_stream): |ungetc| does not work well for this context.
591    
592    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
593    
594            * HTML.pm.src (parse_byte_string): Redefined to invoke
595            |parse_byte_stream|.
596            (parse_byte_stream): New method.
597    
598    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
599    
600            * HTML.pm.src (parse_byte_string): Fix the column number reported
601            by encoding layer error reporter.
602    
603    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
604    
605            * HTML.pm.src (parse_byte_string): Use streaming decoder
606            rather than converting the whole byte string and then parsing.
607            Propagate errors in character encoding layer.
608            (get_next_token): Precise error reporting for |bare stago| error.
609    
610    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
611    
612            * HTML.pm.src (parse_char_stream): New method.
613            (parse_char_string): This method is now defined as an invocation
614            of the |parse_char_stream| method.
615    
616    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
617    
618            * HTML.pm.src (parse_byte_string): Report various status
619            of the sniffing as info-level errors.  Support for new
620            decoding framework in parser resetting.
621            (new): Various default error levels were not set.
622    
623    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
624    
625            * HTML.pm.src (parse_byte_string): HTML5 encoding sniffing
626            algorithm, except for the actual sniffing, is implemented
627            with new framework with Message::Charset::Info.
628    
629    2008-05-16  Wakaba  <wakaba@suika.fam.cx>
630    
631            * CacheManifest.pm (_parse): Drop fragment identifiers from
632            URIs in fallback section (HTML5 revision 1596).
633    
634    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
635    
636            * Makefile (Entities.html): URI has changed.
637    
638    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
639    
640            * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
641            1553).
642    
643    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
644    
645            * ContentChecker.pm: Noted that those returned in |table| are
646            no longer table elements, but table objects returned
647            by Whatpm::HTMLTable.
648    
649            * HTMLTable.pm (form_table): Return table element node
650            as |$table->{element}|.
651            (assign_header): Support for the |headers=""| attribute.
652    
653    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
654    
655            * HTMLTable.pm (assign_header): New function; first version
656            with no support for headers="".
657            (form_table): Include table width and height to the returned
658            table object for convenience.  Indexing in column assignment was wrong.
659            Set whether a data cell is empty or not for convenience.
660    
661    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
662    
663            * HTMLTable.pm: More robust caption support (HTML5 revision 1393).
664    
665    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
666    
667            * HTMLTable.pm: How table model errors are detected is
668            changed (HTML5 revision 1387).
669    
670    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
671    
672            * HTMLTable.pm: The algorithm now moves |tfoot| elements
673            to the end of the table (HTML5 revision 1380).
674    
675    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
676    
677            * HTMLTable.pm: The algorithm is now 0-based indexing, instead
678            of 1-based (HTML5 revision 1376).
679    
680    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
681    
682            * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
683            and "Content-Encoding" no longer prevent sniffing (HTML5
684            revision 1288).
685    
686    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
687    
688            * ContentType.pm: Skip BOMs in feed or HTML algorithm (HTML5 revision
689            1282).
690    
691    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
692    
693            * ContentChecker.pm: Support for global attributes.
694            Status of XML specs are added.
695    
696    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
697    
698            * ContentChecker.pm (check_element): Support for |template=""|
699            and |ref=""| attribute (referent element type checking).
700    
701    2008-04-29  Wakaba  <wakaba@suika.fam.cx>
702    
703            * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
704            1500) is implemented (except for IDNA part and URI-scheme-specific
705            knowledge).  Line number counting was wrong for LF-only
706            documents.
707    
708    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
709    
710            * HTML.pm.src: Raise a parse error for any disallowed
711            character (HTML5 revision 1263).
712    
713    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
714    
715            * mkentitylist.pl: Support for new HTML5 entity table format (the
716            definition for |AElig;| was missing).
717    
718    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
719    
720            * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
721            name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
722    
723    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
724    
725            * HTML.pm.src: List of element names that close foreign content
726            insertion mode is added (HTML5 revisions 1412 and 1418).
727    
728    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
729    
730            * HTML.pm.src: Support for |mglyph| and |malignmark|
731            elements (HTML5 revision 1410).
732    
733    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
734    
735            * HTML.pm.src: Support for new long MathML entities (HTML5
736            revision 1406).
737    
738    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
739    
740            * HTML.pm.src: CDATA section support for MathML and SVG
741            elements (HTML5 revisions 1404 and 1420).
742    
743    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
744    
745            * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
746            elements (HTML5 revision 1404).  Unused !!!macro definitions
747            are removed.
748    
749    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
750    
751            * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
752            are raised is changed (HTML5 revision 1404).
753    
754    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
755    
756            * HTML.pm.src: Category sets that are no longer used are removed.
757    
758    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
759    
760            * HTML.pm.src: The ->[1] properties of stack entries are now
761            replaced by constants representing element category.
762    
763    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
764    
765            * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
766            for error reporting.  (This is a preparation for using constant
767            value for ->[1].)
768    
769    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
770    
771            * RDFXML.pm: Typo fixed.
772    
773    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
774    
775            * ContentChecker.pm: |fact_level| is now treated
776            the same as |must_level|, i.e. level = |m|.
777            (check_element): Make list of URIs in the DOM.
778    
779    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
780    
781            * RDFXML.pm: Language accessor implemented.
782            Local (null-namespace) attribute support.
783    
784    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
785    
786            * RDFXML.pm: Factored out ID checking code.
787    
788    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
789    
790            * RDFXML.pm: TODO items noted.  Validation of ID
791            and URI attributes is implemented.  Warn if unknown
792            value is used in rdf:parseType="" attribute.
793    
794            * URIChecker.pm (check_rdf_uri_reference): New function.
795    
796    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
797    
798            * RDFXML.pm: bnodeid implemented.  Relative references
799            are now resolved.
800    
801    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
802    
803            * ContentChecker.pm: RDF reification implemented.
804    
805            * RDFXML.pm: undef vs false bug fixed.
806            Reification implemented.
807    
808    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
809    
810            * RDFXML.pm: s/id/ID/ for attribute name.
811            The |node| arguments are added for |ontriple| calls.
812            Too many "attribute not allowed" errors were raised.
813    
814            * ContentChecker.pm: Initial experimental support for rdf:RDF
815            element.
816    
817    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
818    
819            * RDFXML.pm: New module.
820    
821    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
822    
823            * HTML.pm.src (set_inner_html): Line/column number
824            code was still the old one.
825    
826    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
827    
828            * HTML.pm.src: Better line/column reporting for "duplicate attribute"
829            errors.  Line/column markings for DOCTYPE, comment, and
830            character tokens are reintroduced; otherwise,
831            error location for "not HTML5" error and errors
832            for implied elements are not attached.
833    
834    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
835    
836            * HTML.pm.src: Set line/column numbers to attributes.
837    
838            * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
839            (value): Setter implemented.
840    
841            * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
842    
843    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
844    
845            * HTML.pm.src: Unused line/column markings are removed.
846    
847    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
848    
849            * HTML.pm.src (_get_next_token): Remove |first_start_tag|
850            flag, which is no longer used.
851    
852    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
853    
854            * HTML.pm.src: Set line/column information to element nodes.
855    
856            * mkhtmlparser.pl (!!!create-element, !!!insert-element,
857            and !!!insert-element-t): Set line/column information to
858            element nodes.
859    
860    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
861    
862            * HTML.pm.src (_get_next_token): The first "<" character
863            in "<?", "<>", or "</>" should be the error point.
864    
865    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
866    
867            * HTML.pm.src: Some more fixes on error position reporting.
868    
869    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
870    
871            * HTML.pm.src: Token-level precise error reporting.
872    
873    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
874    
875            * HTML.pm.src: Preparation for more precise error point
876            reporting.
877    
878    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
879    
880            * HTML.pm.src: Error type revised.
881    
882    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
883    
884            * HTML.pm.src: Similar codes are merged together, again.
885    
886    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
887    
888            * HTML.pm.src: Similar codes are merged together.
889    
890    2008-03-10  Wakaba  <wakaba@suika.fam.cx>
891    
892            * mkhtmlparser.pl: Set "level" parameter to parse errors.
893    
894            * HTML.pm.src: Code refined.
895    
896    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
897    
898            * HTML.pm.src: |</body>| treatment has been changed (HTML5
899            revision 1348).  Note that I really don't know whether this makes
900            any difference in the black-box behavior of the parser.
901    
902    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
903    
904            * HTML.pm.src: New end-of-file token implementation (HTML5
905            revision 1348).
906    
907    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
908    
909            * HTML.pm.src: |applet| support (HTML5 revision 1347).
910    
911    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
912    
913            * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
914    
915    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
916    
917            * HTML.pm.src: Support for |<input>| in the "in select" insertion
