Diff of /markup/html/whatpm/Whatpm/ChangeLog


revision 1.178 by wakaba, Sun Mar 9 05:12:04 2008 UTC
revision 1.331 by wakaba, Sun Sep 21 09:45:02 2008 UTC
1    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
2    
3            * ContentChecker.pm: Prepare for |form| |name| attribute's
4            duplication checking.
5    
6    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
7    
8            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
9            unsupported charset="" parameter value (HTML5 revision 2131).
10    
11    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
12    
13            * HTML.pm.src: Remaining places where U+000B was allowed as a space
14            character are fixed (cf. HTML5 revision 1738).
15    
16            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
17            space characters (HTML5 revision 1738).
18    
19    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
20    
21            * HTML.pm.src: The "anything else" case for the "after after body"
22            insertion mode was not updated to switch to the "in body"
23            insertion mode.  U+000B is no longer a space character for the
24            purpose of tree construction phase (HTML5 revision 1738).
25    
26    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
27    
28            * HTML.pm.src: U+000B is no longer a space character (HTML5
29            revision 1738).
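
The U+000B change in the entries above can be illustrated with a minimal sketch. It is written in Python rather than the Perl of the actual modules, and the names `HTML_SPACE_CHARS`, `is_html_space`, and `strip_html_space` are hypothetical. The five-character set is the HTML5 definition of space characters, not something copied from Whatpm.

```python
# HTML5 "space characters" once U+000B (vertical tab) is excluded:
# U+0009, U+000A, U+000C, U+000D, U+0020.
HTML_SPACE_CHARS = frozenset("\t\n\f\r ")


def is_html_space(ch: str) -> bool:
    """Return True if ch is one of the HTML5 space characters."""
    return ch in HTML_SPACE_CHARS


def strip_html_space(text: str) -> str:
    """Strip leading and trailing HTML5 space characters only."""
    return text.strip("".join(HTML_SPACE_CHARS))
```

A generic whitespace test is not a substitute here: Python's `str.isspace()`, for example, still treats U+000B as white space, so an explicit set is the safer choice.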
30    
31    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
32    
33            * ContentType.pm: 0x0B is no longer a space character (HTML5
34            revision 1738).
35    
36            * HTML.pm.src: U+000B is no longer a space character for the
37            algorithm for extracting an encoding from a Content-Type (HTML5
38            revision 1738).
39    
40    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
41    
42            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
43    
44    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
45    
46            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
47    
48            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
49            matching.
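
Why sorting 'Prefix' values by length eases matching can be sketched as follows. This is an illustration in Python, not the Perl of mklangreg.pl. It assumes longest-first order (the entry does not state the direction), and the function names and sample prefixes are hypothetical.

```python
def sort_prefixes(prefixes):
    """Order prefixes longest first so the most specific one is tried first."""
    return sorted(prefixes, key=len, reverse=True)


def first_matching_prefix(tag, prefixes):
    """Return the first prefix that matches tag on a subtag boundary, or None."""
    tag = tag.lower()
    for prefix in sort_prefixes(prefixes):
        p = prefix.lower()
        # A prefix matches only as a whole tag or when followed by "-".
        if tag == p or tag.startswith(p + "-"):
            return prefix
    return None
```

With the list pre-sorted, the first hit is already the most specific prefix, so the caller needs no second pass to compare candidates.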
50    
51    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
52    
53            * LangTag.pm: Warn for private use language subtags.  Error level
54            typos fixed.  Support for Suppress-Script field.
55    
56            * mklangreg.pl: Support for dumping of nested structure.
57    
58    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
59    
60            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
61            recommended case as per RFC 4646.
62    
63    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
64    
65            * LangTag.pm (check_rfc4646_langtag): New method.
66    
67    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
68    
69            * mklangreg.pl: New script.
70    
71            * Makefile: Updated for creation of the module for language subtag
72            registry.
73            
74    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
75    
76            * Makefile: WebIDL.html added.
77    
78            * WebIDL.pod: New documentation.
79    
80    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
81    
82            * WebIDL.pm: Checker's error types are redefined.
83    
84    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
85    
86            * WebIDL.pm: Parser's error types are redefined.  Some forward
87            compatible parsing bugs are fixed.  Some unreachable code is
88            commented out.
89    
90    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
91    
92            * WebIDL.pm: Support for the remaining extended attributes is
93            added.  It does not satisfy the definition that a forward
94            interface declaration has an extended attribute.  It seems that,
95            unless explicitly allowed, multiple extended attributes with the
96            same name are not allowed, though it is not explicitly mentioned in
97            the spec.
98    
99    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
100    
101            * WebIDL.pm: Unescapes extended attribute names and extended
102            attribute identifiers.  Preserve whether an extended attribute has
103            an argument list or not.  Support for extended attributes:
104            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
105            NameGetter, NameSetter, and Null.
106            (has_argument_list): New attribute.
107            (idl_text): Stringifies argument lists, if any, even if it is
108            empty.
109    
110    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
111    
112            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
113            |{s_kwd}| in DATA_STATE as default.
114    
115    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
116    
117            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
118            by |{s_kwd}| in DATA_STATE.
119    
120    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
121    
122            * HTML.pm.src: Shorten keys.
123    
124    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
125    
126            * HTML.pm.src: Remove checking for control character, surrogate
127            pair, or noncharacter code points and non-Unicode code
128            points (they should be handled by Whatpm::Charset::UnicodeChecker).
129            (parse_char_stream): Support for the |$get_wrapper| argument and
130            character stream error handlers.
131    
132    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
133    
134            * ContentChecker.pm: Don't call |load_ns_module|
135            for null-namespace elements/attributes.
136    
137            * HTML.pm.src: Factor out $disallowed_control_chars
138            as a hash.
139    
140    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
141    
142            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
143            and |{next_char}| initializations are moved to initialization
144            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
145            with |parse_char_stream|.
146    
147    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
148    
149            * HTML.pm.src (parse_char_stream): Make |set_next_char|
150            invoke |manakai_read_until|, not only |read|, where
151            possible, to decrease the number of |read| method calls.
152    
153            * mkhtmlparser.pl: Related changes to the aforementioned
154            modification.
155    
156    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
157    
158            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
159            now reports character errors.
160    
161    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
162    
163            * HTML.pm.src: Non-white-space character tokens preceded by
164            white space in the "before head" insertion mode were not
165            correctly handled.
166            (set_inner_html): Reimplemented using CharString decodehandle
167            class.  Support for $get_wrapper argument.  Support
168            for |{read_until}| feature.
169    
170    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
171    
172            * HTML.pm.src: Make a "bare ero" error for unknown
173            entities point to the "&" character.
174    
175    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
176    
177            * HTML.pm.src: It turns out that U+FFFD doesn't have to
178            be added to the list of excluded characters.
179    
180    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
181    
182            * HTML.pm.src ($char_onerror): Give the character decoder's |line|
183            and |column| a higher priority than the ones set by the
184            tokenizer's input handler.
185            ($self->{read_until}): Exclude U+FFFD (but this might
186            not be necessary, since now we do line/column fixup in
187            the character decode handle).
188    
189    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
190    
191            * HTML.pm.src: Use |{read_until}| where possible.
192    
193    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
194    
195            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
196            and |manakai_getc_until| to |manakai_read_until| to
197            reduce the number of string copies.
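
The idea behind |{read_until}| in the entries above, consuming a whole run of ordinary characters in one call instead of one character at a time, can be sketched as below. This is a Python illustration under stated assumptions: `CharStream` and its method signatures are hypothetical and do not reproduce the Whatpm decode handle or the real |manakai_read_until| interface.

```python
import re


class CharStream:
    """Hypothetical in-memory character stream; not the Whatpm decode handle."""

    def __init__(self, text: str) -> None:
        self._text = text
        self._pos = 0

    def read(self) -> str:
        """Return the next character, or '' at end of input."""
        if self._pos >= len(self._text):
            return ""
        ch = self._text[self._pos]
        self._pos += 1
        return ch

    def read_until(self, stop: "re.Pattern[str]") -> str:
        """Return the run of characters before the next match of stop.

        The run may be empty; the stopping character itself is not consumed.
        """
        match = stop.search(self._text, self._pos)
        end = match.start() if match else len(self._text)
        run = self._text[self._pos:end]
        self._pos = end
        return run
```

A tokenizer in its data state could call `read_until` with a pattern such as `[<&]` and emit the returned run as one character token, falling back to `read` only for the characters that need state transitions.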
198    
199    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
200    
201            * HTML.pm.src (parse_char_string): Use newly created
202            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
203            standard feature to |open| a string as a filehandle,
204            since Perl's string filehandle does not seem to support the |ungetc|
205            method correctly.
206            (parse_char_stream): Define |{getc_until}| method.
207            (DATA_STATE): Experimental support for |getc_until| feature.
208    
209    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
210    
211            * HTML.pm.src: Check points added to newly added branches.
212    
213    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
214    
215            * HTML.pm.src: Remove |{char}|, which is no longer used.
216            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
217            and replace them with |{prev_state}|.
218    
219            * mkhtmlparser.pl: Remove |{char}| feature.
220            Remove |!!!back-next-input-character;| macro.
221    
222    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
223    
224            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
225            entity related tokenizer states in favor of new states
226            implementing the consume character reference algorithm.
227    
228    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
229    
230            * HTML.pm.src: "Consume a character reference" algorithm is
231            now implemented as a tokenizer's state, rather than
232            a method, with minimum changes (more changes will
233            be made, in due course).  "Bogus comment state"'s inner
234            loop gets removed.
235    
236    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
237    
238            * HTML.pm.src: Move |PUBLIC| and |SYSTEM| keyword tokenizing
239            into their own tokenizer states.
240    
241    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
242    
243            * HTML.pm.src: |CDATA_SECTION_STATE| (formerly |CDATA_BLOCK_STATE|)
244            is split into three states.
245    
246    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
247    
248            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
249            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
250            no longer does the tokenizer have to push back next input
251            characters in those states.
252    
253    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
254    
255            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
256            into four states so that no longer does the tokenizer have to push
257            back next input characters in that state.
258    
259    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
260    
261            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
262            which can be used to insert some wrapper between the character
263            stream handle and the tokenizer.  (It is currently not supported
264            for |set_inner_html| for |Element|s).
265    
266    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
267    
268            * HTML.pm.src: Ignore punctuation in charset names.
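
A minimal sketch of comparing charset names while ignoring punctuation follows. It is in Python, not the module's Perl, and it assumes that "punctuation" means ASCII punctuation and that the comparison is case-insensitive. The function names are hypothetical.

```python
import string

# Assumption: drop ASCII punctuation only; white space is not handled here.
_DROP_PUNCT = str.maketrans("", "", string.punctuation)


def normalize_charset_name(name: str) -> str:
    """Remove ASCII punctuation and lowercase the name for comparison."""
    return name.translate(_DROP_PUNCT).lower()


def same_charset_name(a: str, b: str) -> bool:
    """Compare two charset labels, ignoring punctuation and case."""
    return normalize_charset_name(a) == normalize_charset_name(b)
```

Under these assumptions `UTF-8` and `utf8` compare equal, while `utf-8` and `utf-16` still differ.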
269    
270    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
271    
272            * ContentChecker.pm: Support for charset-layer error levels.
273    
274            * HTML.pm.src: Don't specify |text| argument for the
275            |chardecode:fallback| error, since it is not the encoding
276            being used alternatively.
277    
278    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
279    
280            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
281    
282    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
283    
284            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
285    
286    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
287    
288            * HTML.pm.src: Bug fix and sync with the spec with regard
289            to after after frameset insertion mode processing (HTML5
290            revision 1909).  Note that the implementation was wrong
291            per the old spec before the r1909 changes.
292    
293    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
294    
295            * HTMLTable.pm: scope=auto algorithm fix synced with the
296            spec (HTML5 revision 2093).
297            ($process_row): Algorithm step numbers synced with the
298            spec (HTML5 revision 2092).
299    
300    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
301    
302            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
303            revision 2094).
304    
305    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
306    
307            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
308    
309    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
310    
311            * HTML.pm.src: '"' and "'" at the end of attribute
312            name (after another attribute) now raise parse error (HTML5
313            revision 2123).  Empty unquoted attribute values are no
314            longer allowed (HTML5 revision 2122).
315    
316    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
317    
318            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
319            revision 2130).
320    
321    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
322    
323            * ContentChecker.pm: |xml:lang| attribute value must be the same
324            as |lang| attribute value for HTML elements (HTML5 revision 2062
325            and so on).
326    
327    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
328    
329            * ContentChecker.pm: Error level definition for |xml_id_error|
330            was missing.
331    
332            * URIChecker.pm: The end of the URL should be marked as the
333            error location for an empty path error.  The position
334            between the userinfo and the port components should be
335            marked as the error location for an empty host error.
336    
337    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
338    
339            * URIChecker.pm: Set parameters representing where in the
340            value the error occurs for errors.  Report unknown
341            address format error in warning level, since address
342            formats are rarely added.  Path segments starting with "/.."
343            were misinterpreted as a dot-segment.
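
The dot-segment fix in the entry above comes down to exact comparison rather than prefix comparison. The sketch below is a Python illustration, not the URIChecker.pm code, and its function names are hypothetical.

```python
def is_dot_segment(segment: str) -> bool:
    """A dot-segment is exactly '.' or '..', never a longer segment."""
    return segment in (".", "..")


def dot_segments_in_path(path: str):
    """Return the dot-segments found in a '/'-separated path, in order."""
    return [seg for seg in path.split("/") if is_dot_segment(seg)]
```

A segment such as `..c` or `.hidden` merely begins with a dot and is an ordinary path segment, so a check written as a prefix test on "/.." would flag it wrongly.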
344    
345    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
346    
347            * URIChecker.pm (check_iri_reference): Requires
348            |Message::DOM::DOMImplementation|.
349    
350    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
351    
352            * IMTChecker.pm: Updated for the new error reporting architecture.
353    
354            * ContentChecker.pm: Error levels for IMTs are added.
355    
356    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
357    
358            * H2H.pm (_shift_token): Support for unquoted HTML attribute
359            values.
360    
361    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
362    
363            * CacheManifest.pm: Support for new style of error
364            reports.
365    
366            * HTML.pm.src: Set line=1, column=1 to the document node.
367    
368    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
369    
370            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
371            and URL checkers.  Support for more error levels for bogus
372            language tag and URL "standards".
373    
374            * LangTag.pm, URIChecker.pm: Support for new style error
375            level reporting.
376    
377    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
378    
379            * ContentChecker.pm: Support for RDF/XML error levels.
380    
381            * HTMLTable.pm, RDFXML.pm: Support for new style of error level
382            specifying.  Error types are revised.
383    
384    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
385    
386            * ContentChecker.pm: All error reporting method calls are
387            renewed.
388    
389    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
390    
391            * HTML.pm.src: All error type names and "text" parameters
392            are revised.  Use new style for "level" specification.
393    
394            * mkhtmlparser.pl: Use new style for "level" specification.
395    
396    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
397    
398            * WebIDL.pm (parse_char_string): Simplified error
399            reporting process for broken ignored valuetype definition.
400            (Valuetype idl_text): Support for special "DOMString" name.
401    
402    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
403    
404            * WebIDL.pm ($get_scoped_name): Append "::::" if the last
405            terminal of the ScopedName is "DOMString", such that whether
406            the last part of the scoped name is "DOMString" or "_DOMString"
407            can be determined later.  This is needed to decide whether a |typedef|
408            definition should be ignored or not.
409            (parse_char_string): Unescape the identifier of
410            exception members.
411            ($resolve): Return undef for builtin types and sequence<T>
412            types (we might not have to do this, however...).
413            (check): Support checking for Exceptions, Valuetypes,
414            and Typedefs.
415            ($serialize_type): Support for "DOMString::::" syntax.
416            (Typedef idl_text): Output Type as "DOMString" if it
417            is really "DOMString" (i.e. its internal representation
418            is "::DOMString::").
419    
420    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
421    
422            * WebIDL.pm ($resolve): New code, based on resolve code
423            for constant types in the |check| method.
424            (check): Support for checking of attributes, operations, and
425            arguments.
426            (Attribute/Operation idl_text): Exception names in getraises,
427            setraises, and raises clauses are serialized by the |$serialize_type|
428            code.
429    
430    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
431    
432            * WebIDL.pm ($integer): Order of alternatives is changed to match
433            hexadecimal numbers (the original pattern, taken from the spec,
434            did not work for hexadecimal numbers, because after the "0" prefix
435            the [0-7]* part matches (as an empty string) and therefore
436            it does not match the remaining "x..." part of a "0x..." integer
437            literal).
438            ($get_type): It now returns a string, not an array reference,
439            for regular types and |sequence| types (i.e. it in any case
440            returns a string).
441            ($get_next_token): The second item in the array that represents
442            an integer or float token is now a Perl number value, not the
443            original string representation of the number.
444            (check): Support for const value consistency checking.
445            No extended attribute is defined for constants.
446            (Node subclasses): Use simple strings rather than array references
447            for default data type values.
448            ($serialize_type): Type values are now simple strings.
449            (value): If the new attribute value is a false value, then
450            a FALSE value is set to the attribute.
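
The alternation-order problem described for |$integer| above can be reproduced with a small sketch. It is written in Python, whose regular expressions try alternatives left to right as Perl's do. Both patterns are assumptions modelled on the entry's description, not copies of the WebIDL.pm source or of the spec text.

```python
import re

# Assumed shape of the original pattern: octal alternative listed first.
OCTAL_FIRST = re.compile(r"-?0(?:[0-7]*|[Xx][0-9A-Fa-f]+)|-?[1-9][0-9]*")
# Reordered: the hexadecimal alternative is tried before the octal one.
HEX_FIRST = re.compile(r"-?0(?:[Xx][0-9A-Fa-f]+|[0-7]*)|-?[1-9][0-9]*")


def leading_integer(pattern, text):
    """Return the integer literal matched at the start of text, or None."""
    m = pattern.match(text)
    return m.group(0) if m else None
```

With the octal alternative first, `[0-7]*` succeeds on the empty string right after the "0", so the match ends there and the "x..." part is left unconsumed. Trying the hexadecimal alternative first lets the whole "0x..." literal match while octal and decimal literals behave as before.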
451    
452    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
453    
454            * WebIDL.pm ($get_scoped_name): Now scoped names are stored
455            in its stringified format ("scoped name" as defined in the
456            spec).  Note that a future version of this module should not use
457            array references for type values and the |type_text| attribute
458            should be made obsolete.
459            (parse_char_string): Unescape attribute names.
460            (check): Support for checking of whether inherited interfaces
461            are actually defined or not.  Support for checking of whether
462            interface member identifiers are duplicated or not.
463            ($serialize_type): Scoped names are returned as is.  A future
464            version of this code should escape identifiers other than "DOMString",
465            otherwise the idl_text would be non-conforming.
466    
467    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
468    
469            * WebIDL.pm (parse_char_string): Set line/column numbers
470            to generated nodes.  Unescape identifiers.  Extended attributes
471            for Definition's were ignored.
472            (append_child): Set |parent_node| attribute.
473            (parent_node): New attribute.
474            (check): Support interface/exception members.  Support
475            extended attributes.  Support definition identifier uniqueness
476            constraint.
477            (qualified_name): New attribute.
478            (Interface/Exception idl_text): Extended attributes were
479            not prepended to the returned text.
480    
481    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
482    
483            * WebIDL.pm (parse_char_string): Set line/column numbers
484            to interface object experimentally.  s/shift/pop/g, shift
485            would make things wrong.  Support for interface forward
486            declarations was missing.  Broken interface declarations
487            with no block were not ignored entirely.
488            (Whatpm::WebIDL::Node): New abstract class.  This class
489            makes things easier.
490            (child_nodes): New attribute.  Unlike DOM's attribute with
491            the same name, this attribute returns a dead list of nodes for
492            simplicity.
493            (get_user_data, set_user_data): New methods.
494            (Module idl_text): A SPACE character should be inserted
495            before the |{| character.
496            (Interface idl_text): Support for interface forward declarations.
497            (is_forward_declaration): New attribute.
498    
499    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
500    
501            * WebIDL.pm (type_text): Better serializer.
502    
503    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
504    
505            * WebIDL.pm: Revise forward-compatible parsing so that
506            it can now handle broken extended attributes and the like.
507    
508    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
509    
510            * WebIDL.pm: Real support for extended attributes.
511            Support for extended attributes with arguments.
512    
513    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
514    
515            * WebIDL.pm: Support for |exception| syntax.
516            (Interface->idl_text): Tentative support for inheritances.
517    
518    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
519    
520            * WebIDL.pm: Hierarchical scoped name support was broken.
521            Support for raises, setraises, and getraises syntaxes.
522    
523    2008-07-18  Wakaba  <wakaba@suika.fam.cx>
524    
525            * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
526            proper support for types, extended attributes, and exceptions yet).
527            WebIDL parser, version 1 (no support for exceptions yet,
528            no proper support for extended attributes yet).
529    
530    2008-07-09  Wakaba  <wakaba@suika.fam.cx>
531    
532            * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
533    
534    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
535    
536            * WebIDL.pm: Support for valuetype and const.
537    
538    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
539            
540            * WebIDL.pm: New module.
541    
542    2008-06-15  Wakaba  <wakaba@suika.fam.cx>
543    
544            * Makefile (Entities.html): URI changed.
545    
546    2008-06-08  Wakaba  <wakaba@suika.fam.cx>
547    
548            * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
549    
550    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
551    
552            * HTML.pm.src (_get_next_token): A parse error was missing.
553    
554    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
555    
556            * mklinktypelist.pl: rel=contact is no longer part of the HTML5
557            spec (commented out). (HTML5 revision 1711).
558    
559    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
560    
561            * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
562    
563            * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
564            as UTF-16 (HTML5 revision 1701).
565    
566    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
567    
568            * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
569            1692).
570    
571    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
572    
573            * HTML.pm.src: The secondary insertion mode used when switching
574            to foreign content is the "in body" insertion mode (HTML5 revision
575            1696).
576    
577    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
578    
579            * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
580            revision 1697).
581    
582    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
583    
584            * HTML.pm.src: Support for end-of-file token in foreign content
585            insertion mode (HTML5 revision 1693).  Update SVG camelCase
586            attribute list (HTML5 revision 1700).  <textarea> closes
587            </select> (HTML5 revision 1699).  More start tags close in
588            foreign content insertion mode (HTML5 revision 1698).
589    
590    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
591    
592            * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
593    
594    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
595    
596            * HTML.pm.src: More robust charset parameter detection (HTML5
597            revision 1674).
598    
599    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
600    
601            * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
602            revision 1676).
603    
604    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
605    
606            * HTML.pm.src: Ignore language part of public identifiers for
607            quirks mode detection (HTML5 revision 1679).
608    
609    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
610    
611            * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
612            revision 1685).
613    
614    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
615    
616            * HTML.pm.src: Support for EOF in new states for tags (HTML5
617            revision 1684).
618    
619    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
620    
621            * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
622            work (HTML5 revision 1690).
623    
624    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
625    
626            * HTML.pm.src (_tree_construction_main): Change handling of
627            end tags in head insertion modes (HTML5 revision 1686).
628            (parse_char_string): Bug fix for non-utf8 character string handling.
629            (parse_char_stream): |ungetc| does not work well for this context.
630    
631    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
632    
633            * HTML.pm.src (parse_byte_string): Redefined to invoke
634            |parse_byte_stream|.
635            (parse_byte_stream): New method.
636    
637    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
638    
639            * HTML.pm.src (parse_byte_string): Fix the column number reported
640            by encoding layer error reporter.
641    
642    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
643    
644            * HTML.pm.src (parse_byte_string): Use streaming decoder
645            rather than converting the whole byte string and then parsing.
646            Propagate errors in character encoding layer.
647            (get_next_token): Precise error reporting for |bare stago| error.
648    
649    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
650    
651            * HTML.pm.src (parse_char_stream): New method.
652            (parse_char_string): This method is now defined as an invocation
653            of the |parse_char_stream| method.
654    
655    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
656    
657            * HTML.pm.src (parse_byte_string): Report various status
658            of the sniffing as info-level errors.  Support for new
659            decoding framework in parser resetting.
660            (new): Various default error levels were not set.
661    
662    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
663    
664            * HTML.pm.src (parse_byte_string): HTML5 encoding sniffing
665            algorithm, except for the actual sniffing, is implemented
666            with new framework with Message::Charset::Info.
667    
668    2008-05-16  Wakaba  <wakaba@suika.fam.cx>
669    
670            * CacheManifest.pm (_parse): Drop fragment identifiers from
671            URIs in fallback section (HTML5 revision 1596).
672    
673    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
674    
675            * Makefile (Entities.html): URI has changed.
676    
677    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
678    
679            * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
680            1553).
681    
682    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
683    
684            * ContentChecker.pm: Noted that those returned in |table| are
685            no longer table elements, but table objects returned
686            by Whatpm::HTMLTable.
687    
688            * HTMLTable.pm (form_table): Return table element node
689            as |$table->{element}|.
690            (assign_header): Support for the |headers=""| attribute.
691    
692    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
693    
694            * HTMLTable.pm (assign_header): New function; first version
695            with no support for headers="".
696            (form_table): Include table width and height to the returned
697            table object for convenience.  Indexing in column assignment was wrong.
698            Set whether a data cell is empty or not for convenience.
699    
700    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
701    
702            * HTMLTable.pm: More robust caption support (HTML5 revision 1393).
703    
704    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
705    
706            * HTMLTable.pm: How table model errors are detected is
707            changed (HTML5 revision 1387).
708    
709    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
710    
711            * HTMLTable.pm: The algorithm now moves |tfoot| elements
712            to the end of the table (HTML5 revision 1380).
713    
714    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
715    
716            * HTMLTable.pm: The algorithm is now 0-based indexing, instead
717            of 1-based (HTML5 revision 1376).
718    
719    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
720    
721            * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
722            and "Content-Encoding" no longer prevent sniffing (HTML5
723            revision 1288).
724    
725    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
726    
727            * ContentType.pm: Skip BOMs in feed or HTML algorithm (HTML5 revision
728            1282).
729    
730    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
731    
732            * ContentChecker.pm: Support for global attributes.
733            Status of XML specs are added.
734    
735    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
736    
737            * ContentChecker.pm (check_element): Support for |template=""|
738            and |ref=""| attribute (referent element type checking).
739    
740    2008-04-29  Wakaba  <wakaba@suika.fam.cx>
741    
742            * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
743            1500) is implemented (except for IDNA part and URI-scheme-specific
744            knowledge).  Line number counting was wrong for LF-only
745            documents.
746    
747    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
748    
749            * HTML.pm.src: Raise a parse error for any disallowed
750            character (HTML5 revision 1263).
751    
752    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
753    
754            * mkentitylist.pl: Support for new HTML5 entity table format (the
755            definition for |AElig;| was missing).
756    
757    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
758    
759            * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
760            name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
761    
762    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
763    
764            * HTML.pm.src: List of element names that close foreign content
765            insertion mode is added (HTML5 revisions 1412 and 1418).
766    
767    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
768    
769            * HTML.pm.src: Support for |mglyph| and |malignmark|
770            elements (HTML5 revision 1410).
771    
772    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
773    
774            * HTML.pm.src: Support for new long MathML entities (HTML5
775            revision 1406).
776    
777    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
778    
779            * HTML.pm.src: CDATA section support for MathML and SVG
780            elements (HTML5 revisions 1404 and 1420).
781    
782    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
783    
784            * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
785            elements (HTML5 revision 1404).  Unused !!!macro definitions
786            are removed.
787    
788    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
789    
790            * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
791            are raised is changed (HTML5 revision 1404).
792    
793    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
794    
795            * HTML.pm.src: Category sets that are no longer used are removed.
796    
797    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
798    
799            * HTML.pm.src: The ->[1] property of stack entries is now
800            replaced by constants representing element category.
801    
802    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
803    
804            * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
805            for error reporting.  (This is a preparation for using constant
806            value for ->[1].)
807    
808    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
809    
810            * RDFXML.pm: Typo fixed.
811    
812    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
813    
814            * ContentChecker.pm: |fact_level| is now treated
815            the same as |must_level|, i.e. level = |m|.
816            (check_element): Make list of URIs in the DOM.
817    
818    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
819    
820            * RDFXML.pm: Language accessor implemented.
821            Local (null-namespace) attribute support.
822    
823    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
824    
825            * RDFXML.pm: Factored out ID checking code.
826    
827    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
828    
829            * RDFXML.pm: TODO items noted.  Validation of ID
830            and URI attributes is implemented.  Warn if unknown
831            value is used in rdf:parseType="" attribute.
832    
833            * URIChecker.pm (check_rdf_uri_reference): New function.
834    
835    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
836    
837            * RDFXML.pm: bnodeid implemented.  Relative references
838            are now resolved.
839    
840    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
841    
842            * ContentChecker.pm: RDF reification implemented.
843    
844            * RDFXML.pm: undef vs false bug fixed.
845            Reification implemented.
846    
847    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
848    
849            * RDFXML.pm: s/id/ID/ for attribute name.
850            The |node| arguments are added for |ontriple| calls.
851            Too many "attribute not allowed" errors were raised.
852    
853            * ContentChecker.pm: Initial experimental support for rdf:RDF
854            element.
855    
856    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
857    
858            * RDFXML.pm: New module.
859    
860    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
861    
862            * HTML.pm.src (set_inner_html): Line/column number
863            code was still the old one.
864    
865    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
866    
867            * HTML.pm.src: Better line/column reporting for "duplicate attribute"
868            errors.  Line/column markings for DOCTYPE, comment, and
869            character tokens are reintroduced; otherwise,
870            error location for "not HTML5" error and errors
871            for implied elements are not attached.
872    
873    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
874    
875            * HTML.pm.src: Set line/column numbers to attributes.
876    
877            * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
878            (value): Setter implemented.
879    
880            * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
881    
882    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
883    
884            * HTML.pm.src: Unused line/column markings are removed.
885    
886    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
887    
888            * HTML.pm.src (_get_next_token): Remove |first_start_tag|
889            flag, which is no longer used.
890    
891    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
892    
893            * HTML.pm.src: Set line/column information to element nodes.
894    
895            * mkhtmlparser.pl (!!!create-element, !!!insert-element,
896            and !!!insert-element-t): Set line/column information to
897            element nodes.
898    
899    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
900    
901            * HTML.pm.src (_get_next_token): The first "<" character
902            in "<?", "<>", or "</>" should be the error point.
903    
904    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
905    
906            * HTML.pm.src: Some more fixes on error position reporting.
907    
908    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
909    
910            * HTML.pm.src: Token-level precise error reporting.
911    
912    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
913    
914            * HTML.pm.src: Preparation for more precise error point
915            reporting.
916    
917    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
918    
919            * HTML.pm.src: Error type revised.
920    
921    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
922    
923            * HTML.pm.src: Similar code is merged together, again.
924    
925    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
926    
927            * HTML.pm.src: Similar code is merged together.
928    
929    2008-03-10  Wakaba  <wakaba@suika.fam.cx>
930    
931            * mkhtmlparser.pl: Set "level" parameter to parse errors.
932    
933            * HTML.pm.src: Code refined.
934    
935    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
936    
937            * HTML.pm.src: |</body>| treatment has been changed (HTML5
938            revision 1348).  Note that I really don't know whether this makes
939            any difference in the black-box behavior of the parser.
940    
941    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
942    
943            * HTML.pm.src: New end-of-file token implementation (HTML5
944            revision 1348).
945    
946    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
947    
948            * HTML.pm.src: |applet| support (HTML5 revision 1347).
949    
950    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
951    
952            * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
953    
954  2008-03-09  Wakaba  <wakaba@suika.fam.cx>
955    
956          * HTML.pm.src: Support for |<input>| in the "in select" insertion
