/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.166 by wakaba, Sat Mar 8 02:48:19 2008 UTC revision 1.306 by wakaba, Sun Sep 14 09:05:54 2008 UTC
# Line 1  Line 1 
1    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src: White-space-leaded non-white-space character
4            tokens in "before head insertion mode" was not
5            correctly handled.
6            (set_inner_html): Reimplemented using CharString decodehandle
7            class.  Support for $get_wrapper argument.  Support
8            for |{read_until}| feature.
9    
10    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
11    
12            * HTML.pm.src: Make a "bare ero" error for unknown
13            entities point the "&" character.
14    
15    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
16    
17            * HTML.pm.src: It turns out that U+FFFD don't have to
18            be added to the list of excluded characters.
19    
20    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
21    
22            * HTML.pm.src ($char_onerror): Have character decoder's |line|
23            and |column| a higher priority than the one set by the
24            tokenizer's input handler.
25            ($self->{read_until}): Exclude U+FFFD (but this might
26            not be necessary, since now we do line/column fixup in
27            the character decode handle).
28    
29    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
30    
31            * HTML.pm.src: Use |{read_until}| where possible.
32    
33    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
34    
35            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
36            and |manakai_getc_until| to |manakai_read_until| to
37            reduce the number of string copies.
38    
39    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
40    
41            * HTML.pm.src (parse_char_string): Use newly created
42            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
43            standard feature to |open| a string as a filehandle,
44            since Perl's string filehandle seems not supporting |ungetc|
45            method correctly.
46            (parse_char_stream): Define |{getc_until}| method.
47            (DATA_STATE): Experimental support for |getc_until| feature.
48    
49    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
50    
51            * HTML.pm.src: Check points added to newly added branches.
52    
53    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
54    
55            * HTML.pm.src: Remove |{char}|, which is no longer used.
56            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
57            and replaced by |{prev_state}|.
58    
59            * mkhtmlparser.pl: Remove |{char}| feature.
60            Remove |!!!back-next-input-character;| macro.
61    
62    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
63    
64            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
65            entity related tokenizer states in favor of new states
66            implementing the consume character reference algorithm.
67    
68    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
69    
70            * HTML.pm.src: "Consume a character reference" algorithm is
71            now implemented as a tokenizer's state, rather than
72            a method, with minimum changes (more changes will
73            be made, in due course).  "Bogus comment state"'s inner
74            loop gets removed.
75    
76    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
77    
78            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
79            into their own tokenizer states.
80    
81    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
82    
83            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
84            is split into three states.
85    
86    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
87    
88            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
89            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
90            no longer does the tokenizer have to push back next input
91            characters in those states.
92    
93    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
94    
95            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
96            into four states so that no longer does the tokenizer have to push
97            back next input characters in that state.
98    
99    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
100    
101            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
102            which can be used to insert some wrapper between the character
103            stream handle and the tokenizer.  (It is currently not supported
104            for |set_inner_html| for |Element|s).
105    
106    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
107    
108            * HTML.pm.src: Ignore punctuations in charset names.
109    
110    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
111    
112            * ContentChecker.pm: Support for charset-layer error levels.
113    
114            * HTML.pm.src: Don't specify |text| argument for the
115            |chardecode:fallback| error, since it is not the encoding
116            being used alternatively.
117    
118    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
119    
120            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
121    
122    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
123    
124            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
125    
126    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
127    
128            * HTML.pm.src: Bug fix and sync with the spec with regard
129            to after after frameset insertion mode processing (HTML5
130            revision 1909).  Note that the implementation was wrong
131            per the old spec before the r1909 changes.
132    
133    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
134    
135            * HTMLTable.pm: scope=auto algorithm fix synced with the
136            spec (HTML5 revision 2093).
137            ($process_row): Algorithm step numbers synced with the
138            spec (HTML5 revision 2092).
139    
140    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
141    
142            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
143            revision 2094).
144    
145    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
146    
147            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
148    
149    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
150    
151            * HTML.pm.src: '"' and "'" at the end of attribute
152            name (after another attribute) now raise parse error (HTML5
153            revision 2123).  Empty unquoted attribute values are no
154            longer allowed (HTML5 revision 2122).
155    
156    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
157    
158            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
159            revision 2130).
160    
161    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
162    
163            * ContentChecker.pm: |xml:lang| attribute value must be same
164            as |lang| attribute value for HTML elements (HTML5 revision 2062
165            and so on).
166    
167    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
168    
169            * ContentChecker.pm: Error level definition for |xml_id_error|
170            was missing.
171    
172            * URIChecker.pm: The end of the URL should be marked as the
173            error location for an empty path error.  The position
174            between the userinfo and the port components should be
175            marked as the error location for an empty host error.
176    
177    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
178    
179            * URIChecker.pm: Set parameters representing where in the
180            value the error occurs for errors.  Report unknown
181            address format error in warning level, since address
182            formats are rarely added.  Path segments starting with "/.."
183            were misinterpreted as a dot-segment.
184    
185    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
186    
187            * URIChecker.pm (check_iri_reference): Requires
188            |Message::DOM::DOMImplementation|.
189    
190    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
191    
192            * IMTChecker.pm: Updated for the new error reporting architecture.
193    
194            * ContentChecker.pm: Error levels for IMTs are added.
195    
196    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
197    
198            * H2H.pm (_shift_token): Support for unquoted HTML attribute
199            values.
200    
201    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
202    
203            * CacheManifest.pm: Support for new style of error
204            reports.
205    
206            * HTML.pm.src: Set line=1, column=1 to the document node.
207    
208    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
209    
210            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
211            and URL checkers.  Support for more error levels for bogus
212            langauge tag and URL "standards".
213    
214            * LangTag.pm, URIChecker.pm: Support for new style error
215            level reporting.
216    
217    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
218    
219            * ContentChecker.pm: Support for RDF/XML error levels.
220    
221            * HTMLTable.pm, RDFXML.pm: Support for new style of error level
222            specifying.  Error types are revised.
223    
224    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
225    
226            * ContentChecker.pm: All error reporting method calls are
227            renewed.
228    
229    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
230    
231            * HTML.pm.src: All error type names and "text" parameters
232            are revised.  Use new style for "level" specification.
233    
234            * mkhtmlparser.pl: Use new style for "level" specification.
235    
236    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
237    
238            * WebIDL.pm (parse_char_string): Simplified error
239            reporting process for broken ignored valuetype definition.
240            (Valuetype idl_text): Support for special "DOMString" name.
241    
242    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
243    
244            * WebIDL.pm ($get_scoped_name): Append "::::" if the last
245            terminal of the ScopedName is "DOMString", such that whether
246            the last part of the scoped name is "DOMString" or "_DOMString"
247            later.  It is necessary to determine whether a |typedef|
248            definition should be ignored or not.
249            (parse_char_string): Unescape the identifier of
250            exception members.
251            ($resolve): Return undef for builtin types and sequence<T>
252            types (we might not have to do this, however...).
253            (check): Support checking for Exceptions, Valuetypes,
254            and Typedefs.
255            ($serialize_type): Support for "DOMString::::" syntax.
256            (Typedef idl_text): Output Type as "DOMString" if it
257            is really "DOMString" (i.e. its internal representation
258            is "::DOMString::").
259    
260    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
261    
262            * WebIDL.pm ($resolve): New code, based on resolve code
263            for constant types in the |check| method.
264            (check): Support for checking of attributes, operations, and
265            arguments.
266            (Attribute/Operation idl_text): Exception names in getraises,
267            setraises, and raises clauses is serizlied by |$serialize_type|
268            code.
269    
270    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
271    
272            * WebIDL.pm ($integer): Order of selections are changed to match
273            hexadecimal numbers (the original pattern, taken from the spec,
274            was not work for hexadecimal numbers, because the "0" prefix
275            matches to the [0-7]* part (as an empty string) and therefore
276            it does not match with remaining "x..." part of a "0x..." integer
277            literal.
278            ($get_type): It now returns a string, not an array reference,
279            for regular types and |sequence| types (i.e. it in any case
280            returns a string).
281            ($get_next_token): The second item in the array that represents
282            a integer or float token is now a Perl number value, not the
283            original string representation of the number.
284            (check): Support for const value consistency checking.
285            No extended attribute is defined for constants.
286            (Node subclasses): Use simple strings rather than array references
287            for default data type values.
288            ($serialize_type): Type values are now simple strings.
289            (value): If the new attribute value is a false value, then
290            a FALSE value is set to the attribute.
291    
292    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
293    
294            * WebIDL.pm ($get_scoped_name): Now scoped names are stored
295            in its stringified format ("scoped name" as defined in the
296            spec).  Note that future version of this module should not use
297            array references for type values and the |type_text| attribute
298            should be made obsolete.
299            (parse_char_string): Unescape attribute names.
300            (check): Support for checking of whether inherited interfaces
301            are actually defined or not.  Support for checking of whether
302            interface member identifiers are duplicated or not.
303            ($serialize_type): Scoped names are returned as is.  A future
304            version of this code should escape identifiers other than "DOMString",
305            otherwise the idl_text would be non-conforming.
306    
307    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
308    
309            * WebIDL.pm (parse_char_string): Set line/column numbers
310            to generated nodes.  Unescape identifiers.  Extended attributes
311            for Definition's were ignored.
312            (append_child): Set |parent_node| attribute.
313            (parent_node): New attribute.
314            (check): Support interface/exception members.  Support
315            extended attributes.  Support definition identifier uniqueness
316            constraint.
317            (qualified_name): New attribute.
318            (Interface/Exception idl_text): Extended attributes were
319            not prepended to the returned text.
320    
321    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
322    
323            * WebIDL.pm (parse_char_string): Set line/column numbers
324            to interface object experimentally.  s/shift/pop/g, shift
325            would make things wrong.  Support for interface forward
326            declarations was missing.  Broken interface declarations
327            with no block were not ignored entirely.
328            (Whatpm::WebIDL::Node): New abstract class.  This class
329            makes things easier.
330            (child_nodes): New attribute.  Unlike DOM's attribute with
331            same name, this attribute returns a dead list of nodes for
332            simplicity.
333            (get_user_data, set_user_data): New methods.
334            (Module idl_text): A SPACE character should be inserted
335            before the |{| character.
336            (Interface idl_text): Support for interface forward declarations.
337            (is_forward_declaration): New attribute.
338    
339    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
340    
341            * WebIDL.pm (type_text): Better serializer.
342    
343    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
344    
345            * WebIDL.pm: Revise forward-compatible parsing so that
346            it now can handle broken extended attributes and as such.
347    
348    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
349    
350            * WebIDL.pm: Real support for extended attributes.
351            Support for extended attributes with arguments.
352    
353    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
354    
355            * WebIDL.pm: Support for |exception| syntax.
356            (Interface->idl_text): Tentative support for inheritances.
357    
358    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
359    
360            * WebIDL.pm: Hierarchical scoped name support was broken.
361            Support for raises, setraises, and getraises syntaxes.
362    
363    2008-07-18  Wakaba  <wakaba@suika.fam.cx>
364    
365            * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
366            proper support for types, extended attributes, and exceptions yet).
367            WebIDL parser, version 1 (no support for exceptions yet,
368            no proper support for extended attributes yet).
369    
370    2008-07-09  Wakaba  <wakaba@suika.fam.cx>
371    
372            * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
373    
374    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
375    
376            * WebIDL.pm: Support for valuetype and const.
377    
378    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
379            
380            * WebIDL.pm: New module.
381    
382    2008-06-15  Wakaba  <wakaba@suika.fam.cx>
383    
384            * Makefile (Entities.html): URI changed.
385    
386    2008-06-08  Wakaba  <wakaba@suika.fam.cx>
387    
388            * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
389    
390    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
391    
392            * HTML.pm.src (_get_next_token): A parse error was missing.
393    
394    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
395    
396            * mklinktypelist.pl: rel=contact is no longer part of the HTML5
397            spec (commented out). (HTML5 revision 1711).
398    
399    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
400    
401            * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
402    
403            * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
404            as UTF-16 (HTML5 revision 1701).
405    
406    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
407    
408            * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
409            1692).
410    
411    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
412    
413            * HTML.pm.src: The secondary insertion mode used when switching
414            to foreign content is the "in body" insertion mode (HTML5 revision
415            1696).
416    
417    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
418    
419            * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
420            revision 1697).
421    
422    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
423    
424            * HTML.pm.src: Support for end-of-file token in foreign content
425            insertion mode (HTML5 revision 1693).  Update SVG camelCase
426            attribute list (HTML5 revision 1700).  <textarea> closes
427            </select> (HTML5 revision 1699).  More start tags close in
428            foreign content insertion mode (HTML5 revision 1698).
429    
430    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
431    
432            * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
433    
434    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
435    
436            * HTML.pm.src: More robust charset parameter detection (HTML5
437            revision 1674).
438    
439    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
440    
441            * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
442            revision 1676).
443    
444    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
445    
446            * HTML.pm.src: Ignore language part of public identifiers for
447            quriks mode detection (HTML5 revision 1679).
448    
449    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
450    
451            * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
452            revision 1685).
453    
454    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
455    
456            * HTML.pm.src: Support for EOF in new states for tags (HTML5
457            revision 1684).
458    
459    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
460    
461            * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
462            work (HTML5 revision 1690).
463    
464    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
465    
466            * HTML.pm.src (_tree_construction_main): Change handling of
467            end tags in head insertion modes (HTML5 revision 1686).
468            (parse_char_string): Bug fix for non-utf8 character string handlings.
469            (parse_char_stream): |ungetc| does not work well for this context.
470    
471    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
472    
473            * HTML.pm.src (parse_byte_string): Redefined to invoke
474            |parse_byte_stream|.
475            (parse_byte_stream): New method.
476    
477    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
478    
479            * HTML.pm.src (parse_byte_string): Fix the column number reported
480            by encoding layer error reporter.
481    
482    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
483    
484            * HTML.pm.src (parse_byte_string): Use streaming decoder
485            rather than converting the whole byte string and then parsing.
486            Propagate errors in character encoding layer.
487            (get_next_token): Precise error reporting for |bare stago| error.
488    
489    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
490    
491            * HTML.pm.src (parse_char_stream): New method.
492            (parse_char_string): This method is now defined as an invocation
493            of the |parse_char_stream| method.
494    
495    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
496    
497            * HTML.pm.src (parse_byte_string): Report various status
498            of the sniffing as info-level errors.  Support for new
499            decoding framework in parser resestting.
500            (new): Various default error levels were not set.
501    
502    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
503    
504            * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
505            algorithm, except for the actual sniffing, is implemented
506            with new framework with Message::Charset::Info.
507    
508    2008-05-16  Wakaba  <wakaba@suika.fam.cx>
509    
510            * CacheManifest.pm (_parse): Drop fragment identifiers from
511            URIs in fallback section (HTML5 revision 1596).
512    
513    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
514    
515            * Makefile (Entities.html): URI has changed.
516    
517    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
518    
519            * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
520            1553).
521    
522    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
523    
524            * ContentChecker.pm: Noted that those returned in |table| are
525            no longer table elements, but table objects returned
526            by Whatpm::HTMLTable.
527    
528            * HTMLTable.pm (form_table): Return table element node
529            as |$table->{element}|.
530            (assign_header): Support for the |headers=""| attribute.
531    
532    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
533    
534            * HTMLTable.pm (assign_header): New function; first version
535            with no support for headers="".
536            (form_table): Include table width and height to the returned
537            table object for covenience.  Indexing in column assignement was wrong.
538            Set whether a data cell is empty or not for convenience.
539    
540    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
541    
542            * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
543    
544    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
545    
546            * HTMLTable.pm: How table model errors are detected is
547            changed (HTML5 revision 1387).
548    
549    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
550    
551            * HTMLTable.pm: The algorithm now moves |tfoot| elements
552            to the end of the table (HTML5 revision 1380).
553    
554    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
555    
556            * HTMLTable.pm: The algorithm is now 0-based indexing, instead
557            of 1-based (HTML5 revision 1376).
558    
559    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
560    
561            * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
562            and "Content-Encoding" no longer prevent sniffing (HTML5
563            revision 1288).
564    
565    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
566    
567            * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
568            1282).
569    
570    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
571    
572            * ContentChecker.pm: Support for global attributes.
573            Status of XML specs are added.
574    
575    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
576    
577            * ContentChecker.pm (check_element): Support for |template=""|
578            and |ref=""| attribute (referent element type checking).
579    
580    2008-04-29  Wakaba  <wakaba@suika.fam.cx>
581    
582            * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
583            1500) is implemented (except for IDNA part and URI-scheme-specific
584            knowledge).  Line number counting was wrong for LF-only
585            documents.
586    
587    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
588    
589            * HTML.pm.src: Raise an parse error for any disallowed
590            character (HTML5 revision 1263).
591    
592    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
593    
594            * mkentitylist.pl: Support for new HTML5 entity table format (the
595            definition for |AElig;| was missing).
596    
597    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
598    
599            * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
600            name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
601    
602    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
603    
604            * HTML.pm.src: List of element names that close foreign content
605            insertion mode is added (HTML5 revisions 1412 and 1418).
606    
607    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
608    
609            * HTML.pm.src: Support for |mglyph| and |malignmark|
610            elements (HTML5 revision 1410).
611    
612    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
613    
614            * HTML.pm.src: Support for new long MathML entities (HTML5
615            revision 1406).
616    
617    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
618    
619            * HTML.pm.src: CDATA section support for MathML and SVG
620            elements (HTML5 revision 1404 and 1420).
621    
622    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
623    
624            * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
625            elements (HTML5 revision 1404).  Unused !!!macro definitions
626            are removed.
627    
628    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
629    
630            * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
631            are raised is changed (HTML5 revision 1404).
632    
633    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
634    
635            * HTML.pm.src: Category sets that are no longer used are removed.
636    
637    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
638    
639            * HTML.pm.src: The ->[1] property of stack entries are now
640            replaced by constants representing element category.
641    
642    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
643    
644            * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
645            for error reporting.  (This is a preparation for using constant
646            value for ->[1].)
647    
648    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
649    
650            * RDFXML.pm: Typo fixed.
651    
652    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
653    
654            * ContentChecker.pm: |fact_level| is now treated
655            as same as |must_level|, i.e. level = |m|.
656            (check_element): Make list of URIs in the DOM.
657    
658    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
659    
660            * RDFXML.pm: Language accessor implemented.
661            Local (null-namespace) attribute support.
662    
663    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
664    
665            * RDFXML.pm: Factored out ID checking code.
666    
667    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
668    
669            * RDFXML.pm: TODO items noted.  Validation of ID
670            and URI attributes is implemented.  Warn if unknown
671            value is used in rdf:parseType="" attribute.
672    
673            * URIChecker.pm (check_rdf_uri_reference): New function.
674    
675    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
676    
677            * RDFXML.pm: bnodeid implemented.  Relative references
678            are now resolved.
679    
680    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
681    
682            * ContentChecker.pm: RDF reification implemented.
683    
684            * RDFXML.pm: undef vs false bug fixed.
685            Reification implemented.
686    
687    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
688    
689            * RDFXML.pm: s/id/ID/ for attribute name.
690            The |node| arguments are added for |ontriple| calls.
691            Too many "attribute not allowed" errors were raised.
692    
693            * ContentChecker.pm: Initial experimental support for rdf:RDF
694            element.
695    
696    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
697    
698            * RDFXML.pm: New module.
699    
700    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
701    
702            * HTML.pm.src (set_inner_html): Line/column number
703            code was old one yet.
704    
705    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
706    
707            * HTML.pm.src: Better line/column reporting for "duplicate attribute"
708            errors.  Line/column markings for DOCTYPE, comment, and
709            character tokens are reintroduced; otherwise,
710            error location for "not HTML5" error and errors
711            for implied elements are not attached.
712    
713    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
714    
715            * HTML.pm.src: Set line/column numbers to attributes.
716    
717            * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
718            (value): Setter implemented.
719    
720            * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
721    
722    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
723    
724            * HTML.pm.src: Unused line/column markings are removed.
725    
726    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
727    
728            * HTML.pm.src (_get_next_token): Remove |first_start_tag|
729            flag, which is no longer used.
730    
731    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
732    
733            * HTML.pm.src: Set line/column information to element nodes.
734    
735            * mkhtmlparser.pl (!!!create-element, !!!insert-element,
736            and !!!insert-element-t): Set line/column information to
737            element nodes.
738    
739    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
740    
741            * HTML.pm.src (_get_next_token): The first "<" character
742            in "<?", "<>", or "</>" should be the error point.
743    
744    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
745    
746            * HTML.pm.src: Some more fixes on error position reporting.
747    
748    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
749    
750            * HTML.pm.src: Token-level precious error reporting.
751    
752    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
753    
754            * HTML.pm.src: Preparation for more precious error point
755            reporting.
756    
757    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
758    
759            * HTML.pm.src: Error type revised.
760    
761    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
762    
763            * HTML.pm.src: Similar codes are merged together, again.
764    
765    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
766    
767            * HTML.pm.src: Similar codes are merged together.
768    
769    2008-03-10  Wakaba  <wakaba@suika.fam.cx>
770    
771            * mkhtmlparser.pl: Set "level" parameter to parse errors.
772    
773            * HTML.pm.src: Code refined.
774    
775    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
776    
777            * HTML.pm.src: |</body>| treatement has been changed (HTML5
778            revision 1348).  Note that I really don't know this makes
779            any difference in the black-box behavior of the parser.
780    
781    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
782    
783            * HTML.pm.src: New end-of-file token implementation (HTML5
784            revision 1348).
785    
786    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
787    
788            * HTML.pm.src: |applet| support (HTML5 revision 1347).
789    
790    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
791    
792            * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
793    
794    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
795    
796            * HTML.pm.src: Support for |<input>| in the "in select" insertion
797            mode and support for the "in select in table" insertion mode (HTML5
798            revision 1342).
799    
800    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
801    
802            * HTML.pm.src: No foster parenting for <script> and <script>
803            in non-tainted <table>s (HTML5 revision 1336).
804    
805    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
806    
807            * HTML.pm.src: Ignore white space characters between <html>
808            and <head> (HTML5 revision 1332).
809    
810    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
811    
812            * HTML.pm.src: Treat <input type=hidden> as if it were a
813            white space (HTML5 revision 1331).
814    
815    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
816    
817            * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
818            element (HTML5 revision 1330).
819    
820    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
821    
822            * HTML.pm.src: <title> is always appended to the current
823            element (HTML5 revision 1328).
824    
825    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
826    
827            * HTML.pm.src: White space in tainted tables are moved
828            into foster parents (HTML5 revision 1326).
829    
830    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
831    
832            * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
833            revision 1321).
834    
835    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
836    
837            * HTML.pm.src: |</h/n/>| case code rearranged to align with
838            the spec (HTML5 revision 1320).  Note that we finally complete
839            all of HTML5 revision 1320 changes.
840    
841    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
842    
843            * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
844            tags (HTML5 revision 1320).
845    
846    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
847    
848            * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
849    
850    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
851    
852            * HTML.pm.src: A "generate implied end tags" code (t409.1)
853            could not be reached so that it is now removed (HTML5 revision
854            1320).
855    
856  2008-03-08  Wakaba  <wakaba@suika.fam.cx>  2008-03-08  Wakaba  <wakaba@suika.fam.cx>
857    
858          * HTML.pm.src: Code for the case of |</div>| and so on are revised          * HTML.pm.src: Code for the case of |</div>| and so on are revised

Legend:
Removed from v.1.166  
changed lines
  Added in v.1.306

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24