/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.78 by wakaba, Sat Jul 21 08:52:59 2007 UTC revision 1.284 by wakaba, Sat Aug 30 14:37:46 2008 UTC
# Line 1  Line 1 
1    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
4            revision 2094).
5    
6    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
7    
8            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
9    
10    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
11    
12            * HTML.pm.src: '"' and "'" at the end of attribute
13            name (after another attribute) now raise parse error (HTML5
14            revision 2123).  Empty unquoted attribute values are no
15            longer allowed (HTML5 revision 2122).
16    
17    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
18    
19            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
20            revision 2130).
21    
22    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
23    
24            * ContentChecker.pm: |xml:lang| attribute value must be same
25            as |lang| attribute value for HTML elements (HTML5 revision 2062
26            and so on).
27    
28    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
29    
30            * ContentChecker.pm: Error level definition for |xml_id_error|
31            was missing.
32    
33            * URIChecker.pm: The end of the URL should be marked as the
34            error location for an empty path error.  The position
35            between the userinfo and the port components should be
36            marked as the error location for an empty host error.
37    
38    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
39    
40            * URIChecker.pm: Set parameters representing where in the
41            value the error occurs for errors.  Report unknown
42            address format error in warning level, since address
43            formats are rarely added.  Path segments starting with "/.."
44            were misinterpreted as a dot-segment.
45    
46    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
47    
48            * URIChecker.pm (check_iri_reference): Requires
49            |Message::DOM::DOMImplementation|.
50    
51    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
52    
53            * IMTChecker.pm: Updated for the new error reporting architecture.
54    
55            * ContentChecker.pm: Error levels for IMTs are added.
56    
57    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
58    
59            * H2H.pm (_shift_token): Support for unquoted HTML attribute
60            values.
61    
62    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
63    
64            * CacheManifest.pm: Support for new style of error
65            reports.
66    
67            * HTML.pm.src: Set line=1, column=1 to the document node.
68    
69    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
70    
71            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
72            and URL checkers.  Support for more error levels for bogus
73            langauge tag and URL "standards".
74    
75            * LangTag.pm, URIChecker.pm: Support for new style error
76            level reporting.
77    
78    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
79    
80            * ContentChecker.pm: Support for RDF/XML error levels.
81    
82            * HTMLTable.pm, RDFXML.pm: Support for new style of error level
83            specifying.  Error types are revised.
84    
85    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
86    
87            * ContentChecker.pm: All error reporting method calls are
88            renewed.
89    
90    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
91    
92            * HTML.pm.src: All error type names and "text" parameters
93            are revised.  Use new style for "level" specification.
94    
95            * mkhtmlparser.pl: Use new style for "level" specification.
96    
97    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
98    
99            * WebIDL.pm (parse_char_string): Simplified error
100            reporting process for broken ignored valuetype definition.
101            (Valuetype idl_text): Support for special "DOMString" name.
102    
103    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
104    
105            * WebIDL.pm ($get_scoped_name): Append "::::" if the last
106            terminal of the ScopedName is "DOMString", such that whether
107            the last part of the scoped name is "DOMString" or "_DOMString"
108            later.  It is necessary to determine whether a |typedef|
109            definition should be ignored or not.
110            (parse_char_string): Unescape the identifier of
111            exception members.
112            ($resolve): Return undef for builtin types and sequence<T>
113            types (we might not have to do this, however...).
114            (check): Support checking for Exceptions, Valuetypes,
115            and Typedefs.
116            ($serialize_type): Support for "DOMString::::" syntax.
117            (Typedef idl_text): Output Type as "DOMString" if it
118            is really "DOMString" (i.e. its internal representation
119            is "::DOMString::").
120    
121    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
122    
123            * WebIDL.pm ($resolve): New code, based on resolve code
124            for constant types in the |check| method.
125            (check): Support for checking of attributes, operations, and
126            arguments.
127            (Attribute/Operation idl_text): Exception names in getraises,
128            setraises, and raises clauses is serizlied by |$serialize_type|
129            code.
130    
131    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
132    
133            * WebIDL.pm ($integer): Order of selections are changed to match
134            hexadecimal numbers (the original pattern, taken from the spec,
135            was not work for hexadecimal numbers, because the "0" prefix
136            matches to the [0-7]* part (as an empty string) and therefore
137            it does not match with remaining "x..." part of a "0x..." integer
138            literal.
139            ($get_type): It now returns a string, not an array reference,
140            for regular types and |sequence| types (i.e. it in any case
141            returns a string).
142            ($get_next_token): The second item in the array that represents
143            a integer or float token is now a Perl number value, not the
144            original string representation of the number.
145            (check): Support for const value consistency checking.
146            No extended attribute is defined for constants.
147            (Node subclasses): Use simple strings rather than array references
148            for default data type values.
149            ($serialize_type): Type values are now simple strings.
150            (value): If the new attribute value is a false value, then
151            a FALSE value is set to the attribute.
152    
153    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
154    
155            * WebIDL.pm ($get_scoped_name): Now scoped names are stored
156            in its stringified format ("scoped name" as defined in the
157            spec).  Note that future version of this module should not use
158            array references for type values and the |type_text| attribute
159            should be made obsolete.
160            (parse_char_string): Unescape attribute names.
161            (check): Support for checking of whether inherited interfaces
162            are actually defined or not.  Support for checking of whether
163            interface member identifiers are duplicated or not.
164            ($serialize_type): Scoped names are returned as is.  A future
165            version of this code should escape identifiers other than "DOMString",
166            otherwise the idl_text would be non-conforming.
167    
168    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
169    
170            * WebIDL.pm (parse_char_string): Set line/column numbers
171            to generated nodes.  Unescape identifiers.  Extended attributes
172            for Definition's were ignored.
173            (append_child): Set |parent_node| attribute.
174            (parent_node): New attribute.
175            (check): Support interface/exception members.  Support
176            extended attributes.  Support definition identifier uniqueness
177            constraint.
178            (qualified_name): New attribute.
179            (Interface/Exception idl_text): Extended attributes were
180            not prepended to the returned text.
181    
182    2008-08-02  Wakaba  <wakaba@suika.fam.cx>
183    
184            * WebIDL.pm (parse_char_string): Set line/column numbers
185            to interface object experimentally.  s/shift/pop/g, shift
186            would make things wrong.  Support for interface forward
187            declarations was missing.  Broken interface declarations
188            with no block were not ignored entirely.
189            (Whatpm::WebIDL::Node): New abstract class.  This class
190            makes things easier.
191            (child_nodes): New attribute.  Unlike DOM's attribute with
192            same name, this attribute returns a dead list of nodes for
193            simplicity.
194            (get_user_data, set_user_data): New methods.
195            (Module idl_text): A SPACE character should be inserted
196            before the |{| character.
197            (Interface idl_text): Support for interface forward declarations.
198            (is_forward_declaration): New attribute.
199    
200    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
201    
202            * WebIDL.pm (type_text): Better serializer.
203    
204    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
205    
206            * WebIDL.pm: Revise forward-compatible parsing so that
207            it now can handle broken extended attributes and as such.
208    
209    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
210    
211            * WebIDL.pm: Real support for extended attributes.
212            Support for extended attributes with arguments.
213    
214    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
215    
216            * WebIDL.pm: Support for |exception| syntax.
217            (Interface->idl_text): Tentative support for inheritances.
218    
219    2008-07-19  Wakaba  <wakaba@suika.fam.cx>
220    
221            * WebIDL.pm: Hierarchical scoped name support was broken.
222            Support for raises, setraises, and getraises syntaxes.
223    
224    2008-07-18  Wakaba  <wakaba@suika.fam.cx>
225    
226            * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
227            proper support for types, extended attributes, and exceptions yet).
228            WebIDL parser, version 1 (no support for exceptions yet,
229            no proper support for extended attributes yet).
230    
231    2008-07-09  Wakaba  <wakaba@suika.fam.cx>
232    
233            * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
234    
235    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
236    
237            * WebIDL.pm: Support for valuetype and const.
238    
239    2008-06-29  Wakaba  <wakaba@suika.fam.cx>
240            
241            * WebIDL.pm: New module.
242    
243    2008-06-15  Wakaba  <wakaba@suika.fam.cx>
244    
245            * Makefile (Entities.html): URI changed.
246    
247    2008-06-08  Wakaba  <wakaba@suika.fam.cx>
248    
249            * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
250    
251    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
252    
253            * HTML.pm.src (_get_next_token): A parse error was missing.
254    
255    2008-06-01  Wakaba  <wakaba@suika.fam.cx>
256    
257            * mklinktypelist.pl: rel=contact is no longer part of the HTML5
258            spec (commented out). (HTML5 revision 1711).
259    
260    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
261    
262            * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
263    
264            * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
265            as UTF-16 (HTML5 revision 1701).
266    
267    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
268    
269            * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
270            1692).
271    
272    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
273    
274            * HTML.pm.src: The secondary insertion mode used when switching
275            to foreign content is the "in body" insertion mode (HTML5 revision
276            1696).
277    
278    2008-05-25  Wakaba  <wakaba@suika.fam.cx>
279    
280            * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
281            revision 1697).
282    
283    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
284    
285            * HTML.pm.src: Support for end-of-file token in foreign content
286            insertion mode (HTML5 revision 1693).  Update SVG camelCase
287            attribute list (HTML5 revision 1700).  <textarea> closes
288            </select> (HTML5 revision 1699).  More start tags close in
289            foreign content insertion mode (HTML5 revision 1698).
290    
291    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
292    
293            * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
294    
295    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
296    
297            * HTML.pm.src: More robust charset parameter detection (HTML5
298            revision 1674).
299    
300    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
301    
302            * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
303            revision 1676).
304    
305    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
306    
307            * HTML.pm.src: Ignore language part of public identifiers for
308            quriks mode detection (HTML5 revision 1679).
309    
310    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
311    
312            * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
313            revision 1685).
314    
315    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
316    
317            * HTML.pm.src: Support for EOF in new states for tags (HTML5
318            revision 1684).
319    
320    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
321    
322            * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
323            work (HTML5 revision 1690).
324    
325    2008-05-24  Wakaba  <wakaba@suika.fam.cx>
326    
327            * HTML.pm.src (_tree_construction_main): Change handling of
328            end tags in head insertion modes (HTML5 revision 1686).
329            (parse_char_string): Bug fix for non-utf8 character string handlings.
330            (parse_char_stream): |ungetc| does not work well for this context.
331    
332    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
333    
334            * HTML.pm.src (parse_byte_string): Redefined to invoke
335            |parse_byte_stream|.
336            (parse_byte_stream): New method.
337    
338    2008-05-18  Wakaba  <wakaba@suika.fam.cx>
339    
340            * HTML.pm.src (parse_byte_string): Fix the column number reported
341            by encoding layer error reporter.
342    
343    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
344    
345            * HTML.pm.src (parse_byte_string): Use streaming decoder
346            rather than converting the whole byte string and then parsing.
347            Propagate errors in character encoding layer.
348            (get_next_token): Precise error reporting for |bare stago| error.
349    
350    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
351    
352            * HTML.pm.src (parse_char_stream): New method.
353            (parse_char_string): This method is now defined as an invocation
354            of the |parse_char_stream| method.
355    
356    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
357    
358            * HTML.pm.src (parse_byte_string): Report various status
359            of the sniffing as info-level errors.  Support for new
360            decoding framework in parser resestting.
361            (new): Various default error levels were not set.
362    
363    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
364    
365            * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
366            algorithm, except for the actual sniffing, is implemented
367            with new framework with Message::Charset::Info.
368    
369    2008-05-16  Wakaba  <wakaba@suika.fam.cx>
370    
371            * CacheManifest.pm (_parse): Drop fragment identifiers from
372            URIs in fallback section (HTML5 revision 1596).
373    
374    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
375    
376            * Makefile (Entities.html): URI has changed.
377    
378    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
379    
380            * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
381            1553).
382    
383    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
384    
385            * ContentChecker.pm: Noted that those returned in |table| are
386            no longer table elements, but table objects returned
387            by Whatpm::HTMLTable.
388    
389            * HTMLTable.pm (form_table): Return table element node
390            as |$table->{element}|.
391            (assign_header): Support for the |headers=""| attribute.
392    
393    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
394    
395            * HTMLTable.pm (assign_header): New function; first version
396            with no support for headers="".
397            (form_table): Include table width and height to the returned
398            table object for covenience.  Indexing in column assignement was wrong.
399            Set whether a data cell is empty or not for convenience.
400    
401    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
402    
403            * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
404    
405    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
406    
407            * HTMLTable.pm: How table model errors are detected is
408            changed (HTML5 revision 1387).
409    
410    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
411    
412            * HTMLTable.pm: The algorithm now moves |tfoot| elements
413            to the end of the table (HTML5 revision 1380).
414    
415    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
416    
417            * HTMLTable.pm: The algorithm is now 0-based indexing, instead
418            of 1-based (HTML5 revision 1376).
419    
420    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
421    
422            * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
423            and "Content-Encoding" no longer prevent sniffing (HTML5
424            revision 1288).
425    
426    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
427    
428            * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
429            1282).
430    
431    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
432    
433            * ContentChecker.pm: Support for global attributes.
434            Status of XML specs are added.
435    
436    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
437    
438            * ContentChecker.pm (check_element): Support for |template=""|
439            and |ref=""| attribute (referent element type checking).
440    
441    2008-04-29  Wakaba  <wakaba@suika.fam.cx>
442    
443            * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
444            1500) is implemented (except for IDNA part and URI-scheme-specific
445            knowledge).  Line number counting was wrong for LF-only
446            documents.
447    
448    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
449    
450            * HTML.pm.src: Raise an parse error for any disallowed
451            character (HTML5 revision 1263).
452    
453    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
454    
455            * mkentitylist.pl: Support for new HTML5 entity table format (the
456            definition for |AElig;| was missing).
457    
458    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
459    
460            * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
461            name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
462    
463    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
464    
465            * HTML.pm.src: List of element names that close foreign content
466            insertion mode is added (HTML5 revisions 1412 and 1418).
467    
468    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
469    
470            * HTML.pm.src: Support for |mglyph| and |malignmark|
471            elements (HTML5 revision 1410).
472    
473    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
474    
475            * HTML.pm.src: Support for new long MathML entities (HTML5
476            revision 1406).
477    
478    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
479    
480            * HTML.pm.src: CDATA section support for MathML and SVG
481            elements (HTML5 revision 1404 and 1420).
482    
483    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
484    
485            * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
486            elements (HTML5 revision 1404).  Unused !!!macro definitions
487            are removed.
488    
489    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
490    
491            * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
492            are raised is changed (HTML5 revision 1404).
493    
494    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
495    
496            * HTML.pm.src: Category sets that are no longer used are removed.
497    
498    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
499    
500            * HTML.pm.src: The ->[1] property of stack entries are now
501            replaced by constants representing element category.
502    
503    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
504    
505            * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
506            for error reporting.  (This is a preparation for using constant
507            value for ->[1].)
508    
509    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
510    
511            * RDFXML.pm: Typo fixed.
512    
513    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
514    
515            * ContentChecker.pm: |fact_level| is now treated
516            as same as |must_level|, i.e. level = |m|.
517            (check_element): Make list of URIs in the DOM.
518    
519    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
520    
521            * RDFXML.pm: Language accessor implemented.
522            Local (null-namespace) attribute support.
523    
524    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
525    
526            * RDFXML.pm: Factored out ID checking code.
527    
528    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
529    
530            * RDFXML.pm: TODO items noted.  Validation of ID
531            and URI attributes is implemented.  Warn if unknown
532            value is used in rdf:parseType="" attribute.
533    
534            * URIChecker.pm (check_rdf_uri_reference): New function.
535    
536    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
537    
538            * RDFXML.pm: bnodeid implemented.  Relative references
539            are now resolved.
540    
541    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
542    
543            * ContentChecker.pm: RDF reification implemented.
544    
545            * RDFXML.pm: undef vs false bug fixed.
546            Reification implemented.
547    
548    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
549    
550            * RDFXML.pm: s/id/ID/ for attribute name.
551            The |node| arguments are added for |ontriple| calls.
552            Too many "attribute not allowed" errors were raised.
553    
554            * ContentChecker.pm: Initial experimental support for rdf:RDF
555            element.
556    
557    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
558    
559            * RDFXML.pm: New module.
560    
561    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
562    
563            * HTML.pm.src (set_inner_html): Line/column number
564            code was old one yet.
565    
566    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
567    
568            * HTML.pm.src: Better line/column reporting for "duplicate attribute"
569            errors.  Line/column markings for DOCTYPE, comment, and
570            character tokens are reintroduced; otherwise,
571            error location for "not HTML5" error and errors
572            for implied elements are not attached.
573    
574    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
575    
576            * HTML.pm.src: Set line/column numbers to attributes.
577    
578            * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
579            (value): Setter implemented.
580    
581            * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
582    
583    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
584    
585            * HTML.pm.src: Unused line/column markings are removed.
586    
587    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
588    
589            * HTML.pm.src (_get_next_token): Remove |first_start_tag|
590            flag, which is no longer used.
591    
592    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
593    
594            * HTML.pm.src: Set line/column information to element nodes.
595    
596            * mkhtmlparser.pl (!!!create-element, !!!insert-element,
597            and !!!insert-element-t): Set line/column information to
598            element nodes.
599    
600    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
601    
602            * HTML.pm.src (_get_next_token): The first "<" character
603            in "<?", "<>", or "</>" should be the error point.
604    
605    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
606    
607            * HTML.pm.src: Some more fixes on error position reporting.
608    
609    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
610    
611            * HTML.pm.src: Token-level precious error reporting.
612    
613    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
614    
615            * HTML.pm.src: Preparation for more precious error point
616            reporting.
617    
618    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
619    
620            * HTML.pm.src: Error type revised.
621    
622    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
623    
624            * HTML.pm.src: Similar codes are merged together, again.
625    
626    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
627    
628            * HTML.pm.src: Similar codes are merged together.
629    
630    2008-03-10  Wakaba  <wakaba@suika.fam.cx>
631    
632            * mkhtmlparser.pl: Set "level" parameter to parse errors.
633    
634            * HTML.pm.src: Code refined.
635    
636    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
637    
638            * HTML.pm.src: |</body>| treatement has been changed (HTML5
639            revision 1348).  Note that I really don't know this makes
640            any difference in the black-box behavior of the parser.
641    
642    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
643    
644            * HTML.pm.src: New end-of-file token implementation (HTML5
645            revision 1348).
646    
647    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
648    
649            * HTML.pm.src: |applet| support (HTML5 revision 1347).
650    
651    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
652    
653            * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
654    
655    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
656    
657            * HTML.pm.src: Support for |<input>| in the "in select" insertion
658            mode and support for the "in select in table" insertion mode (HTML5
659            revision 1342).
660    
661    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
662    
663            * HTML.pm.src: No foster parenting for <script> and <script>
664            in non-tainted <table>s (HTML5 revision 1336).
665    
666    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
667    
668            * HTML.pm.src: Ignore white space characters between <html>
669            and <head> (HTML5 revision 1332).
670    
671    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
672    
673            * HTML.pm.src: Treat <input type=hidden> as if it were a
674            white space (HTML5 revision 1331).
675    
676    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
677    
678            * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
679            element (HTML5 revision 1330).
680    
681    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
682    
683            * HTML.pm.src: <title> is always appended to the current
684            element (HTML5 revision 1328).
685    
686    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
687    
688            * HTML.pm.src: White space in tainted tables are moved
689            into foster parents (HTML5 revision 1326).
690    
691    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
692    
693            * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
694            revision 1321).
695    
696    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
697    
698            * HTML.pm.src: |</h/n/>| case code rearranged to align with
699            the spec (HTML5 revision 1320).  Note that we finally complete
700            all of HTML5 revision 1320 changes.
701    
702    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
703    
704            * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
705            tags (HTML5 revision 1320).
706    
707    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
708    
709            * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
710    
711    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
712    
713            * HTML.pm.src: A "generate implied end tags" code (t409.1)
714            could not be reached so that it is now removed (HTML5 revision
715            1320).
716    
717    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
718    
719            * HTML.pm.src: Code for the case of |</div>| and so on are revised
720            to align with new spec text (HTML5 revision 1320).
721    
722    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
723    
724            * HTML.pm.src: Remove strange |if| condition; however, it should
725            have had no harm in theory.
726    
727    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
728    
729            * HTML.pm.src (_tree_construction_main): '</p>' in body
730            case is split from other end tags for the preparation
731            of implementing HTML revision 1320.
732    
733    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
734    
735            * HTML.pm.src: Simplified "generate implied end tag" (HTML5
736            revision 1320).
737    
738    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
739    
740            * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
741            and "div" (HTML5 revision 1318).  Add comments to where
742            |form| pointer association codes should be inserted (HTML5 revision
743            1319).
744    
745    2008-03-06  Wakaba  <wakaba@suika.fam.cx>
746    
747            * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
748    
749    2008-03-05  Wakaba  <wakaba@suika.fam.cx>
750    
751            * HTML.pm.src: Since the case t268 should never be reached (no
752            other token type, there are), it is replaced by a |die| statement.
753    
754    2008-03-05  Wakaba  <wakaba@suika.fam.cx>
755    
756            * HTML.pm.src: Typo fixed.
757    
758    2008-03-04  Wakaba  <wakaba@suika.fam.cx>
759    
760            * HTML.pm.src (_tree_construction_initial): Some limited quirks
761            doctypes were not uppercased for comparison.
762    
763    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
764    
765            * HTML.pm.src (tree construction and set_inner_html): Checkpoints
766            are added.
767    
768    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
769    
770            * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
771            are set.  Cases that are unlikely reached are noted as so.
772    
773    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
774    
775            * HTML.pm.src: Checkpoints for debugging are added.
776    
777            * mkhtmlparser.pl: Support for |!!!cp| syntax.
778    
779    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
780    
781            * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
782            for simplicity.
783    
784    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
785    
786            * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
787            revision 1307).
788    
789    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
790    
791            * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
792            is set are changed (HTML5 revision 1305).
793    
794    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
795    
796            * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
797            1292).  Entities are not parsed in comment-like part in RCDATA
798            elements (HTML5 revision 1294).  Allow bare & at the end
799            of attribute value literals (HTML5 revision 1296).  More
800            quirks mode doctypes (HTML5 revision 1302).  Requires spaces
801            between attributes and ban attribute names or unquoted
802            attribute values containing single or double quotes (HTML5
803            revision 1303).
804    
805    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
806    
807            * ContentChecker.pm: Typo fixed.  Don't raise "character encoding"
808            and related errors unless it is an HTML document (though the spec
809            is unclear on whether it is applied to XHTML document).
810    
811            * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
812            are added.
813    
814    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
815    
816            * HTML.pm.src: s/local_name/manakai_local_name/g.
817    
818    2008-03-01  Wakaba  <wakaba@suika.fam.cx>
819    
820            * _NamedEntityList.pm: Updated (HTML5 revision 1286).
821    
822            * HTML.pm.src: |charset| in |content| attribute is
823            case-insensitive (HTML5 revision 1270).
824    
825    2008-02-26  Wakaba  <wakaba@suika.fam.cx>
826    
827            * ContentChecker.pm: New status constants are added.
828            ($ElementDefault): |status| added.
829            (check_element): Err for non-standard or deprecated elements.
830            (_attr_status_info): For non-standard or deprecated attributes.
831    
832    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
833    
834            * ContentChecker.pm (_attr_status_info): New internal method.
835    
836    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
837    
838            * ContentChecker.pm (check_element): Element standardized
839            status information is now dispatched.
840    
841    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
842    
843            * ContentChecker.pm (check_element): Fix |del|-and-significant
844            problem by adding some more arguments.
845    
846    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
847    
848            * ContentChecker.pm (check_element): Use context of
849            container-for-the-purpose-of-content-model element (not transparent
850            element) for |check_child_element| calling and significant
851            text flag marking.  This reintroduces |<del>|-and-significant
852            problem again.
853    
854    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
855    
856            * ContentChecker.pm (check_element): Make semi-transparent
857            elements ignored for the purpose of phase changes in
858            content model checking.
859    
860    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
861    
862            * ContentChecker.pm (check_element): In-element state
863            was not properly managed for transparent cases.
864    
865    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
866    
867            * ContentChecker.pm (check_element): Support for |video|
868            and |audio| as semi-transparent elements.
869    
870    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
871    
872            * ContentChecker.pm ($HTMLSemiTransparentElements): New.
873            (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
874            Support for |html:object| as a semi-transparent element.
875    
876    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
877    
878            * ContentChecker.pm (check_element): The way to traverse
879            the tree is entirely revised to make it easier to track
880            the state of ancestors/descendants.  As a result of this
881            revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
882            support for content model checking for HTML elements |figure|,
883            |object|, |video|, and |audio| and checking for XML elements (and
884            some XMLNS checkings) are dropped for now.  They will be
885            reimplemented in due cource.
886    
887    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
888    
889            * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
890            token (HTML5 revision 1225).
891    
892    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
893    
894            * ContentChecker.pm ({unsupported_level}): New value.
895    
896            * HTML.pm.src: Save whether |meta| |content| attribute
897            contains character references or not.
898    
899    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
900    
901            * ContentChecker.pm (_get_children): (Incompleted) attempt to
902            imlement significant content checking for contents
903            with |del| elements.
904    
905    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
906    
907            * ContenteChecker.pm ($HTMLTransparentElements): More
908            elements are added.
909            (_get_children): HTML |object| elements are now semi-transparent.
910    
911            * NanoDOM.pm (manakai_html, manakai_head): New methods.
912    
913    2008-02-16  Wakaba  <wakaba@suika.fam.cx>
914    
915            * CacheManifest.pm: HTML5 revision 1211 implemented.
916    
917            * CacheManifest.pod: Updated.
918    
919    2008-02-10  Wakaba  <wakaba@suika.fam.cx>
920    
921            * ContentChecker.pm (check_document, check_element): Support
922            for second argument ($onsubdoc).
923            (_get_css_parser): Removed (now it is part of WDCC).
924    
925    2008-02-09  Wakaba  <wakaba@suika.fam.cx>
926    
927            * ContentChecker.pm (_get_css_parser): New.
928    
929    2007-11-25  Wakaba  <wakaba@suika.fam.cx>
930    
931            * ContentChecker.pm ($AnyChecker): Old way to add child elements
932            for checking had been used.
933    
934    2007-11-25  Wakaba  <wakaba@suika.fam.cx>
935    
936            * ContentChecker.pm (check_element): New todo item type |descendant|.
937    
938    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
939    
940            * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
941            specced in RFC 2046 and RFC 4288.
942            (application/atom+xml): Definition added.
943    
944    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
945    
946            * URIChecker.pm: Make RFC 3986 should-level errors
947            warnings (rather than SHOULD-level errors).
948    
949    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
950    
951            * NanoDOM.pm (get_user_data, set_user_data): New methods.
952    
953            * HTML.pm.src: A flag for character references in attribute
954            values are added.  Set |manakai_has_reference| user data
955            to |charset| attribute.
956    
957    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
958    
959            * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
960            attributes.
961    
962            * ContentChecker.pm (check_document): Warn if charset requirements
963            cannot be tested.
964    
965    2007-11-19  Wakaba  <wakaba@suika.fam.cx>
966    
967            * HTML.pm.src (parse_byte_string): Detect charset
968            by universalchardet if charset parameter is not specified.
969    
970            * Makefile (Charset-all, Charset-clean): New rules.
971    
972    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
973    
974            * ContentChecker.pm (check_document): Check the existence
975            of character encoding declaration and selection of encoding
976            for HTML document.
977    
978    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
979    
980            * ContentType.pm (get_sniffed_type): Return also the official
981            type in list context.
982    
983    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
984    
985            * ContentType.pm: Sniffing with leading white space
986            ignoring (HTML5 revisions 1013 and 1016).
987    
988    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
989    
990            * ContentType.pm: HTML5 revision 1013 changes, except for leading
991            white spaces, are implemented.
992    
993    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
994    
995            * HTML.pm.src (parse_char_string): Set |inner_encoding|
996            attribute if possible.
997    
998    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
999    
1000            * HTML.pm.src (parse_byte_string): New method.
1001            (parse_char_string): New alias for |parse_string|.
1002            (main phase): Invoking "change the encoding" algorithm if desired.
1003    
1004            * HTML.pod: Updated.
1005    
1006    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1007    
1008            * HTML.pod (get_inner_html): Removed.
1009    
1010            * Makefile (HTML-all, HTML-clean): New.
1011    
1012    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
1013    
1014            * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1015    
1016    2007-11-08  Wakaba  <wakaba@suika.fam.cx>
1017    
1018            * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1019    
1020    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1021    
1022            * Makefile: |CacheManifest.html| is added.
1023    
1024            * CacheManifest.pod: New file.
1025    
1026    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1027    
1028            * CacheManifest.pm: New module.
1029    
1030    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1031    
1032            * HTML.pm.src: Support for application cache selection algorithm
1033            callback.
1034    
1035    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
1036    
1037            * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1038    
1039    2007-10-17  Wakaba  <wakaba@suika.fam.cx>
1040    
1041            * Makefile (clean): New rule.
1042    
1043            * NanoDOM.pm (public_id, system_id): New attributes.
1044    
1045    2007-10-17  Wakaba  <wakaba@suika.fam.cx>
1046    
1047            * Makefile (CSS-all, CSS-clean, clean): New rules.
1048    
1049    2007-10-14  Wakaba  <wakaba@suika.fam.cx>
1050    
1051            * ContentChecker.pm (check_document): Support for
1052            new |is_xml_root| flag.
1053            (check_element): Support for new |pluses| state.
1054            (_add_pluses): New method.
1055            (_remove_minuses): Support for new |minus| item.
1056    
1057    2007-09-24  Wakaba  <wakaba@suika.fam.cx>
1058    
1059            * ContentChecker.pm: Raise specific error for invalid
1060            root element.
1061    
1062    2007-09-24  Wakaba  <wakaba@suika.fam.cx>
1063    
1064            * ContentChecker.pm: Set level values for later uses.
1065    
1066    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
1067    
1068            * ContentChecker.pm: Support for language tag validation.
1069    
1070    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
1071    
1072            * LangTag.pm (check_rfc3066_language_tag): New method.
1073    
1074    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
1075    
1076            * LangTag.pm: New module.
1077    
1078    2007-09-04  Wakaba  <wakaba@suika.fam.cx>
1079    
1080            * HTML.pm.src: Some error types were wrong.
1081    
1082    2007-08-17  Wakaba  <wakaba@suika.fam.cx>
1083    
1084            * CSS/: New directory.
1085    
1086    2007-08-17  Wakaba  <wakaba@suika.fam.cx>
1087    
1088            * ContentChecker.pm (_check_get_children): Support
1089            for |noscript| in |head|.
1090    
1091    2007-08-12  Wakaba  <wakaba@suika.fam.cx>
1092    
1093            * URI/: New directory.
1094    
1095    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1096    
1097            * HTML.pm.src: Tokenizer's states are now represented in
1098            number.
1099    
1100    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1101    
1102            * HTML.pm.src: |or|s for insertion modes are replaced
1103            by |&|s.
1104    
1105    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1106    
1107            * HTML.pm.src: Token types are now represented in number.
1108    
1109    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1110    
1111            * ContentType.pm (SEE ALSO): Updated.
1112    
1113            * HTML.pm.src: Insertion modes are now represented in number.
1114    
1115    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
1116    
1117            * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1118            999) is implemented.
1119    
1120    2007-08-08  Wakaba  <wakaba@suika.fam.cx>
1121    
1122            * ContentType.pm: Sniffing for |<script| (HTML5 revision
1123            983) is implemented.
1124    
1125    2007-08-06  Wakaba  <wakaba@suika.fam.cx>
1126    
1127            * ContentChecker.pod: New documentation.
1128    
1129            * Makefile: A rule for |ContentChecker.html| is added.
1130    
1131            * ContentChecker.pm: A pod "LICENSE" section is added.
1132    
1133            * NanoDOM.pm ($VERSION): New variable.
1134    
1135    2007-08-05  Wakaba  <wakaba@suika.fam.cx>
1136    
1137            * H2H.pm: |b|, |i|, and |sub| are added to the
1138            list of allowed HTML elements.
1139    
1140    2007-08-05  Wakaba  <wakaba@suika.fam.cx>
1141    
1142            * H2H.pm: |samp| is added to the list of allowed
1143            HTML elements.
1144    
1145            * URIChecker.pm (check_iri): New.
1146            (check_iri_reference): Error type for IRI reference
1147            syntax error is changed.
1148    
1149    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
1150    
1151            * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1152            is added.
1153            (check_document): Load appropriate module before validation.
1154    
1155    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
1156    
1157            * ContentChecker/: New directory.
1158    
1159    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
1160    
1161            * ContentChecker.pm: HTML |time| element is implemented.
1162    
1163            * HTMLTable.pm: Comments are updated as HTML5 is revised.
1164    
1165    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1166    
1167            * ContentChecker.pm (check_document): Return value
1168            even if no document element is found.
1169    
1170    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1171    
1172            * HTML.pm.src: |$in_body| is no longer a function.
1173    
1174    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1175    
1176            * HTML.pm.src: The |$in_body| code has been moved down.
1177    
1178    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1179    
1180            * HTML.pm.src: The "trailing end" insertion mode
1181            is split into "after html body" and "after html frameset"
1182            insertion modes.  Their codes are merged with "after body"
1183            and "after frameset" codes.  |$previous_insertion_mode|
1184            has been removed.  "after frameset" code is
1185            merged with "in frameset" code.
1186    
1187    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1188    
1189            * HTML.pm.src: The "before head" insertion mode is
1190            merged with the "in head" insertion mode.
1191    
1192    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1193    
1194            * HTML.pm.src: Cases in "in head" insertion mode are
1195            reorganized.
1196    
1197    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1198    
1199            * HTML.pm.src: Some cases in "in table" insertion mode
1200            are merged.
1201    
1202    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1203    
1204            * HTML.pm.src: The "in row" insertion mode is merged
1205            with "in table" insertion mode.
1206    
1207  2007-07-21  Wakaba  <wakaba@suika.fam.cx>  2007-07-21  Wakaba  <wakaba@suika.fam.cx>
1208    
1209          * HTML.pm.src: The "in table" and "in table body" insertion          * HTML.pm.src: The "in table" and "in table body" insertion

Legend:
Removed from v.1.78  
changed lines
  Added in v.1.284

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24