/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Contents of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.404 - (hide annotations) (download)
Sun Sep 6 23:32:06 2009 UTC (15 years, 2 months ago) by wakaba
Branch: MAIN
CVS Tags: HEAD
Changes since 1.403: +5 -0 lines
++ whatpm/t/ChangeLog	6 Sep 2009 23:31:19 -0000
2009-09-07  Wakaba  <wakaba@suika.fam.cx>

	* tree-test-1.dat: Added new test data on obsolete permitted
	DOCTYPEs (HTML5 revision 3378).

++ whatpm/Whatpm/ChangeLog	6 Sep 2009 23:31:49 -0000
2009-09-07  Wakaba  <wakaba@suika.fam.cx>

	* HTML.pm.src (_tree_construction_initial): Implemented "obsolete
	permitted DOCTYPEs" (HTML5 revision 3378).

1 wakaba 1.404 2009-09-07 Wakaba <wakaba@suika.fam.cx>
2    
3     * HTML.pm.src (_tree_construction_initial): Implemented "obsolete
4     permitted DOCTYPEs" (HTML5 revision 3378).
5    
6 wakaba 1.391 2009-09-06 Wakaba <wakaba@suika.fam.cx>
7    
8 wakaba 1.403 * HTML.pm.src: Implemented the remaining parts of the
9     "frameset-ok" flag (HTML5 revision 2703 and HTML5 revision 2987).
10    
11     2009-09-06 Wakaba <wakaba@suika.fam.cx>
12    
13 wakaba 1.402 * HTML.pm.src: Fixed typos on |frameset-ok| implementation.
14    
15     2009-09-06 Wakaba <wakaba@suika.fam.cx>
16    
17 wakaba 1.401 * HTML.pm.src: The first cut to implement the |frameset-ok| flag
18     concept (HTML5 revision 2703).
19    
20     2009-09-06 Wakaba <wakaba@suika.fam.cx>
21    
22 wakaba 1.400 * HTML.pm.src: |<hr/>| was incorrectly marked as an error.
23    
24     2009-09-06 Wakaba <wakaba@suika.fam.cx>
25    
26 wakaba 1.399 * HTML.pm.src: Changed the handling of </html> end tag of "in
27     body" insertion mode according to the latest spec definition.
28    
29     2009-09-06 Wakaba <wakaba@suika.fam.cx>
30    
31 wakaba 1.398 * HTML.pm.src: Rewrote end tag handling in |head| area (cf. HTML5
32     revision 2730, but it was entirely broken, maybe I missed some
33     spec changes before rev.2730).
34    
35     2009-09-06 Wakaba <wakaba@suika.fam.cx>
36    
37 wakaba 1.397 * HTML.pm.src: Added the |hgroup| element (HTML5 revision 3039 and
38     HTML5 revision 3040).
39    
40     2009-09-06 Wakaba <wakaba@suika.fam.cx>
41    
42 wakaba 1.396 * HTML.pm.src: Added |xmp| to the list of Special elements (HTML5
43     revision 3689). It should make no difference since the |xmp|
44     element has the PCDATA content and cannot be the non-bottommost
45     element in the stack of open elements.
46    
47     2009-09-06 Wakaba <wakaba@suika.fam.cx>
48    
49 wakaba 1.395 * HTML.pm.src: Set the |confidence| flag to |irrlevant| when the
50     fragment parsing algorithm is invoked (HTML5 revision 2699).
51    
52     2009-09-06 Wakaba <wakaba@suika.fam.cx>
53    
54 wakaba 1.394 * HTML.pm.src: Simplified foster parenting (HTML5 revision 2732).
55    
56     2009-09-06 Wakaba <wakaba@suika.fam.cx>
57    
58 wakaba 1.393 * HTML.pm.src: No longer does foster parenting |style|, |script|,
59     and |input type=hidden| elements (HTML5 revision 2988).
60    
61     2009-09-06 Wakaba <wakaba@suika.fam.cx>
62    
63 wakaba 1.392 * HTML.pm.src: Dropped the |eventsource| element (HTML5 revision
64     2870).
65    
66     2009-09-06 Wakaba <wakaba@suika.fam.cx>
67    
68 wakaba 1.391 * HTML.pm.src (_tree_construction_main): Treate |keygen| as a void
69     element (HTML5 revision 2960).
70    
71 wakaba 1.389 2009-09-05 Wakaba <wakaba@suika.fam.cx>
72    
73 wakaba 1.390 * HTML.pm.src: Moved the "in table text" insertion mode code up
74     such that it does also work as specced in HTML5 in the remaining
75     cases (HTML5 revision 3382).
76    
77     2009-09-05 Wakaba <wakaba@suika.fam.cx>
78    
79 wakaba 1.389 * HTML.pm.src: Implementing the "in table text" insertion mode,
80     first half (HTML5 revision 3382).
81    
82 wakaba 1.388 2009-08-30 Wakaba <wakaba@suika.fam.cx>
83    
84     * ContentChecker.pm: Allow |xml:lang| in HTML document if the
85     owner element is not an HTML element (HTML5 revision 3697).
86    
87 wakaba 1.387 2009-08-23 Wakaba <wakaba@suika.fam.cx>
88    
89     * ContentChecker.pm: Allow <input type=hidden> in context where
90     interactive content is disallowed (HTML5 revision 2392).
91    
92 wakaba 1.380 2009-08-16 Wakaba <wakaba@suika.fam.cx>
93    
94 wakaba 1.386 * HTML.pm.src: Dropped support for "XSLT-compat" and added
95     "about:legacy-compat" (HTML5 revision 2725).
96    
97     2009-08-16 Wakaba <wakaba@suika.fam.cx>
98    
99 wakaba 1.385 * HTML.pm.src: "<!DOCTYPE>" is now explicitly supported in the
100     spec (HTML5 revision 2700).
101    
102     2009-08-16 Wakaba <wakaba@suika.fam.cx>
103    
104 wakaba 1.384 * HTML.pm.src: Added support for <script> element in the "in
105     select" and "in select in table" insertion modes (HTML5 revision
106     2500).
107    
108     2009-08-16 Wakaba <wakaba@suika.fam.cx>
109    
110 wakaba 1.383 * HTML.pm.src: An <hn> start tag in an <hn> element should close
111     the earlier element (HTML5 revision 2501).
112    
113     2009-08-16 Wakaba <wakaba@suika.fam.cx>
114    
115 wakaba 1.382 * HTML.pm.src: The </br> end tags before the <body> start tag
116     should equally survive as the first br element child of the body
117     element (HTML5 revision 2586).
118    
119     2009-08-16 Wakaba <wakaba@suika.fam.cx>
120    
121 wakaba 1.381 * HTML.pm.src: <font> in SVG land is handled as an SVG element as
122     long as it did not have HTML attributes (HTML5 revision 2590).
123    
124     2009-08-16 Wakaba <wakaba@suika.fam.cx>
125    
126 wakaba 1.380 * HTML.pm.src (_tree_construction_main): Improved error reporting
127     on "select" start tag in "in select" and "in select in table"
128     parsing modes (c.f. HTML5 revision 3604).
129    
130 wakaba 1.378 2009-07-26 Wakaba <wakaba@suika.fam.cx>
131    
132 wakaba 1.379 * mkhtmlclass.pl: Made xmlns:xlink with wrong namespace URL
133     invalid (HTML5 revision 2701).
134    
135     2009-07-26 Wakaba <wakaba@suika.fam.cx>
136    
137 wakaba 1.378 * HTML.pm.src: </rt>, </rp>, </option>, </optgroup> become
138     optional before </body> (HTML5 revision 2806).
139    
140 wakaba 1.376 2009-07-25 Wakaba <wakaba@suika.fam.cx>
141    
142 wakaba 1.377 * HTML.pm.src: Marked where SVG |script| execution should be
143     implemented (HTML5 revision 2904).
144    
145     2009-07-25 Wakaba <wakaba@suika.fam.cx>
146    
147 wakaba 1.376 * HTML.pm.src (set_inner_html): The fragment parsing algorithm
148     inherits the document quirksness flag of the main document (HTML5
149     revision 2935).
150    
151 wakaba 1.375 2009-07-05 Wakaba <wakaba@suika.fam.cx>
152    
153     * HTML.pm.src (_tree_construction_main): Put |table| in |p| in
154     quirks mode (The Hixie Quirk, HTML5 revision 3132).
155    
156 wakaba 1.372 2009-07-03 Wakaba <wakaba@suika.fam.cx>
157    
158 wakaba 1.374 * HTML.pm.src: <kegten> breaks out of <select> (HTML5 revision
159     3163).
160    
161     2009-07-03 Wakaba <wakaba@suika.fam.cx>
162    
163 wakaba 1.373 * HTML.pm.src: Don't change encoding to UTF-8 if the input
164     encoding is UTF-16 (HTML5 revision 3204).
165    
166     2009-07-03 Wakaba <wakaba@suika.fam.cx>
167    
168 wakaba 1.372 * HTML.pm.src (parse_byte_stream): Don't change to another
169     encoding if the input is UTF-16 (HTML5 revision 3205).
170    
171 wakaba 1.371 2009-06-28 Wakaba <wakaba@suika.fam.cx>
172    
173     * HTML.pm.src: |isindex| should no longer generate |p| element
174     node (HTML5 revision 3236).
175    
176 wakaba 1.368 2008-12-12 Wakaba <wakaba@suika.fam.cx>
177    
178 wakaba 1.370 * ContentChecker.pm: Introduced new flag |has_autofocus|. Added
179     support for new |id_type| value of |any|, which means any kind of
180     element is allowed as a referenced element.
181    
182     2008-12-12 Wakaba <wakaba@suika.fam.cx>
183    
184 wakaba 1.369 * ContentChecker.pm: Introduced |check_attrs2| method for the ease
185     of defining codes for checking required attribute and as such.
186    
187     2008-12-12 Wakaba <wakaba@suika.fam.cx>
188    
189 wakaba 1.368 * IMTChecker.pm: Added more definitions for subtypes.
190    
191 wakaba 1.367 2008-12-11 Wakaba <wakaba@suika.fam.cx>
192    
193     * URIChecker.pm: Some of |pos_end| values were wrong.
194    
195 wakaba 1.364 2008-12-06 Wakaba <wakaba@suika.fam.cx>
196    
197 wakaba 1.366 * ContentChecker.pm (check_element): Added support for "no
198     referenced datalist" error.
199    
200     2008-12-06 Wakaba <wakaba@suika.fam.cx>
201    
202 wakaba 1.365 * URIChecker.pm: Bug fix: It did not work unless
203     Message::DOM::DOMImplementation has been |require|d.
204    
205     2008-12-06 Wakaba <wakaba@suika.fam.cx>
206    
207 wakaba 1.364 * NanoDOM.pm (document_uri): New attribute.
208    
209     * ContentChecker.pm: Don't use methods not implemented by NanoDOM.
210    
211 wakaba 1.363 2008-11-07 Wakaba <wakaba@suika.fam.cx>
212    
213     * NanoDOM.pm (text_content): Don't create a Text node if the new
214     value is empty.
215    
216 wakaba 1.362 2008-11-06 Wakaba <wakaba@suika.fam.cx>
217    
218     * SWML/: New directory.
219    
220 wakaba 1.361 2008-10-20 Wakaba <wakaba@suika.fam.cx>
221    
222     * NanoDOM.pm (specified, all_declarations_processed,
223     manakai_attribute_type): New attributes.
224    
225 wakaba 1.359 2008-10-19 Wakaba <wakaba@suika.fam.cx>
226    
227 wakaba 1.360 * NanoDOM.pm (Entity->new): Initialize ->child_nodes as an empty
228     array.
229    
230     2008-10-19 Wakaba <wakaba@suika.fam.cx>
231    
232 wakaba 1.359 * NanoDOM.pm (notation_name): New attribute.
233    
234 wakaba 1.357 2008-10-18 Wakaba <wakaba@suika.fam.cx>
235    
236 wakaba 1.359 * NanoDOM.pm (public_id, system_id): New attributes.a
237 wakaba 1.358
238     2008-10-18 Wakaba <wakaba@suika.fam.cx>
239    
240 wakaba 1.357 * NanoDOM.pm (text_content): Moved to Node from Element. Setter
241     implemented.
242     (allowed_tokens, default_type, declared_type): Implemented.
243    
244 wakaba 1.356 2008-10-17 Wakaba <wakaba@suika.fam.cx>
245    
246     * NanoDOM.pm (node_name): New attribute.
247     (ELEMENT_TYPE_DEFINITION_NODE, ATTRIBUTE_DEFINITION_NODE): New
248     constants.
249     (create_element_type_definition_node, create_attribute_definition,
250     create_notation, create_general_entity,
251     get_element_type_definition_node,
252     set_element_type_definition_node, get_general_entity_node,
253     set_general_entity_node, get_notation_node, set_notation_node,
254     get_attribute_definition_node, set_attribute_definition_node): New
255     methods.
256     (element_types, entities, notations, attribute_definitions): New
257     attributes.
258     (DocumentType): Support for child nodes, entities, notations, and
259     element types.
260     (Entity, Notation, ElementTypeDefinition, AttributeDefinition):
261     New classes.
262    
263     * Dumper.pm: Support for general entities, notations, element type
264     definitions, and attribute definitions.
265    
266 wakaba 1.355 2008-10-15 Wakaba <wakaba@suika.fam.cx>
267    
268     * NanoDOM.pm (create_processing_instruction): New method.
269     (xml_version, xml_encoding, xml_standalone): New attributes.
270     (ProcessingInstruction): New class.
271    
272 wakaba 1.352 2008-10-14 Wakaba <wakaba@suika.fam.cx>
273    
274 wakaba 1.354 * HTML.pm.src: Handling of end tags in the foreign content
275     insertion mode was partially wrong, because of wrong bit
276     operations.
277    
278     2008-10-14 Wakaba <wakaba@suika.fam.cx>
279    
280 wakaba 1.353 * NanoDOM.pm (dom_config): New attribute (do nothing), for
281     Whatpm::XML::Parser support.
282    
283     2008-10-14 Wakaba <wakaba@suika.fam.cx>
284    
285 wakaba 1.352 * Makefile: New rule to make HTML/Tokenizer.pm is added.
286    
287     * HTML.pm.src: Tokenizer part moved to another file.
288    
289 wakaba 1.349 2008-10-13 Wakaba <wakaba@suika.fam.cx>
290    
291 wakaba 1.351 * HTML.pm.src: Merge |DT_EL| and |DD_EL| as |DTDD_EL|.
292    
293     2008-10-13 Wakaba <wakaba@suika.fam.cx>
294    
295 wakaba 1.350 * HTML.pm.src: Element category constants redefined.
296    
297     2008-10-13 Wakaba <wakaba@suika.fam.cx>
298    
299 wakaba 1.349 * HTML.pm.src: Steps for CDATA/RCDATA elements in tree
300     construction stage synced with the spec (HTML5 revisions 2139 and
301     2302).
302    
303 wakaba 1.348 2008-10-07 Wakaba <wakaba@suika.fam.cx>
304    
305     * ContentChecker.pm: New error level "html5_fact" added, which
306     should be tentatively used until all of requirements are properly
307     specced as RFC 2119 "MUST" in HTML5.
308    
309 wakaba 1.346 2008-10-05 Wakaba <wakaba@suika.fam.cx>
310    
311 wakaba 1.347 * ContentChecker.pod: Note on internal flags is added.
312    
313     2008-10-05 Wakaba <wakaba@suika.fam.cx>
314    
315 wakaba 1.346 * HTML.pm.src: An AAA bug fixed.
316    
317 wakaba 1.336 2008-10-04 Wakaba <wakaba@suika.fam.cx>
318    
319 wakaba 1.345 * HTML.pm.src: If another node is inserted by the parser, don't
320     reuse existing Text node to append a character (HTML5 revision
321     2124).
322    
323     2008-10-04 Wakaba <wakaba@suika.fam.cx>
324    
325 wakaba 1.344 * HTML.pm.src: Support for <option> and <optgroup> in body (HTML5
326     revisions 1731 and 2128).
327    
328     2008-10-04 Wakaba <wakaba@suika.fam.cx>
329    
330 wakaba 1.343 * HTML.pm.src: Make <foreignObject> scoping (HTML5 revision 1837).
331     Support for end tags of camelCase SVG elements were broken. A
332     wrong error type text fixed.
333    
334     2008-10-04 Wakaba <wakaba@suika.fam.cx>
335    
336 wakaba 1.342 * HTML.pm.src: Drop redundant code (HTML5 revision 1731).
337    
338     2008-10-04 Wakaba <wakaba@suika.fam.cx>
339    
340 wakaba 1.341 * HTML.pm.src: Support for new definition of |param| and |source|
341     start tag parsing (HTML5 revision 1731).
342    
343     2008-10-04 Wakaba <wakaba@suika.fam.cx>
344    
345 wakaba 1.340 * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
346    
347     2008-10-04 Wakaba <wakaba@suika.fam.cx>
348    
349 wakaba 1.339 * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
350     revisions 1731 and 1831).
351    
352     2008-10-04 Wakaba <wakaba@suika.fam.cx>
353    
354 wakaba 1.338 * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
355     revisions 1731 and 1778). Support for the </sarcasm> end tag (HTML5
356     revision 1731).
357    
358     2008-10-04 Wakaba <wakaba@suika.fam.cx>
359    
360 wakaba 1.337 * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
361     revision 1731). End tags of |option| and |optgroup| elements are
362     now optional (HTML5 revision 1731).
363    
364     2008-10-04 Wakaba <wakaba@suika.fam.cx>
365    
366 wakaba 1.336 * HTML.pm.src: New "special" elements added to the list (HTML5
367     revision 1778). "strile" -> "strike".
368    
369 wakaba 1.335 2008-10-02 Wakaba <wakaba@suika.fam.cx>
370    
371     * ContentType.pm (get_sniffed_type): Support for the "better"
372     content sniffing (HTML5 revision 1927). In a case the official
373     type was not returned when the method is invoked in the list
374     context.
375    
376 wakaba 1.334 2008-09-22 Wakaba <wakaba@suika.fam.cx>
377    
378     * HTML.pm.src: Character references for non-space C0 characters,
379     including U+000B VT, DEL character, noncharacter code points, are
380     now converted to the U+FFFD character (cf. HTML5 revision 2138).
381    
382 wakaba 1.330 2008-09-21 Wakaba <wakaba@suika.fam.cx>
383    
384 wakaba 1.333 * ContentChecker.pm: |form=""| check support added.
385    
386     2008-09-21 Wakaba <wakaba@suika.fam.cx>
387    
388 wakaba 1.332 * ContentChecker.pm: |contextmenu| validness is now checked using
389     |id| and |id_type| properties, and |menu| property is removed.
390    
391     2008-09-21 Wakaba <wakaba@suika.fam.cx>
392    
393 wakaba 1.331 * ContentChecker.pm: Prepare for |form| |name| attribute's
394     duplication checking.
395    
396     2008-09-21 Wakaba <wakaba@suika.fam.cx>
397    
398 wakaba 1.330 * HTML.pm.src (parse_byte_stream): Support (or non-support) for
399     unsupported charset="" parameter value (HTML5 revision 2131).
400    
401 wakaba 1.325 2008-09-20 Wakaba <wakaba@suika.fam.cx>
402    
403 wakaba 1.329 * HTML.pm.src: Reminding places where U+000B is allowed as a space
404     character is fixed (cf. HTML5 revision 1738).
405    
406     * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
407     space characters (HTML5 revision 1738).
408    
409     2008-09-20 Wakaba <wakaba@suika.fam.cx>
410    
411 wakaba 1.328 * HTML.pm.src: The "anything else" case for the "after after body"
412     insertion mode was not updated to swtich to the "in body"
413     insertion mode. U+000B is no longer a space character for the
414     purpose of tree construction phase (HTML5 revision 1738).
415    
416     2008-09-20 Wakaba <wakaba@suika.fam.cx>
417    
418 wakaba 1.327 * HTML.pm.src: U+000B is no longer a space character (HTML5
419     revision 1738).
420    
421     2008-09-20 Wakaba <wakaba@suika.fam.cx>
422    
423 wakaba 1.326 * ContentType.pm: 0x0B is no longer a space character (HTML5
424     revision 1738).
425    
426     * HTML.pm.src: U+000B is no longer a space character for the
427     algorithm for extracting an encoding from a Content-Type (HTML5
428     revision 1738).
429    
430     2008-09-20 Wakaba <wakaba@suika.fam.cx>
431    
432 wakaba 1.325 * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
433    
434 wakaba 1.320 2008-09-18 Wakaba <wakaba@suika.fam.cx>
435    
436 wakaba 1.324 * LangTag.pm: Add checks for remaining requirements from RFC 4646.
437    
438     * mklangreg.pl: Sort 'Prefix' values by their length, to ease
439     matching.
440    
441     2008-09-18 Wakaba <wakaba@suika.fam.cx>
442    
443 wakaba 1.323 * LangTag.pm: Warn for private use language subtags. Error level
444     typos fixed. Support for Suppress-Script field.
445    
446     * mklangreg.pl: Support for dumping of nested structure.
447    
448     2008-09-18 Wakaba <wakaba@suika.fam.cx>
449    
450 wakaba 1.322 * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
451     recommended case as per RFC 4646.
452    
453     2008-09-18 Wakaba <wakaba@suika.fam.cx>
454    
455 wakaba 1.321 * LangTag.pm (check_rfc4646_langtag): New method.
456    
457     2008-09-18 Wakaba <wakaba@suika.fam.cx>
458    
459 wakaba 1.320 * mklangreg.pl: New script.
460    
461     * Makefile: Updated for creation of the module for language subtag
462     registry.
463    
464 wakaba 1.315 2008-09-16 Wakaba <wakaba@suika.fam.cx>
465    
466 wakaba 1.319 * Makefile: WebIDL.html added.
467    
468     * WebIDL.pod: New documentation.
469    
470     2008-09-16 Wakaba <wakaba@suika.fam.cx>
471    
472 wakaba 1.318 * WebIDL.pm: Checker's error types are redefined.
473    
474     2008-09-16 Wakaba <wakaba@suika.fam.cx>
475    
476 wakaba 1.317 * WebIDL.pm: Parser's error types are redefined. Some forward
477     compatible parsing bugs are fixed. Some unreachable codes are
478     commented out.
479    
480     2008-09-16 Wakaba <wakaba@suika.fam.cx>
481    
482 wakaba 1.316 * WebIDL.pm: Support for the reminding extended attributes are
483     added. It does not satisfy the definition that a forward
484     interface declaration has an extended attribute. It seems that
485     unless explicitly allowed multiple extended attributes with the
486     same name is not allowed, though it is not explicitly mentioned in
487     the spec.
488    
489     2008-09-16 Wakaba <wakaba@suika.fam.cx>
490    
491 wakaba 1.315 * WebIDL.pm: Unescapes extended attribute names and extended
492     attribute identifiers. Preserve whether an extended attribute has
493     an argument list of not. Support for extended attributes:
494     Constructor, ExceptionConsts, IndexGetter, IndexSetter,
495     NameGetter, NameSetter, and Null.
496     (has_argument_list): New attribute.
497     (idl_text): Stringifies argument lists, if any, even if it is
498     empty.
499    
500 wakaba 1.310 2008-09-15 Wakaba <wakaba@suika.fam.cx>
501    
502 wakaba 1.314 * HTML.pm.src: New state |PCDATA_STATE|. Use an empty string for
503     |{s_kwd}| in DATA_STATE as default.
504    
505     2008-09-15 Wakaba <wakaba@suika.fam.cx>
506    
507 wakaba 1.313 * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
508     by |{s_kwd}| in DATA_STATE.
509    
510     2008-09-15 Wakaba <wakaba@suika.fam.cx>
511    
512 wakaba 1.312 * HTML.pm.src: Shorten keys.
513    
514     2008-09-15 Wakaba <wakaba@suika.fam.cx>
515    
516 wakaba 1.311 * HTML.pm.src: Remove checking for control character, surrogate
517     pair, or noncharacter code points and non-Unicode code
518     points (they should be handled by Whatpm::Charset::UnicodeChecker).
519     (parse_char_stream): Support for the |$get_wrapper| argument and
520     character stream error handlers.
521    
522     2008-09-15 Wakaba <wakaba@suika.fam.cx>
523    
524 wakaba 1.310 * ContentChecker.pm: Don't call |loda_ns_module|
525     for null-namespace elements/attributes.
526    
527     * HTML.pm.src: Fact out $disallowed_control_chars
528     as a hash.
529    
530 wakaba 1.300 2008-09-14 Wakaba <wakaba@suika.fam.cx>
531    
532 wakaba 1.309 * HTML.pm.src: Regexp typo fixed. |{prev_char}|
533     and |{next_char}| initializations are moved to initialization
534     method. |{read_until}| now supports buffering. Sync |set_inner_html|
535     with |parse_char_stream|.
536    
537     2008-09-14 Wakaba <wakaba@suika.fam.cx>
538    
539 wakaba 1.308 * HTML.pm.src (parse_char_stream): Make |set_next_char|
540     invoke |manakai_read_until|, not only |read|, where
541     possible, to decrease the number of |read| method calls.
542    
543     * mkhtmlparser.pl: Related changes to the aforementioned
544     modification.
545    
546     2008-09-14 Wakaba <wakaba@suika.fam.cx>
547    
548 wakaba 1.307 * HTML.pm.src: Use |read| instead of |getc|. |set_inner_html|
549     would report character error from now.
550    
551     2008-09-14 Wakaba <wakaba@suika.fam.cx>
552    
553 wakaba 1.306 * HTML.pm.src: White-space-leaded non-white-space character
554     tokens in "before head insertion mode" was not
555     correctly handled.
556     (set_inner_html): Reimplemented using CharString decodehandle
557     class. Support for $get_wrapper argument. Support
558     for |{read_until}| feature.
559    
560     2008-09-14 Wakaba <wakaba@suika.fam.cx>
561    
562 wakaba 1.305 * HTML.pm.src: Make a "bare ero" error for unknown
563     entities point the "&" character.
564    
565     2008-09-14 Wakaba <wakaba@suika.fam.cx>
566    
567 wakaba 1.304 * HTML.pm.src: It turns out that U+FFFD don't have to
568     be added to the list of excluded characters.
569    
570     2008-09-14 Wakaba <wakaba@suika.fam.cx>
571    
572 wakaba 1.303 * HTML.pm.src ($char_onerror): Have character decoder's |line|
573     and |column| a higher priority than the one set by the
574     tokenizer's input handler.
575     ($self->{read_until}): Exclude U+FFFD (but this might
576     not be necessary, since now we do line/column fixup in
577     the character decode handle).
578    
579     2008-09-14 Wakaba <wakaba@suika.fam.cx>
580    
581 wakaba 1.302 * HTML.pm.src: Use |{read_until}| where possible.
582    
583     2008-09-14 Wakaba <wakaba@suika.fam.cx>
584    
585 wakaba 1.301 * HTML.pm.src: Change |{getc_until}| to |{read_until}|
586     and |manakai_getc_until| to |manakai_read_until| to
587     reduce the number of string copies.
588    
589     2008-09-14 Wakaba <wakaba@suika.fam.cx>
590    
591 wakaba 1.300 * HTML.pm.src (parse_char_string): Use newly created
592     |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
593     standard feature to |open| a string as a filehandle,
594     since Perl's string filehandle seems not supporting |ungetc|
595     method correctly.
596     (parse_char_stream): Define |{getc_until}| method.
597     (DATA_STATE): Experimental support for |getc_until| feature.
598    
599 wakaba 1.292 2008-09-13 Wakaba <wakaba@suika.fam.cx>
600    
601 wakaba 1.299 * HTML.pm.src: Check points added to newly added branches.
602    
603     2008-09-13 Wakaba <wakaba@suika.fam.cx>
604    
605 wakaba 1.298 * HTML.pm.src: Remove |{char}|, which is no longer used.
606     Remove |{entity_in_attr}| and |{last_attribute_value_state}|
607     and replaced by |{prev_state}|.
608    
609     * mkhtmlparser.pl: Remove |{char}| feature.
610     Remove |!!!back-next-input-character;| macro.
611    
612     2008-09-13 Wakaba <wakaba@suika.fam.cx>
613    
614 wakaba 1.297 * HTML.pm.src: Finally we get rid of all the inner loops. Remove
615     entity related tokenizer states in favor of new states
616     implementing the consume character reference algorithm.
617    
618     2008-09-13 Wakaba <wakaba@suika.fam.cx>
619    
620     * HTML.pm.src: "Consume a character reference" algorithm is
621 wakaba 1.296 now implemented as a tokenizer's state, rather than
622     a method, with minimum changes (more changes will
623     be made, in due course). "Bogus comment state"'s inner
624     loop gets removed.
625    
626     2008-09-13 Wakaba <wakaba@suika.fam.cx>
627    
628 wakaba 1.297 * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
629 wakaba 1.295 into their own tokenizer states.
630    
631     2008-09-13 Wakaba <wakaba@suika.fam.cx>
632    
633 wakaba 1.294 * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
634     is split into three states.
635    
636     2008-09-13 Wakaba <wakaba@suika.fam.cx>
637    
638 wakaba 1.293 * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
639     itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
640     no longer does the tokenizer have to push back next input
641     characters in those states.
642    
643     2008-09-13 Wakaba <wakaba@suika.fam.cx>
644    
645 wakaba 1.292 * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
646     into four states so that no longer does the tokenizer have to push
647     back next input characters in that state.
648    
649 wakaba 1.291 2008-09-11 Wakaba <wakaba@suika.fam.cx>
650    
651     * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
652     which can be used to insert some wrapper between the character
653     stream handle and the tokenizer. (It is currently not supported
654     for |set_inner_html| for |Element|s).
655    
656 wakaba 1.289 2008-09-10 Wakaba <wakaba@suika.fam.cx>
657    
658 wakaba 1.290 * HTML.pm.src: Ignore punctuations in charset names.
659    
660     2008-09-10 Wakaba <wakaba@suika.fam.cx>
661    
662 wakaba 1.289 * ContentChecker.pm: Support for charset-layer error levels.
663    
664     * HTML.pm.src: Don't specify |text| argument for the
665     |chardecode:fallback| error, since it is not the encoding
666     being used alternatively.
667    
668 wakaba 1.288 2008-09-06 Wakaba <wakaba@suika.fam.cx>
669    
670     * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
671    
672 wakaba 1.286 2008-08-31 Wakaba <wakaba@suika.fam.cx>
673    
674 wakaba 1.287 * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
675    
676     2008-08-31 Wakaba <wakaba@suika.fam.cx>
677    
678 wakaba 1.286 * HTML.pm.src: Bug fix and sync with the spec with regard
679     to after after frameset insertion mode processing (HTML5
680     revision 1909). Note that the implementation was wrong
681     per the old spec before the r1909 changes.
682    
683 wakaba 1.278 2008-08-30 Wakaba <wakaba@suika.fam.cx>
684    
685 wakaba 1.285 * HTMLTable.pm: scope=auto algorithm fix synced with the
686     spec (HTML5 revision 2093).
687     ($process_row): Algorithm step numbers synced with the
688     spec (HTML5 revision 2092).
689    
690     2008-08-30 Wakaba <wakaba@suika.fam.cx>
691    
692 wakaba 1.284 * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
693     revision 2094).
694    
695     2008-08-30 Wakaba <wakaba@suika.fam.cx>
696    
697     * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
698    
699     2008-08-30 Wakaba <wakaba@suika.fam.cx>
700    
701 wakaba 1.283 * HTML.pm.src: '"' and "'" at the end of attribute
702     name (after another attribute) now raise parse error (HTML5
703     revision 2123). Empty unquoted attribute values are no
704     longer allowed (HTML5 revision 2122).
705    
706     2008-08-30 Wakaba <wakaba@suika.fam.cx>
707    
708 wakaba 1.282 * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
709     revision 2130).
710    
711     2008-08-30 Wakaba <wakaba@suika.fam.cx>
712    
713 wakaba 1.281 * ContentChecker.pm: |xml:lang| attribute value must be same
714     as |lang| attribute value for HTML elements (HTML5 revision 2062
715     and so on).
716    
717     2008-08-30 Wakaba <wakaba@suika.fam.cx>
718    
719 wakaba 1.280 * ContentChecker.pm: Error level definition for |xml_id_error|
720     was missing.
721    
722     * URIChecker.pm: The end of the URL should be marked as the
723     error location for an empty path error. The position
724     between the userinfo and the port components should be
725     marked as the error location for an empty host error.
726    
727     2008-08-30 Wakaba <wakaba@suika.fam.cx>
728    
729 wakaba 1.279 * URIChecker.pm: Set parameters representing where in the
730     value the error occurs for errors. Report unknown
731     address format error in warning level, since address
732     formats are rarely added. Path segments starting with "/.."
733     were misinterpreted as a dot-segment.
734    
735     2008-08-30 Wakaba <wakaba@suika.fam.cx>
736    
737 wakaba 1.278 * URIChecker.pm (check_iri_reference): Requires
738     |Message::DOM::DOMImplementation|.
739    
740 wakaba 1.277 2008-08-29 Wakaba <wakaba@suika.fam.cx>
741    
742     * IMTChecker.pm: Updated for the new error reporting architecture.
743    
744     * ContentChecker.pm: Error levels for IMTs are added.
745    
746 wakaba 1.276 2008-08-17 Wakaba <wakaba@suika.fam.cx>
747    
748     * H2H.pm (_shift_token): Support for unquoted HTML attribute
749     values.
750    
751 wakaba 1.274 2008-08-16 Wakaba <wakaba@suika.fam.cx>
752    
753 wakaba 1.275 * CacheManifest.pm: Support for new style of error
754     reports.
755    
756     * HTML.pm.src: Set line=1, column=1 to the document node.
757    
758     2008-08-16 Wakaba <wakaba@suika.fam.cx>
759    
760 wakaba 1.274 * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
761     and URL checkers. Support for more error levels for bogus
762     langauge tag and URL "standards".
763    
764     * LangTag.pm, URIChecker.pm: Support for new style error
765     level reporting.
766    
767 wakaba 1.271 2008-08-15 Wakaba <wakaba@suika.fam.cx>
768    
769 wakaba 1.273 * ContentChecker.pm: Support for RDF/XML error levels.
770    
771     * HTMLTable.pm, RDFXML.pm: Support for new style of error level
772     specifying. Error types are revised.
773    
774     2008-08-15 Wakaba <wakaba@suika.fam.cx>
775    
776 wakaba 1.272 * ContentChecker.pm: All error reporting method calls are
777     renewed.
778    
779     2008-08-15 Wakaba <wakaba@suika.fam.cx>
780    
781 wakaba 1.271 * HTML.pm.src: All error type names and "text" parameters
782     are revised. Use new style for "level" specification.
783    
784     * mkhtmlparser.pl: Use new style for "level" specification.
785    
786 wakaba 1.268 2008-08-03 Wakaba <wakaba@suika.fam.cx>
787    
788 wakaba 1.270 * WebIDL.pm (parse_char_string): Simplified error
789     reporting process for broken ignored valuetype definition.
790     (Valuetype idl_text): Support for special "DOMString" name.
791    
792     2008-08-03 Wakaba <wakaba@suika.fam.cx>
793    
794 wakaba 1.269 * WebIDL.pm ($get_scoped_name): Append "::::" if the last
795     terminal of the ScopedName is "DOMString", such that whether
796     the last part of the scoped name is "DOMString" or "_DOMString"
797     later. It is necessary to determine whether a |typedef|
798     definition should be ignored or not.
799     (parse_char_string): Unescape the identifier of
800     exception members.
801     ($resolve): Return undef for builtin types and sequence<T>
802     types (we might not have to do this, however...).
803     (check): Support checking for Exceptions, Valuetypes,
804     and Typedefs.
805     ($serialize_type): Support for "DOMString::::" syntax.
806     (Typedef idl_text): Output Type as "DOMString" if it
807     is really "DOMString" (i.e. its internal representation
808     is "::DOMString::").
809    
810     2008-08-03 Wakaba <wakaba@suika.fam.cx>
811    
812 wakaba 1.268 * WebIDL.pm ($resolve): New code, based on resolve code
813     for constant types in the |check| method.
814     (check): Support for checking of attributes, operations, and
815     arguments.
816     (Attribute/Operation idl_text): Exception names in getraises,
817     setraises, and raises clauses is serizlied by |$serialize_type|
818     code.
819    
820 wakaba 1.264 2008-08-02 Wakaba <wakaba@suika.fam.cx>
821    
822 wakaba 1.267 * WebIDL.pm ($integer): Order of selections are changed to match
823     hexadecimal numbers (the original pattern, taken from the spec,
824     was not work for hexadecimal numbers, because the "0" prefix
825     matches to the [0-7]* part (as an empty string) and therefore
826     it does not match with remaining "x..." part of a "0x..." integer
827     literal.
828     ($get_type): It now returns a string, not an array reference,
829     for regular types and |sequence| types (i.e. it in any case
830     returns a string).
831     ($get_next_token): The second item in the array that represents
832     a integer or float token is now a Perl number value, not the
833     original string representation of the number.
834     (check): Support for const value consistency checking.
835     No extended attribute is defined for constants.
836     (Node subclasses): Use simple strings rather than array references
837     for default data type values.
838     ($serialize_type): Type values are now simple strings.
839     (value): If the new attribute value is a false value, then
840     a FALSE value is set to the attribute.
841    
842     2008-08-02 Wakaba <wakaba@suika.fam.cx>
843    
844 wakaba 1.266 * WebIDL.pm ($get_scoped_name): Now scoped names are stored
845     in its stringified format ("scoped name" as defined in the
846     spec). Note that future version of this module should not use
847     array references for type values and the |type_text| attribute
848     should be made obsolete.
849     (parse_char_string): Unescape attribute names.
850     (check): Support for checking of whether inherited interfaces
851     are actually defined or not. Support for checking of whether
852     interface member identifiers are duplicated or not.
853     ($serialize_type): Scoped names are returned as is. A future
854     version of this code should escape identifiers other than "DOMString",
855     otherwise the idl_text would be non-conforming.
856    
857     2008-08-02 Wakaba <wakaba@suika.fam.cx>
858    
859 wakaba 1.264 * WebIDL.pm (parse_char_string): Set line/column numbers
860 wakaba 1.265 to generated nodes. Unescape identifiers. Extended attributes
861     for Definition's were ignored.
862     (append_child): Set |parent_node| attribute.
863     (parent_node): New attribute.
864     (check): Support interface/exception members. Support
865     extended attributes. Support definition identifier uniqueness
866     constraint.
867     (qualified_name): New attribute.
868     (Interface/Exception idl_text): Extended attributes were
869     not prepended to the returned text.
870    
871     2008-08-02 Wakaba <wakaba@suika.fam.cx>
872    
873     * WebIDL.pm (parse_char_string): Set line/column numbers
874 wakaba 1.264 to interface object experimentally. s/shift/pop/g, shift
875     would make things wrong. Support for interface forward
876     declarations was missing. Broken interface declarations
877     with no block were not ignored entirely.
878     (Whatpm::WebIDL::Node): New abstract class. This class
879     makes things easier.
880     (child_nodes): New attribute. Unlike DOM's attribute with
881     same name, this attribute returns a dead list of nodes for
882     simplicity.
883     (get_user_data, set_user_data): New methods.
884     (Module idl_text): A SPACE character should be inserted
885     before the |{| character.
886     (Interface idl_text): Support for interface forward declarations.
887     (is_forward_declaration): New attribute.
888    
889 wakaba 1.259 2008-07-19 Wakaba <wakaba@suika.fam.cx>
890    
891 wakaba 1.263 * WebIDL.pm (type_text): Better serializer.
892    
893     2008-07-19 Wakaba <wakaba@suika.fam.cx>
894    
895 wakaba 1.262 * WebIDL.pm: Revise forward-compatible parsing so that
896     it now can handle broken extended attributes and as such.
897    
898     2008-07-19 Wakaba <wakaba@suika.fam.cx>
899    
900 wakaba 1.261 * WebIDL.pm: Real support for extended attributes.
901     Support for extended attributes with arguments.
902    
903     2008-07-19 Wakaba <wakaba@suika.fam.cx>
904    
905 wakaba 1.260 * WebIDL.pm: Support for |exception| syntax.
906     (Interface->idl_text): Tentative support for inheritances.
907    
908     2008-07-19 Wakaba <wakaba@suika.fam.cx>
909    
910 wakaba 1.259 * WebIDL.pm: Hierarchical scoped name support was broken.
911     Support for raises, setraises, and getraises syntaxes.
912    
913 wakaba 1.258 2008-07-18 Wakaba <wakaba@suika.fam.cx>
914    
915     * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
916     proper support for types, extended attributes, and exceptions yet).
917     WebIDL parser, version 1 (no support for exceptions yet,
918     no proper support for extended attributes yet).
919    
920 wakaba 1.257 2008-07-09 Wakaba <wakaba@suika.fam.cx>
921    
922     * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
923    
924 wakaba 1.255 2008-06-29 Wakaba <wakaba@suika.fam.cx>
925    
926 wakaba 1.256 * WebIDL.pm: Support for valuetype and const.
927    
928     2008-06-29 Wakaba <wakaba@suika.fam.cx>
929    
930 wakaba 1.255 * WebIDL.pm: New module.
931    
932 wakaba 1.254 2008-06-15 Wakaba <wakaba@suika.fam.cx>
933    
934     * Makefile (Entities.html): URI changed.
935    
936 wakaba 1.253 2008-06-08 Wakaba <wakaba@suika.fam.cx>
937    
938     * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
939    
940 wakaba 1.251 2008-06-01 Wakaba <wakaba@suika.fam.cx>
941    
942 wakaba 1.252 * HTML.pm.src (_get_next_token): A parse error was missing.
943    
944     2008-06-01 Wakaba <wakaba@suika.fam.cx>
945    
946 wakaba 1.251 * mklinktypelist.pl: rel=contact is no longer part of the HTML5
947     spec (commented out). (HTML5 revision 1711).
948    
949 wakaba 1.248 2008-05-25 Wakaba <wakaba@suika.fam.cx>
950    
951 wakaba 1.250 * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
952    
953     * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
954     as UTF-16 (HTML5 revision 1701).
955    
956     2008-05-25 Wakaba <wakaba@suika.fam.cx>
957    
958 wakaba 1.249 * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
959     1692).
960    
961     2008-05-25 Wakaba <wakaba@suika.fam.cx>
962    
963     * HTML.pm.src: The secondary insertion mode used when switching
964     to foreign content is the "in body" insertion mode (HTML5 revision
965     1696).
966    
967     2008-05-25 Wakaba <wakaba@suika.fam.cx>
968    
969 wakaba 1.248 * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
970     revision 1697).
971    
972 wakaba 1.239 2008-05-24 Wakaba <wakaba@suika.fam.cx>
973    
974 wakaba 1.247 * HTML.pm.src: Support for end-of-file token in foreign content
975     insertion mode (HTML5 revision 1693). Update SVG camelCase
976     attribute list (HTML5 revision 1700). <textarea> closes
977     </select> (HTML5 revision 1699). More start tags close in
978     foreign content insertion mode (HTML5 revision 1698).
979    
980     2008-05-24 Wakaba <wakaba@suika.fam.cx>
981    
982 wakaba 1.246 * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
983    
984     2008-05-24 Wakaba <wakaba@suika.fam.cx>
985    
986 wakaba 1.245 * HTML.pm.src: More robust charset parameter detection (HTML5
987     revision 1674).
988    
989     2008-05-24 Wakaba <wakaba@suika.fam.cx>
990    
991 wakaba 1.244 * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
992     revision 1676).
993    
994     2008-05-24 Wakaba <wakaba@suika.fam.cx>
995    
996 wakaba 1.243 * HTML.pm.src: Ignore language part of public identifiers for
997     quriks mode detection (HTML5 revision 1679).
998    
999     2008-05-24 Wakaba <wakaba@suika.fam.cx>
1000    
1001 wakaba 1.242 * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
1002     revision 1685).
1003    
1004     2008-05-24 Wakaba <wakaba@suika.fam.cx>
1005    
1006 wakaba 1.241 * HTML.pm.src: Support for EOF in new states for tags (HTML5
1007     revision 1684).
1008    
1009     2008-05-24 Wakaba <wakaba@suika.fam.cx>
1010    
1011 wakaba 1.240 * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
1012     work (HTML5 revision 1690).
1013    
1014     2008-05-24 Wakaba <wakaba@suika.fam.cx>
1015    
1016 wakaba 1.239 * HTML.pm.src (_tree_construction_main): Change handling of
1017     end tags in head insertion modes (HTML5 revision 1686).
1018     (parse_char_string): Bug fix for non-utf8 character string handlings.
1019     (parse_char_stream): |ungetc| does not work well for this context.
1020    
1021 wakaba 1.237 2008-05-18 Wakaba <wakaba@suika.fam.cx>
1022    
1023 wakaba 1.238 * HTML.pm.src (parse_byte_string): Redefined to invoke
1024     |parse_byte_stream|.
1025     (parse_byte_stream): New method.
1026    
1027     2008-05-18 Wakaba <wakaba@suika.fam.cx>
1028    
1029 wakaba 1.237 * HTML.pm.src (parse_byte_string): Fix the column number reported
1030     by encoding layer error reporter.
1031    
1032 wakaba 1.233 2008-05-17 Wakaba <wakaba@suika.fam.cx>
1033    
1034 wakaba 1.236 * HTML.pm.src (parse_byte_string): Use streaming decoder
1035     rather than converting the whole byte string and then parsing.
1036     Propagate errors in character encoding layer.
1037     (get_next_token): Precise error reporting for |bare stago| error.
1038    
1039     2008-05-17 Wakaba <wakaba@suika.fam.cx>
1040    
1041 wakaba 1.235 * HTML.pm.src (parse_char_stream): New method.
1042     (parse_char_string): This method is now defined as an invocation
1043     of the |parse_char_stream| method.
1044    
1045     2008-05-17 Wakaba <wakaba@suika.fam.cx>
1046    
1047 wakaba 1.234 * HTML.pm.src (parse_byte_string): Report various status
1048     of the sniffing as info-level errors. Support for new
1049     decoding framework in parser resestting.
1050     (new): Various default error levels were not set.
1051    
1052     2008-05-17 Wakaba <wakaba@suika.fam.cx>
1053    
1054 wakaba 1.233 * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
1055     algorithm, except for the actual sniffing, is implemented
1056     with new framework with Message::Charset::Info.
1057    
1058 wakaba 1.232 2008-05-16 Wakaba <wakaba@suika.fam.cx>
1059    
1060     * CacheManifest.pm (_parse): Drop fragment identifiers from
1061     URIs in fallback section (HTML5 revision 1596).
1062    
1063 wakaba 1.230 2008-05-10 Wakaba <wakaba@suika.fam.cx>
1064    
1065 wakaba 1.231 * Makefile (Entities.html): URI has changed.
1066    
1067     2008-05-10 Wakaba <wakaba@suika.fam.cx>
1068    
1069 wakaba 1.230 * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
1070     1553).
1071    
1072 wakaba 1.228 2008-05-06 Wakaba <wakaba@suika.fam.cx>
1073    
1074 wakaba 1.229 * ContentChecker.pm: Noted that those returned in |table| are
1075     no longer table elements, but table objects returned
1076     by Whatpm::HTMLTable.
1077    
1078     * HTMLTable.pm (form_table): Return table element node
1079     as |$table->{element}|.
1080     (assign_header): Support for the |headers=""| attribute.
1081    
1082     2008-05-06 Wakaba <wakaba@suika.fam.cx>
1083    
1084 wakaba 1.228 * HTMLTable.pm (assign_header): New function; first version
1085     with no support for headers="".
1086     (form_table): Include table width and height to the returned
1087     table object for covenience. Indexing in column assignement was wrong.
1088     Set whether a data cell is empty or not for convenience.
1089    
1090 wakaba 1.222 2008-05-05 Wakaba <wakaba@suika.fam.cx>
1091    
1092 wakaba 1.227 * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
1093    
1094     2008-05-05 Wakaba <wakaba@suika.fam.cx>
1095    
1096 wakaba 1.226 * HTMLTable.pm: How table model errors are detected is
1097     changed (HTML5 revision 1387).
1098    
1099     2008-05-05 Wakaba <wakaba@suika.fam.cx>
1100    
1101 wakaba 1.225 * HTMLTable.pm: The algorithm now moves |tfoot| elements
1102     to the end of the table (HTML5 revision 1380).
1103    
1104     2008-05-05 Wakaba <wakaba@suika.fam.cx>
1105    
1106 wakaba 1.224 * HTMLTable.pm: The algorithm is now 0-based indexing, instead
1107     of 1-based (HTML5 revision 1376).
1108    
1109     2008-05-05 Wakaba <wakaba@suika.fam.cx>
1110    
1111 wakaba 1.223 * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
1112     and "Content-Encoding" no longer prevent sniffing (HTML5
1113     revision 1288).
1114    
1115     2008-05-05 Wakaba <wakaba@suika.fam.cx>
1116    
1117 wakaba 1.222 * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
1118     1282).
1119    
1120 wakaba 1.220 2008-05-03 Wakaba <wakaba@suika.fam.cx>
1121    
1122 wakaba 1.221 * ContentChecker.pm: Support for global attributes.
1123     Status of XML specs are added.
1124    
1125     2008-05-03 Wakaba <wakaba@suika.fam.cx>
1126    
1127 wakaba 1.220 * ContentChecker.pm (check_element): Support for |template=""|
1128     and |ref=""| attribute (referent element type checking).
1129    
1130 wakaba 1.218 2008-04-29 Wakaba <wakaba@suika.fam.cx>
1131    
1132 wakaba 1.219 * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
1133     1500) is implemented (except for IDNA part and URI-scheme-specific
1134 wakaba 1.218 knowledge). Line number counting was wrong for LF-only
1135     documents.
1136    
1137 wakaba 1.211 2008-04-13 Wakaba <wakaba@suika.fam.cx>
1138    
1139 wakaba 1.217 * HTML.pm.src: Raise an parse error for any disallowed
1140     character (HTML5 revision 1263).
1141    
1142     2008-04-13 Wakaba <wakaba@suika.fam.cx>
1143    
1144 wakaba 1.216 * mkentitylist.pl: Support for new HTML5 entity table format (the
1145     definition for |AElig;| was missing).
1146    
1147     2008-04-13 Wakaba <wakaba@suika.fam.cx>
1148    
1149 wakaba 1.215 * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
1150     name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
1151    
1152     2008-04-13 Wakaba <wakaba@suika.fam.cx>
1153    
1154 wakaba 1.214 * HTML.pm.src: List of element names that close foreign content
1155     insertion mode is added (HTML5 revisions 1412 and 1418).
1156    
1157     2008-04-13 Wakaba <wakaba@suika.fam.cx>
1158    
1159 wakaba 1.213 * HTML.pm.src: Support for |mglyph| and |malignmark|
1160     elements (HTML5 revision 1410).
1161    
1162     2008-04-13 Wakaba <wakaba@suika.fam.cx>
1163    
1164 wakaba 1.212 * HTML.pm.src: Support for new long MathML entities (HTML5
1165     revision 1406).
1166    
1167     2008-04-13 Wakaba <wakaba@suika.fam.cx>
1168    
1169 wakaba 1.211 * HTML.pm.src: CDATA section support for MathML and SVG
1170     elements (HTML5 revision 1404 and 1420).
1171    
1172 wakaba 1.209 2008-04-12 Wakaba <wakaba@suika.fam.cx>
1173    
1174 wakaba 1.210 * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
1175     elements (HTML5 revision 1404). Unused !!!macro definitions
1176     are removed.
1177    
1178     2008-04-12 Wakaba <wakaba@suika.fam.cx>
1179    
1180 wakaba 1.209 * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
1181     are raised is changed (HTML5 revision 1404).
1182    
1183 wakaba 1.206 2008-04-06 Wakaba <wakaba@suika.fam.cx>
1184    
1185 wakaba 1.208 * HTML.pm.src: Category sets that are no longer used are removed.
1186    
1187     2008-04-06 Wakaba <wakaba@suika.fam.cx>
1188    
1189 wakaba 1.207 * HTML.pm.src: The ->[1] property of stack entries are now
1190     replaced by constants representing element category.
1191    
1192     2008-04-06 Wakaba <wakaba@suika.fam.cx>
1193    
1194 wakaba 1.206 * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
1195     for error reporting. (This is a preparation for using constant
1196     value for ->[1].)
1197    
1198 wakaba 1.204 2008-03-22 Wakaba <wakaba@suika.fam.cx>
1199    
1200 wakaba 1.205 * RDFXML.pm: Typo fixed.
1201    
1202     2008-03-22 Wakaba <wakaba@suika.fam.cx>
1203    
1204 wakaba 1.204 * ContentChecker.pm: |fact_level| is now treated
1205     as same as |must_level|, i.e. level = |m|.
1206     (check_element): Make list of URIs in the DOM.
1207    
1208 wakaba 1.197 2008-03-21 Wakaba <wakaba@suika.fam.cx>
1209    
1210 wakaba 1.203 * RDFXML.pm: Language accessor implemented.
1211     Local (null-namespace) attribute support.
1212    
1213     2008-03-21 Wakaba <wakaba@suika.fam.cx>
1214    
1215 wakaba 1.202 * RDFXML.pm: Factored out ID checking code.
1216    
1217     2008-03-21 Wakaba <wakaba@suika.fam.cx>
1218    
1219 wakaba 1.201 * RDFXML.pm: TODO items noted. Validation of ID
1220     and URI attributes is implemented. Warn if unknown
1221     value is used in rdf:parseType="" attribute.
1222    
1223     * URIChecker.pm (check_rdf_uri_reference): New function.
1224    
1225     2008-03-21 Wakaba <wakaba@suika.fam.cx>
1226    
1227 wakaba 1.200 * RDFXML.pm: bnodeid implemented. Relative references
1228     are now resolved.
1229    
1230     2008-03-21 Wakaba <wakaba@suika.fam.cx>
1231    
1232 wakaba 1.199 * ContentChecker.pm: RDF reification implemented.
1233    
1234     * RDFXML.pm: undef vs false bug fixed.
1235     Reification implemented.
1236    
1237     2008-03-21 Wakaba <wakaba@suika.fam.cx>
1238    
1239 wakaba 1.198 * RDFXML.pm: s/id/ID/ for attribute name.
1240     The |node| arguments are added for |ontriple| calls.
1241     Too many "attribute not allowed" errors were raised.
1242    
1243     * ContentChecker.pm: Initial experimental support for rdf:RDF
1244     element.
1245    
1246     2008-03-21 Wakaba <wakaba@suika.fam.cx>
1247    
1248 wakaba 1.197 * RDFXML.pm: New module.
1249    
1250 wakaba 1.192 2008-03-20 Wakaba <wakaba@suika.fam.cx>
1251    
1252 wakaba 1.196 * HTML.pm.src (set_inner_html): Line/column number
1253     code was old one yet.
1254    
1255     2008-03-20 Wakaba <wakaba@suika.fam.cx>
1256    
1257 wakaba 1.195 * HTML.pm.src: Better line/column reporting for "duplicate attribute"
1258     errors. Line/column markings for DOCTYPE, comment, and
1259     character tokens are reintroduced; otherwise,
1260     error location for "not HTML5" error and errors
1261     for implied elements are not attached.
1262    
1263     2008-03-20 Wakaba <wakaba@suika.fam.cx>
1264    
1265 wakaba 1.194 * HTML.pm.src: Set line/column numbers to attributes.
1266    
1267     * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
1268     (value): Setter implemented.
1269    
1270     * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
1271    
1272     2008-03-20 Wakaba <wakaba@suika.fam.cx>
1273    
1274 wakaba 1.193 * HTML.pm.src: Unused line/column markings are removed.
1275    
1276     2008-03-20 Wakaba <wakaba@suika.fam.cx>
1277    
1278 wakaba 1.192 * HTML.pm.src (_get_next_token): Remove |first_start_tag|
1279     flag, which is no longer used.
1280    
1281 wakaba 1.190 2008-03-17 Wakaba <wakaba@suika.fam.cx>
1282    
1283 wakaba 1.191 * HTML.pm.src: Set line/column information to element nodes.
1284    
1285     * mkhtmlparser.pl (!!!create-element, !!!insert-element,
1286     and !!!insert-element-t): Set line/column information to
1287     element nodes.
1288    
1289     2008-03-17 Wakaba <wakaba@suika.fam.cx>
1290    
1291 wakaba 1.190 * HTML.pm.src (_get_next_token): The first "<" character
1292     in "<?", "<>", or "</>" should be the error point.
1293    
1294 wakaba 1.187 2008-03-16 Wakaba <wakaba@suika.fam.cx>
1295    
1296 wakaba 1.189 * HTML.pm.src: Some more fixes on error position reporting.
1297    
1298     2008-03-16 Wakaba <wakaba@suika.fam.cx>
1299    
1300 wakaba 1.188 * HTML.pm.src: Token-level precious error reporting.
1301    
1302     2008-03-16 Wakaba <wakaba@suika.fam.cx>
1303    
1304 wakaba 1.187 * HTML.pm.src: Preparation for more precious error point
1305     reporting.
1306    
1307 wakaba 1.184 2008-03-11 Wakaba <wakaba@suika.fam.cx>
1308    
1309 wakaba 1.186 * HTML.pm.src: Error type revised.
1310    
1311     2008-03-11 Wakaba <wakaba@suika.fam.cx>
1312    
1313 wakaba 1.185 * HTML.pm.src: Similar codes are merged together, again.
1314    
1315     2008-03-11 Wakaba <wakaba@suika.fam.cx>
1316    
1317 wakaba 1.184 * HTML.pm.src: Similar codes are merged together.
1318    
1319 wakaba 1.183 2008-03-10 Wakaba <wakaba@suika.fam.cx>
1320    
1321     * mkhtmlparser.pl: Set "level" parameter to parse errors.
1322    
1323     * HTML.pm.src: Code refined.
1324    
1325 wakaba 1.175 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1326    
1327 wakaba 1.182 * HTML.pm.src: |</body>| treatement has been changed (HTML5
1328     revision 1348). Note that I really don't know this makes
1329     any difference in the black-box behavior of the parser.
1330    
1331     2008-03-09 Wakaba <wakaba@suika.fam.cx>
1332    
1333 wakaba 1.181 * HTML.pm.src: New end-of-file token implementation (HTML5
1334     revision 1348).
1335    
1336     2008-03-09 Wakaba <wakaba@suika.fam.cx>
1337    
1338 wakaba 1.180 * HTML.pm.src: |applet| support (HTML5 revision 1347).
1339    
1340     2008-03-09 Wakaba <wakaba@suika.fam.cx>
1341    
1342 wakaba 1.179 * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
1343    
1344     2008-03-09 Wakaba <wakaba@suika.fam.cx>
1345    
1346 wakaba 1.178 * HTML.pm.src: Support for |<input>| in the "in select" insertion
1347     mode and support for the "in select in table" insertion mode (HTML5
1348     revision 1342).
1349    
1350     2008-03-09 Wakaba <wakaba@suika.fam.cx>
1351    
1352 wakaba 1.177 * HTML.pm.src: No foster parenting for <script> and <script>
1353     in non-tainted <table>s (HTML5 revision 1336).
1354    
1355     2008-03-09 Wakaba <wakaba@suika.fam.cx>
1356    
1357 wakaba 1.176 * HTML.pm.src: Ignore white space characters between <html>
1358     and <head> (HTML5 revision 1332).
1359    
1360     2008-03-09 Wakaba <wakaba@suika.fam.cx>
1361    
1362 wakaba 1.175 * HTML.pm.src: Treat <input type=hidden> as if it were a
1363     white space (HTML5 revision 1331).
1364    
1365 wakaba 1.164 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1366    
1367 wakaba 1.174 * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
1368     element (HTML5 revision 1330).
1369    
1370     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1371    
1372 wakaba 1.173 * HTML.pm.src: <title> is always appended to the current
1373     element (HTML5 revision 1328).
1374    
1375     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1376    
1377 wakaba 1.172 * HTML.pm.src: White space in tainted tables are moved
1378     into foster parents (HTML5 revision 1326).
1379    
1380     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1381    
1382 wakaba 1.171 * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
1383     revision 1321).
1384    
1385     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1386    
1387 wakaba 1.170 * HTML.pm.src: |</h/n/>| case code rearranged to align with
1388     the spec (HTML5 revision 1320). Note that we finally complete
1389     all of HTML5 revision 1320 changes.
1390    
1391     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1392    
1393 wakaba 1.169 * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
1394     tags (HTML5 revision 1320).
1395    
1396     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1397    
1398 wakaba 1.168 * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
1399    
1400     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1401    
1402 wakaba 1.167 * HTML.pm.src: A "generate implied end tags" code (t409.1)
1403     could not be reached so that it is now removed (HTML5 revision
1404     1320).
1405    
1406     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1407    
1408 wakaba 1.166 * HTML.pm.src: Code for the case of |</div>| and so on are revised
1409     to align with new spec text (HTML5 revision 1320).
1410    
1411     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1412    
1413 wakaba 1.165 * HTML.pm.src: Remove strange |if| condition; however, it should
1414     have had no harm in theory.
1415    
1416     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1417    
1418 wakaba 1.164 * HTML.pm.src (_tree_construction_main): '</p>' in body
1419     case is split from other end tags for the preparation
1420     of implementing HTML revision 1320.
1421    
1422 wakaba 1.162 2008-03-07 Wakaba <wakaba@suika.fam.cx>
1423    
1424 wakaba 1.163 * HTML.pm.src: Simplified "generate implied end tag" (HTML5
1425     revision 1320).
1426    
1427     2008-03-07 Wakaba <wakaba@suika.fam.cx>
1428    
1429 wakaba 1.162 * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
1430     and "div" (HTML5 revision 1318). Add comments to where
1431     |form| pointer association codes should be inserted (HTML5 revision
1432     1319).
1433    
1434 wakaba 1.161 2008-03-06 Wakaba <wakaba@suika.fam.cx>
1435    
1436     * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
1437    
1438 wakaba 1.159 2008-03-05 Wakaba <wakaba@suika.fam.cx>
1439    
1440 wakaba 1.160 * HTML.pm.src: Since the case t268 should never be reached (no
1441     other token type, there are), it is replaced by a |die| statement.
1442    
1443     2008-03-05 Wakaba <wakaba@suika.fam.cx>
1444    
1445 wakaba 1.159 * HTML.pm.src: Typo fixed.
1446    
1447 wakaba 1.158 2008-03-04 Wakaba <wakaba@suika.fam.cx>
1448    
1449     * HTML.pm.src (_tree_construction_initial): Some limited quirks
1450     doctypes were not uppercased for comparison.
1451    
1452 wakaba 1.152 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1453    
1454 wakaba 1.157 * HTML.pm.src (tree construction and set_inner_html): Checkpoints
1455     are added.
1456    
1457     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1458    
1459 wakaba 1.156 * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
1460     are set. Cases that are unlikely reached are noted as so.
1461    
1462     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1463    
1464 wakaba 1.155 * HTML.pm.src: Checkpoints for debugging are added.
1465    
1466     * mkhtmlparser.pl: Support for |!!!cp| syntax.
1467    
1468     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1469    
1470 wakaba 1.154 * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
1471     for simplicity.
1472    
1473     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1474    
1475 wakaba 1.153 * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
1476     revision 1307).
1477    
1478     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1479    
1480 wakaba 1.152 * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
1481     is set are changed (HTML5 revision 1305).
1482    
1483 wakaba 1.149 2008-03-02 Wakaba <wakaba@suika.fam.cx>
1484    
1485 wakaba 1.151 * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
1486     1292). Entities are not parsed in comment-like part in RCDATA
1487     elements (HTML5 revision 1294). Allow bare & at the end
1488     of attribute value literals (HTML5 revision 1296). More
1489     quirks mode doctypes (HTML5 revision 1302). Requires spaces
1490     between attributes and ban attribute names or unquoted
1491     attribute values containing single or double quotes (HTML5
1492     revision 1303).
1493    
1494     2008-03-02 Wakaba <wakaba@suika.fam.cx>
1495    
1496 wakaba 1.150 * ContentChecker.pm: Typo fixed. Don't raise "character encoding"
1497     and related errors unless it is an HTML document (though the spec
1498     is unclear on whether it is applied to XHTML document).
1499    
1500     * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
1501     are added.
1502    
1503     2008-03-02 Wakaba <wakaba@suika.fam.cx>
1504    
1505 wakaba 1.149 * HTML.pm.src: s/local_name/manakai_local_name/g.
1506    
1507 wakaba 1.148 2008-03-01 Wakaba <wakaba@suika.fam.cx>
1508    
1509     * _NamedEntityList.pm: Updated (HTML5 revision 1286).
1510    
1511     * HTML.pm.src: |charset| in |content| attribute is
1512     case-insensitive (HTML5 revision 1270).
1513    
1514 wakaba 1.147 2008-02-26 Wakaba <wakaba@suika.fam.cx>
1515    
1516     * ContentChecker.pm: New status constants are added.
1517     ($ElementDefault): |status| added.
1518     (check_element): Err for non-standard or deprecated elements.
1519     (_attr_status_info): For non-standard or deprecated attributes.
1520    
1521 wakaba 1.142 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1522    
1523 wakaba 1.146 * ContentChecker.pm (_attr_status_info): New internal method.
1524    
1525     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1526    
1527 wakaba 1.145 * ContentChecker.pm (check_element): Element standardized
1528     status information is now dispatched.
1529    
1530     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1531    
1532 wakaba 1.144 * ContentChecker.pm (check_element): Fix |del|-and-significant
1533     problem by adding some more arguments.
1534    
1535     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1536    
1537 wakaba 1.143 * ContentChecker.pm (check_element): Use context of
1538     container-for-the-purpose-of-content-model element (not transparent
1539     element) for |check_child_element| calling and significant
1540     text flag marking. This reintroduces |<del>|-and-significant
1541     problem again.
1542    
1543     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1544    
1545 wakaba 1.142 * ContentChecker.pm (check_element): Make semi-transparent
1546     elements ignored for the purpose of phase changes in
1547     content model checking.
1548    
1549 wakaba 1.138 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1550    
1551 wakaba 1.141 * ContentChecker.pm (check_element): In-element state
1552     was not properly managed for transparent cases.
1553    
1554     2008-02-23 Wakaba <wakaba@suika.fam.cx>
1555    
1556 wakaba 1.140 * ContentChecker.pm (check_element): Support for |video|
1557     and |audio| as semi-transparent elements.
1558    
1559     2008-02-23 Wakaba <wakaba@suika.fam.cx>
1560    
1561 wakaba 1.139 * ContentChecker.pm ($HTMLSemiTransparentElements): New.
1562     (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
1563     Support for |html:object| as a semi-transparent element.
1564    
1565     2008-02-23 Wakaba <wakaba@suika.fam.cx>
1566    
1567 wakaba 1.138 * ContentChecker.pm (check_element): The way to traverse
1568     the tree is entirely revised to make it easier to track
1569     the state of ancestors/descendants. As a result of this
1570     revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1571     support for content model checking for HTML elements |figure|,
1572     |object|, |video|, and |audio| and checking for XML elements (and
1573     some XMLNS checkings) are dropped for now. They will be
1574     reimplemented in due cource.
1575    
1576 wakaba 1.134 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1577    
1578 wakaba 1.137 * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1579     token (HTML5 revision 1225).
1580    
1581     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1582    
1583 wakaba 1.136 * ContentChecker.pm ({unsupported_level}): New value.
1584    
1585     * HTML.pm.src: Save whether |meta| |content| attribute
1586     contains character references or not.
1587    
1588     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1589    
1590 wakaba 1.135 * ContentChecker.pm (_get_children): (Incompleted) attempt to
1591     imlement significant content checking for contents
1592     with |del| elements.
1593    
1594     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1595    
1596 wakaba 1.134 * ContenteChecker.pm ($HTMLTransparentElements): More
1597     elements are added.
1598     (_get_children): HTML |object| elements are now semi-transparent.
1599    
1600     * NanoDOM.pm (manakai_html, manakai_head): New methods.
1601    
1602 wakaba 1.133 2008-02-16 Wakaba <wakaba@suika.fam.cx>
1603    
1604     * CacheManifest.pm: HTML5 revision 1211 implemented.
1605    
1606     * CacheManifest.pod: Updated.
1607    
1608 wakaba 1.132 2008-02-10 Wakaba <wakaba@suika.fam.cx>
1609    
1610     * ContentChecker.pm (check_document, check_element): Support
1611     for second argument ($onsubdoc).
1612     (_get_css_parser): Removed (now it is part of WDCC).
1613    
1614 wakaba 1.131 2008-02-09 Wakaba <wakaba@suika.fam.cx>
1615    
1616     * ContentChecker.pm (_get_css_parser): New.
1617    
1618 wakaba 1.129 2007-11-25 Wakaba <wakaba@suika.fam.cx>
1619    
1620 wakaba 1.130 * ContentChecker.pm ($AnyChecker): Old way to add child elements
1621     for checking had been used.
1622    
1623     2007-11-25 Wakaba <wakaba@suika.fam.cx>
1624    
1625 wakaba 1.129 * ContentChecker.pm (check_element): New todo item type |descendant|.
1626    
1627 wakaba 1.125 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1628    
1629 wakaba 1.128 * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1630     specced in RFC 2046 and RFC 4288.
1631     (application/atom+xml): Definition added.
1632    
1633     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1634    
1635 wakaba 1.127 * URIChecker.pm: Make RFC 3986 should-level errors
1636     warnings (rather than SHOULD-level errors).
1637    
1638     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1639    
1640 wakaba 1.126 * NanoDOM.pm (get_user_data, set_user_data): New methods.
1641    
1642     * HTML.pm.src: A flag for character references in attribute
1643     values are added. Set |manakai_has_reference| user data
1644     to |charset| attribute.
1645    
1646     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1647    
1648 wakaba 1.125 * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1649     attributes.
1650    
1651     * ContentChecker.pm (check_document): Warn if charset requirements
1652     cannot be tested.
1653    
1654 wakaba 1.124 2007-11-19 Wakaba <wakaba@suika.fam.cx>
1655    
1656     * HTML.pm.src (parse_byte_string): Detect charset
1657     by universalchardet if charset parameter is not specified.
1658    
1659     * Makefile (Charset-all, Charset-clean): New rules.
1660    
1661 wakaba 1.120 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1662    
1663 wakaba 1.123 * ContentChecker.pm (check_document): Check the existence
1664     of character encoding declaration and selection of encoding
1665     for HTML document.
1666    
1667     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1668    
1669 wakaba 1.122 * ContentType.pm (get_sniffed_type): Return also the official
1670     type in list context.
1671    
1672     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1673    
1674 wakaba 1.121 * ContentType.pm: Sniffing with leading white space
1675     ignoring (HTML5 revisions 1013 and 1016).
1676    
1677     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1678    
1679 wakaba 1.120 * ContentType.pm: HTML5 revision 1013 changes, except for leading
1680     white spaces, are implemented.
1681    
1682 wakaba 1.117 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1683    
1684 wakaba 1.119 * HTML.pm.src (parse_char_string): Set |inner_encoding|
1685     attribute if possible.
1686    
1687     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1688    
1689 wakaba 1.118 * HTML.pm.src (parse_byte_string): New method.
1690     (parse_char_string): New alias for |parse_string|.
1691     (main phase): Invoking "change the encoding" algorithm if desired.
1692    
1693     * HTML.pod: Updated.
1694    
1695     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1696    
1697 wakaba 1.117 * HTML.pod (get_inner_html): Removed.
1698    
1699     * Makefile (HTML-all, HTML-clean): New.
1700    
1701     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1702    
1703     * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1704    
1705 wakaba 1.116 2007-11-08 Wakaba <wakaba@suika.fam.cx>
1706    
1707     * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1708    
1709 wakaba 1.112 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1710    
1711 wakaba 1.115 * Makefile: |CacheManifest.html| is added.
1712    
1713     * CacheManifest.pod: New file.
1714    
1715     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1716    
1717 wakaba 1.114 * CacheManifest.pm: New module.
1718    
1719     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1720    
1721 wakaba 1.113 * HTML.pm.src: Support for application cache selection algorithm
1722     callback.
1723    
1724     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1725    
1726 wakaba 1.112 * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1727    
1728 wakaba 1.110 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1729    
1730 wakaba 1.111 * Makefile (clean): New rule.
1731    
1732     * NanoDOM.pm (public_id, system_id): New attributes.
1733    
1734     2007-10-17 Wakaba <wakaba@suika.fam.cx>
1735    
1736 wakaba 1.110 * Makefile (CSS-all, CSS-clean, clean): New rules.
1737    
1738 wakaba 1.109 2007-10-14 Wakaba <wakaba@suika.fam.cx>
1739    
1740     * ContentChecker.pm (check_document): Support for
1741     new |is_xml_root| flag.
1742     (check_element): Support for new |pluses| state.
1743     (_add_pluses): New method.
1744     (_remove_minuses): Support for new |minus| item.
1745    
1746 wakaba 1.106 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1747    
1748 wakaba 1.107 * ContentChecker.pm: Raise specific error for invalid
1749     root element.
1750    
1751     2007-09-24 Wakaba <wakaba@suika.fam.cx>
1752    
1753 wakaba 1.106 * ContentChecker.pm: Set level values for later uses.
1754    
1755 wakaba 1.103 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1756    
1757 wakaba 1.105 * ContentChecker.pm: Support for language tag validation.
1758    
1759     2007-09-09 Wakaba <wakaba@suika.fam.cx>
1760    
1761 wakaba 1.104 * LangTag.pm (check_rfc3066_language_tag): New method.
1762    
1763     2007-09-09 Wakaba <wakaba@suika.fam.cx>
1764    
1765 wakaba 1.103 * LangTag.pm: New module.
1766    
1767 wakaba 1.102 2007-09-04 Wakaba <wakaba@suika.fam.cx>
1768    
1769     * HTML.pm.src: Some error types were wrong.
1770    
1771 wakaba 1.100 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1772    
1773 wakaba 1.101 * CSS/: New directory.
1774    
1775     2007-08-17 Wakaba <wakaba@suika.fam.cx>
1776    
1777 wakaba 1.100 * ContentChecker.pm (_check_get_children): Support
1778     for |noscript| in |head|.
1779    
1780 wakaba 1.99 2007-08-12 Wakaba <wakaba@suika.fam.cx>
1781    
1782     * URI/: New directory.
1783    
1784 wakaba 1.94 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1785    
1786 wakaba 1.98 * HTML.pm.src: Tokenizer's states are now represented in
1787     number.
1788    
1789     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1790    
1791 wakaba 1.97 * HTML.pm.src: |or|s for insertion modes are replaced
1792     by |&|s.
1793    
1794     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1795    
1796 wakaba 1.96 * HTML.pm.src: Token types are now represented in number.
1797    
1798     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1799    
1800 wakaba 1.95 * ContentType.pm (SEE ALSO): Updated.
1801    
1802     * HTML.pm.src: Insertion modes are now represented in number.
1803    
1804     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1805    
1806 wakaba 1.94 * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1807     999) is implemented.
1808    
1809 wakaba 1.93 2007-08-08 Wakaba <wakaba@suika.fam.cx>
1810    
1811     * ContentType.pm: Sniffing for |<script| (HTML5 revision
1812     983) is implemented.
1813    
1814 wakaba 1.92 2007-08-06 Wakaba <wakaba@suika.fam.cx>
1815    
1816     * ContentChecker.pod: New documentation.
1817    
1818     * Makefile: A rule for |ContentChecker.html| is added.
1819    
1820     * ContentChecker.pm: A pod "LICENSE" section is added.
1821    
1822     * NanoDOM.pm ($VERSION): New variable.
1823    
1824 wakaba 1.90 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1825    
1826 wakaba 1.91 * H2H.pm: |b|, |i|, and |sub| are added to the
1827     list of allowed HTML elements.
1828    
1829     2007-08-05 Wakaba <wakaba@suika.fam.cx>
1830    
1831 wakaba 1.90 * H2H.pm: |samp| is added to the list of allowed
1832     HTML elements.
1833    
1834     * URIChecker.pm (check_iri): New.
1835     (check_iri_reference): Error type for IRI reference
1836     syntax error is changed.
1837    
1838 wakaba 1.87 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1839    
1840 wakaba 1.89 * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1841     is added.
1842     (check_document): Load appropriate module before validation.
1843    
1844     2007-08-04 Wakaba <wakaba@suika.fam.cx>
1845    
1846 wakaba 1.88 * ContentChecker/: New directory.
1847    
1848     2007-08-04 Wakaba <wakaba@suika.fam.cx>
1849    
1850 wakaba 1.87 * ContentChecker.pm: HTML |time| element is implemented.
1851    
1852     * HTMLTable.pm: Comments are updated as HTML5 is revised.
1853    
1854 wakaba 1.71 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1855    
1856 wakaba 1.86 * ContentChecker.pm (check_document): Return value
1857     even if no document element is found.
1858    
1859     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1860    
1861 wakaba 1.85 * HTML.pm.src: |$in_body| is no longer a function.
1862    
1863     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1864    
1865 wakaba 1.84 * HTML.pm.src: The |$in_body| code has been moved down.
1866    
1867     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1868    
1869 wakaba 1.83 * HTML.pm.src: The "trailing end" insertion mode
1870     is split into "after html body" and "after html frameset"
1871     insertion modes. Their codes are merged with "after body"
1872     and "after frameset" codes. |$previous_insertion_mode|
1873     has been removed. "after frameset" code is
1874     merged with "in frameset" code.
1875    
1876     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1877    
1878 wakaba 1.82 * HTML.pm.src: The "before head" insertion mode is
1879     merged with the "in head" insertion mode.
1880    
1881     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1882    
1883 wakaba 1.81 * HTML.pm.src: Cases in "in head" insertion mode are
1884     reorganized.
1885    
1886     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1887    
1888 wakaba 1.80 * HTML.pm.src: Some cases in "in table" insertion mode
1889     are merged.
1890    
1891     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1892    
1893 wakaba 1.79 * HTML.pm.src: The "in row" insertion mode is merged
1894     with "in table" insertion mode.
1895    
1896     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1897    
1898 wakaba 1.78 * HTML.pm.src: The "in table" and "in table body" insertion
1899     modes are merged.
1900    
1901     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1902    
1903 wakaba 1.77 * HTML.pm.src: There is no "in table head" or "in table foot"
1904     insertion mode!
1905    
1906     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1907    
1908 wakaba 1.76 * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
1909     now directly invoke the handler.
1910    
1911     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1912    
1913 wakaba 1.75 * HTML.pm.src: Codes for "in cell" insertion mode
1914     is merged to the "in body" insertion mode code.
1915    
1916     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1917    
1918 wakaba 1.74 * HTML.pm.src: Codes for "in body" and "in caption"
1919     insertion modes are merged.
1920    
1921     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1922    
1923 wakaba 1.73 * HTML.pm.src: Two |!!!next-token|s were missing.
1924    
1925     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1926    
1927 wakaba 1.72 * HTML.pm.src: Use numeric constant for |{content_mode}|
1928     instead of string constant for |{content_model_flag}|.
1929    
1930     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1931    
1932 wakaba 1.71 * HTML.pm.src: Add the name of the attribute
1933     to the "duplicate attribute" error.
1934    
1935 wakaba 1.69 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1936    
1937 wakaba 1.70 * ContentChecker.pm: Return the |class| node list.
1938    
1939     2007-07-17 Wakaba <wakaba@suika.fam.cx>
1940    
1941 wakaba 1.69 * ContentChecker.pm: Return the |id| node list.
1942    
1943     * HTML.pm.src: A typo is fixed.
1944    
1945 wakaba 1.63 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1946    
1947 wakaba 1.68 * ContentChecker.pm: Drop wrong |level => 'error'| specification
1948     from "in HTML:xml:lang" error. Character position
1949     is now the last part of the error type in the URI error
1950     description. Report "unsupported" status for language
1951     tags, media queries, script codes, and style sheets.
1952    
1953     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1954    
1955 wakaba 1.67 * ContentChecker.pm: Report error if |xml:lang|
1956     in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
1957     in XML.
1958    
1959     * NanoDOM.pm (Attr.owner_document): New attribute.
1960    
1961     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1962    
1963 wakaba 1.66 * HTML.pm.src: The character immediately following
1964     a bare |hcro| was discarded. Fix handling of
1965     entity references in attribute values.
1966    
1967     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1968    
1969 wakaba 1.65 * HTML.pm.src (main and trailin end phases): Token
1970     types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
1971     factored out. Error types |in html:#DOCTYPE|
1972     and |after html:#DOCTYPE| are merged
1973     into |DOCTYPE in the middle|. |</frameset>|
1974     in fragment parsing mode changed the
1975     insertion mode.
1976    
1977     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1978    
1979 wakaba 1.64 * HTML.pm.src: |$phase| has been removed; The |trailing end|
1980     phase is now an insertion mode. Treatments for white
1981     space character tokens were incorrect for some
1982     insertion modes. An old |meta| case was not removed.
1983    
1984     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1985    
1986 wakaba 1.63 * HTML.pm.src: |meta| charset declaration extraction
1987     implemented (but changing the encoding is not yet:-).
1988    
1989 wakaba 1.60 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1990    
1991 wakaba 1.62 * Charset/: New directory.
1992    
1993     2007-07-15 Wakaba <wakaba@suika.fam.cx>
1994    
1995 wakaba 1.61 * H2H.pm: New Perl module (created from
1996     manakai's H2H.dis).
1997    
1998     2007-07-15 Wakaba <wakaba@suika.fam.cx>
1999    
2000 wakaba 1.60 * XMLSerializer.pm: New Perl module (created from
2001     manakai's SimpleLS.dis).
2002    
2003 wakaba 1.59 2007-07-07 Wakaba <wakaba@suika.fam.cx>
2004    
2005     * HTML.pm.src: |<!---x-->| was not processed correctly.
2006    
2007 wakaba 1.57 2007-07-01 Wakaba <wakaba@suika.fam.cx>
2008    
2009 wakaba 1.58 * HTML.pm.src: Report correct error message
2010     for |<body></div></body>|.
2011    
2012     2007-07-01 Wakaba <wakaba@suika.fam.cx>
2013    
2014 wakaba 1.57 * HTMLTable.pm: An error description was incorrect.
2015    
2016     2007-06-30 Wakaba <wakaba@suika.fam.cx>
2017    
2018     * ContentChecker.pm: Return |{term}| list.
2019    
2020 wakaba 1.55 2007-06-30 Wakaba <wakaba@suika.fam.cx>
2021    
2022 wakaba 1.56 * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
2023     nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
2024     and <title> outside of head).
2025    
2026     2007-06-30 Wakaba <wakaba@suika.fam.cx>
2027    
2028 wakaba 1.55 * IMTChecker.pm: Report warning for unregistered
2029     and private types/subtypes.
2030    
2031     * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
2032     URIChecker.pm, HTMLTable.pm: Error messages are now
2033     consistent; they are all listed in
2034     <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
2035    
2036 wakaba 1.52 2007-06-25 Wakaba <wakaba@suika.fam.cx>
2037    
2038 wakaba 1.54 * ContentChecker.pm: |<img ismap>| not in |<a></a>|
2039     is now erred. |<datalist>| is implemented.
2040     Attribute checker for |<command>| and |<menu>| are
2041     added. Support for |contextmenu| global attribute
2042     is added.
2043    
2044     2007-06-25 Wakaba <wakaba@suika.fam.cx>
2045    
2046 wakaba 1.53 * HTML.pm.src (_reset_insertion_mode): Interpretation
2047     of Step 3 has been changed.
2048    
2049     2007-06-25 Wakaba <wakaba@suika.fam.cx>
2050    
2051 wakaba 1.52 * HTML.pm.src: Late |<html>| parse error is implemented.
2052    
2053 wakaba 1.47 2007-06-24 Wakaba <wakaba@suika.fam.cx>
2054    
2055 wakaba 1.51 * URIChecker.pm (check_iri_reference): A |decode| method name was
2056     incorrect.
2057    
2058     * ContentChecker.pm: Support for the |footer| element.
2059     Check URI syntax for space-separated URI attributes.
2060     Support for the |tabindex| attribute. Support
2061     for |datetime| attribute.
2062    
2063     2007-06-24 Wakaba <wakaba@suika.fam.cx>
2064    
2065 wakaba 1.50 * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
2066     character references). HTML5 revision 1.146 (white space
2067     characters before root start tag). HTML5 revision
2068     1.148 (named character references in attribute values).
2069     HTML5 revision 1.152 (<plaintext>.innerHTML get).
2070    
2071     2007-06-24 Wakaba <wakaba@suika.fam.cx>
2072    
2073     * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
2074 wakaba 1.49 in <head>).
2075    
2076     2007-06-24 Wakaba <wakaba@suika.fam.cx>
2077    
2078 wakaba 1.48 * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>
2079     in body).
2080    
2081     * ContentChecker.pm: HTML5 revision 938 (scoped="").
2082    
2083     2007-06-24 Wakaba <wakaba@suika.fam.cx>
2084    
2085 wakaba 1.47 * HTML.pm.src: HTML5 revision 923 (matching end tag in CDATA
2086     or RCDATA in fragment parsing mode). HTML5 revision
2087     924 (<!--> and <!--->). HTML5 revision 926 (hn in hn).
2088    
2089     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2090    
2091     * HTML.pm.src (get_inner_html): HTML5 revision 922 (inner_html
2092     for <pre> and <textarea>).
2093    
2094 wakaba 1.33 2007-06-23 Wakaba <wakaba@suika.fam.cx>
2095    
2096 wakaba 1.46 * HTML.pm.src: HTML5 revision 920 (<isindex>).
2097    
2098     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2099    
2100 wakaba 1.45 * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).
2101    
2102     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2103    
2104 wakaba 1.44 * HTML.pm.src: HTML5 revision 916 (</body>).
2105     HTML5 revision 917 (conforming bare &).
2106    
2107     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2108    
2109 wakaba 1.43 * NanoDOM.pm (manakai_is_html): Setting to false did not work.
2110    
2111     * HTML.pm.src: HTML5 revision 914 (</ in CDATA, RCDATA).
2112     HTML5 revision 915 (<nobr>).
2113    
2114     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2115    
2116 wakaba 1.42 * HTML.pm.src: HTML5 revisions 908, 909, 912, and 913 (quirks mode).
2117    
2118     * NanoDOM.pm (manakai_is_html, manakai_compat_mode, compat_mode):
2119     New attributes.
2120    
2121     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2122    
2123 wakaba 1.41 * HTML.pm.src: HTML5 revisions 900, 901, 902, and 911 (<
2124     in tags).
2125    
2126     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2127    
2128 wakaba 1.40 * .cvsignore: |Entities.html| is added.
2129    
2130     * HTML.pm.src: |$entity_char| is removed and
2131     requires |Whatpm::_NamedEntityList| instead.
2132     HTML5 revision 898 (refc), except that lack of refc
2133     is parse error.
2134    
2135     * mkentitylist.pl: New script.
2136    
2137     * Makefile (all): |_NamedEntityList.pm| is added.
2138     (_NamedEntityList.pm, Entities.html): New rules.
2139    
2140     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2141    
2142 wakaba 1.39 * HTML.pm.src: Parse errors immediately after U+000D
2143     were ignored and U+000D immediately following another
2144     U+000D was not converted to U+000A.
2145    
2146     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2147    
2148 wakaba 1.38 * HTML.pm.src (set_inner_html): HTML5 revision 892 (adopt
2149     nodes before appended). Parser was not ready for NULL
2150     parse error and escape flag.
2151    
2152     * NanoDOM.pm (adopt_node): New.
2153    
2154     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2155    
2156 wakaba 1.37 * HTML.pm.src: HTML5 revision 886 (insane comment in
2157     CDATA and RCDATA). Note that current implementation
2158     is simply repeating what the spec says and it is maybe not
2159     a best way to do it.
2160    
2161     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2162    
2163 wakaba 1.36 * HTML.pm.src: HTML5 revision 884 (</form> don't close
2164     the form element if a descendant element without implied end tag has
2165     still been open).
2166    
2167     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2168    
2169 wakaba 1.35 * ContentChecker.pm: HTML5 revision 881 (Make |id| attribute with
2170     space characters non-conforming).
2171    
2172     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2173    
2174 wakaba 1.34 * HTML.pm.src: An error message was incorrect.
2175     HTML5 revision 869 (C1 character references).
2176    
2177     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2178    
2179 wakaba 1.33 * HTML.pm.src: HTML5 revision 867 (a LF at the beginning of
2180     a |textarea| is removed).
2181    
2182 wakaba 1.32 2007-06-05 Wakaba <wakaba@suika.fam.cx>
2183    
2184     * NanoDOM.pm (get_attribute_node_ns): New method.
2185    
2186     * ContentChecker.pm: |script| |async| and |defer|
2187     no longer require |src|. |async| MUST NOT be
2188     specified if |defer|. (HTML5 revision 858).
2189    
2190 wakaba 1.31 2007-05-30 Wakaba <wakaba@suika.fam.cx>
2191    
2192     * HTML.pm.src: |<form><form>| went to inifinite loop.
2193    
2194 wakaba 1.27 2007-05-27 Wakaba <wakaba@suika.fam.cx>
2195    
2196 wakaba 1.30 * ContentChecker.pm (html): Set |is_root| (allowed
2197     as a document element) flag on.
2198     (new): Removed.
2199     (check_document): New method.
2200    
2201     2007-05-27 Wakaba <wakaba@suika.fam.cx>
2202    
2203 wakaba 1.29 * ContentChecker.pm (thead, tfoot): Checker specifications
2204     were incorrect.
2205    
2206     2007-05-27 Wakaba <wakaba@suika.fam.cx>
2207    
2208 wakaba 1.28 * ContentChecker.pm ($HTMLURIAttrChecker): Include
2209     error position in the |type| option of the error.
2210    
2211     * HTMLTable.pm (form_table): The |$onerror| parameter
2212     is now optional. Some bugs are fixed.
2213    
2214     2007-05-27 Wakaba <wakaba@suika.fam.cx>
2215    
2216 wakaba 1.27 * HTMLTable.pm: New module.
2217    
2218     * ContentChecker.pm (table): Invoke table model error checker.
2219    
2220     * NanoDOM.pm (first_child, get_attribute_ns): New.
2221    
2222 wakaba 1.25 2007-05-26 Wakaba <wakaba@suika.fam.cx>
2223    
2224 wakaba 1.26 * ContentChecker.pm ($HTMLLinkTypesAttrChecker): New checker.
2225     (link/@rel, a/@rel, area/@rel): Use new checker.
2226    
2227     * Makefile (_LinkTypeList.pm, RelExtensions.html): New rules.
2228    
2229     * _LinkTypeList.pm: New file.
2230    
2231     * mklinktypelist.pl: New file.
2232    
2233     * .cvsignore: |RelExtensions.html| added.
2234    
2235     * NanoDOM.pm (child_nodes): Returns an empty array
2236     for non-child-containing node types.
2237     (text_content): New attribute.
2238    
2239     2007-05-26 Wakaba <wakaba@suika.fam.cx>
2240    
2241 wakaba 1.25 * IMTChecker.pm: New module.
2242    
2243     * ContentChecker.pm ($HTMLIMTAttrChecker): Call IMTChecker
2244     to test parameter value validity.
2245    
2246     * HTML.pm.src ($style_start_tag): Attributes were
2247     discarded.
2248    
2249 wakaba 1.22 2007-05-25 Wakaba <wakaba@suika.fam.cx>
2250    
2251 wakaba 1.24 * ContentChecker.pm ($HTMLURIAttrChecker): Implemented.
2252    
2253     2007-05-25 Wakaba <wakaba@suika.fam.cx>
2254    
2255 wakaba 1.23 * URIChecker.pm: All recommendations from RFC 3986
2256     and RFC 3987 are listed (not all testable items are checked yet).
2257    
2258     2007-05-25 Wakaba <wakaba@suika.fam.cx>
2259    
2260 wakaba 1.22 * URIChecker.pm: New module.
2261    
2262 wakaba 1.18 2007-05-20 Wakaba <wakaba@suika.fam.cx>
2263    
2264 wakaba 1.21 * ContentChecker.pm: Now most attributes are implemented
2265     or associated to some placeholder.
2266     ($ElementDefault): Warn unknown attributes
2267     for unknown elements as "attribute not supported".
2268     ($HTMLLanguageTagAttrChecker, $HTMLMQAttrChecker): New placeholders.
2269     ($HTMLUsemapAttrChecker, $HTMLTargetAttrChecker): New checkers.
2270     (|a| attribute checker): Reimplemented.
2271    
2272     2007-05-20 Wakaba <wakaba@suika.fam.cx>
2273    
2274 wakaba 1.20 * ContentChecker.pm ($HTMLEventHandlerAttrChecker): New placeholder.
2275     ($HTMLAttrChecker): Event handler content attributes
2276     are added.
2277     (link, embed): Required attribute is now checked.
2278     (embed): Unknown local attributes are no longer warned.
2279    
2280     2007-05-20 Wakaba <wakaba@suika.fam.cx>
2281    
2282 wakaba 1.19 * ContentChecker.pm ($HTMLSpaceURIsAttrChecker): New placeholder.
2283     ($HTMLIMTAttrChecker): New checker.
2284     (link@rel, link@href, link@type, style@type,
2285     a@href, a@ping, a@ping, a@type, embed@src, embed@type,
2286     object@data, object@type, source@src, source@type, area@alt,
2287     area@shape, area@coords, area@href,
2288     area@ping, area@rel, area@type, script@src,
2289     script@defer, script@async, script@type): Checkers added.
2290    
2291     2007-05-20 Wakaba <wakaba@suika.fam.cx>
2292    
2293 wakaba 1.18 * ContentChecker.pm: Descendant checking was incorrect.
2294    
2295 wakaba 1.12 2007-05-19 Wakaba <wakaba@suika.fam.cx>
2296    
2297 wakaba 1.17 * ContentChecker.pm: Support |xml:*| and |xmlns:*|
2298     attributes. Report an error if |Element.prefix|
2299     is |xmlns|.
2300    
2301     * NanoDOM.pm (prefix): New attribute.
2302    
2303     2007-05-19 Wakaba <wakaba@suika.fam.cx>
2304    
2305 wakaba 1.16 * HTML.pm.src: In |main| phase, |in body| insertion
2306     mode, action for |<iframe>| was missing.
2307    
2308     2007-05-19 Wakaba <wakaba@suika.fam.cx>
2309    
2310 wakaba 1.15 * ContentChecker.pm: Support for many of HTML5 elements.
2311     ($GetHTMLNonNegativeIntegerAttrChecker): New.
2312    
2313     2007-05-19 Wakaba <wakaba@suika.fam.cx>
2314    
2315 wakaba 1.14 * ContentChecker.pm: Support for most elements up to |progress|.
2316     ($HTMLURIAttrChecker): Placeholder.
2317     ($HTMLIntegerAttrChecker, $GetHTMLFloatingPointNumberAttrChecker):
2318     New.
2319    
2320     2007-05-19 Wakaba <wakaba@suika.fam.cx>
2321    
2322 wakaba 1.13 * ContentChecker.pm: Attribute checkers
2323     for global attributes, |html|, |base|, |style|, and |meta|.
2324    
2325     * NanoDOM.pm (insert_before): Weaken reference
2326     to the parent node.
2327     (Attr::new): Set |owner_element| attribute.
2328     (namespace_uri, manakai_local_name): New attribute implementations.
2329     (owner_element): New attribute.
2330    
2331     2007-05-19 Wakaba <wakaba@suika.fam.cx>
2332    
2333 wakaba 1.12 * ContentChecker.pm ($AttrChecker, $HTMLAttrChecker,
2334     $AnyChecker->{attr_checker}, $HTMLAttrsChecker,
2335     $Element->{$HTML_NS}->{''}): New.
2336     (check_element): Invoke attrs_checker for each element.
2337    
2338 wakaba 1.6 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2339    
2340 wakaba 1.11 * ContentChecker.pm: Don't use |manakai_element_type_match|.
2341    
2342     2007-05-13 Wakaba <wakaba@suika.fam.cx>
2343    
2344 wakaba 1.10 * ContentChecker.pm: Use hashs rather than lists for
2345     element type testings.
2346    
2347     2007-05-13 Wakaba <wakaba@suika.fam.cx>
2348    
2349 wakaba 1.9 * ContentChecker.pm: Don't generate duplicate
2350     error when an element type is put in the "minus" list
2351     and the element type is not allowed explicitly in the particular
2352     element content model.
2353     (html:a checker): New checker.
2354     (html:details, html:datagrid): New checkers.
2355     (html:legend): New checker.
2356    
2357     2007-05-13 Wakaba <wakaba@suika.fam.cx>
2358    
2359 wakaba 1.8 * ContentChecker.pm (html:li checker): Implemented.
2360    
2361     2007-05-13 Wakaba <wakaba@suika.fam.cx>
2362    
2363 wakaba 1.7 * ContentChecker.pm ($HTMLInlineOrStriclyInlineChecker): New
2364     checker.
2365     (html:dd checker): New checker.
2366     (html:q, html:em, html:strong, html:small,
2367     html:m, html:dfn, html:code, html:samp, html:span): New checkers.
2368    
2369     2007-05-13 Wakaba <wakaba@suika.fam.cx>
2370    
2371 wakaba 1.6 * ContentChecker.pm ($AnyChecker): Renamed
2372     from |$ElementDefault->{checker}|.
2373     ($ElementDefault->{checker}): Throw an error that
2374     the element type is not supported by the checker.
2375     ($HTMLMetadataElement): |html:base| was missing.
2376     ($HTMLEmptyChecker): Don't throw an error
2377     for inter-element whitespace nodes.
2378     (html:html checker): Errors were not
2379     thrown even if |html:head| and/or |html:body|
2380     children were missing.
2381     (html:head checker): An error was not
2382     thrown if <meta charset> appered after other
2383     elements.
2384    
2385 wakaba 1.5 2007-05-05 Wakaba <wakaba@suika.fam.cx>
2386    
2387     * ContentChecker.pm: |footer|, |video|, |audio|, |script|,
2388     and |noscript| elements are implemented.
2389     (new): New method.
2390    
2391 wakaba 1.3 2007-05-04 Wakaba <wakaba@suika.fam.cx>
2392    
2393 wakaba 1.4 * ContentChecker.pm: New module.
2394    
2395     2007-05-04 Wakaba <wakaba@suika.fam.cx>
2396    
2397 wakaba 1.3 * NanoDOM.pm (manakai_parent_element,
2398     document_element, manakai_local_name,
2399     manakai_element_type_match): New method.
2400    
2401     2007-05-03 Wakaba <wakaba@suika.fam.cx>
2402    
2403     * HTML.pm.src: Replace decimal and hexadecimal numeric
2404     entities in C1 range using Windows-1252 mapping. Bare LF
2405     did not count as new line for error reporting.
2406    
2407 wakaba 1.2 2007-05-02 Wakaba <wakaba@suika.fam.cx>
2408    
2409     * NanoDOM.pm (DOMImplementation): New class.
2410     (append_child): Weaken the |parent_node| reference.
2411     (create_element_ns, Element new): Set the |owner_document|
2412     reference.
2413     (implementation): New attribute.
2414     (owner_document, local_name, namespace_uri): New attributes.
2415    
2416     * HTML.pm.src (parse_string): Line and column numbers
2417     are now provided to error handler.
2418     (!!!parse-error): Short descriptions are added.
2419     (_construct_tree): Split into three methods; support
2420     for innerHTML mode.
2421     (set_inner_html): New method.
2422    
2423 wakaba 1.1 2007-05-01 Wakaba <wakaba@suika.fam.cx>
2424    
2425     * NanoDOM.html: Documentation is added.
2426    
2427     * HTML.pod, ContentType.html: Documentation is revised.
2428    
2429     * .cvsignore: Pod2html temporary files are added.
2430    
2431     * Makefile: Make |NanoDOM.html|.
2432    
2433     2007-05-01 Wakaba <wakaba@suika.fam.cx>
2434    
2435     * HTML.pm.src (parse_string): New method.
2436     (get_inner_html): Renamed from |inner_html|.
2437    
2438     * Makefile: A rule for |HTML.html| is added.
2439    
2440     * HTML.pod: New documentation.
2441    
2442     2007-05-01 Wakaba <wakaba@suika.fam.cx>
2443    
2444     * NanoDOM.pm (last_child, previous_sibling): New attributes.
2445     (clone_node): Attribute nodes were not completely copied.
2446    
2447     * HTML.pm.src: Many bugs are fixed.
2448    
2449     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2450    
2451     * HTML.pm.src: Some typos are fixed.
2452    
2453     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2454    
2455     * mkhtmlparser.pl, Makefile: References to the |HTML-consume-entity.src|
2456     are removed.
2457    
2458     * HTML.pm.src: Tokenizer's handling on named entities are rewritten.
2459    
2460     * HTML-consume-entity.src: Removed.
2461    
2462     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2463    
2464     * HTML.pm.src: Tokenizer's handling on hexadecimal
2465     numeric entities are rewritten.
2466    
2467     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2468    
2469     * HTML.pm.src: Some tokenizer bugs are fixed.
2470    
2471     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2472    
2473     * HTML.pm.src: The tree construction stage is implemented.
2474    
2475     * mkhtmlparser.pl: New macros are added.
2476    
2477     2007-04-28 Wakaba <wakaba@suika.fam.cx>
2478    
2479     * ContentType.pm: A note on bug in the specification
2480     is removed since it's been now fixed.
2481    
2482     * .cvsignore: New file.
2483    
2484     2007-04-28 Wakaba <wakaba@suika.fam.cx>
2485    
2486     * HTML.pm.src, HTML-consume-entity.src: New files.
2487    
2488     * Makefile (HTML.pm): New rule.
2489    
2490     * mkhtmlparser.pl: New script.
2491    
2492     2007-04-25 Wakaba <wakaba@suika.fam.cx>
2493    
2494     * Makefile: New file.
2495    
2496     2007-04-24 Wakaba <wakaba@suika.fam.cx>
2497    
2498     * ContentType.pm: An error in pod is fixed.
2499    
2500     2007-04-24 Wakaba <wakaba@suika.fam.cx>
2501    
2502     * ContentType.pm: Documentation is added.
2503    
2504     2007-04-24 Wakaba <wakaba@suika.fam.cx>
2505    
2506     * ContentType.pm: New Perl module.
2507    
2508     * ChangeLog: New file.
2509    
2510    

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24