/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Contents of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.399 - (hide annotations) (download)
Sun Sep 6 10:17:38 2009 UTC (15 years, 10 months ago) by wakaba
Branch: MAIN
Changes since 1.398: +5 -0 lines
++ whatpm/t/ChangeLog	6 Sep 2009 10:17:10 -0000
	* tree-test-1.dat, tree-test-2.dat: Fixed some test results
	related to the handling of </html> end tags.

2009-09-06  Wakaba  <wakaba@suika.fam.cx>

++ whatpm/Whatpm/ChangeLog	6 Sep 2009 10:16:48 -0000
	* HTML.pm.src: Changed the handling of </html> end tag of "in
	body" insertion mode according to the latest spec definition.

2009-09-06  Wakaba  <wakaba@suika.fam.cx>

1 wakaba 1.391 2009-09-06 Wakaba <wakaba@suika.fam.cx>
2    
3 wakaba 1.399 * HTML.pm.src: Changed the handling of </html> end tag of "in
4     body" insertion mode according to the latest spec definition.
5    
6     2009-09-06 Wakaba <wakaba@suika.fam.cx>
7    
8 wakaba 1.398 * HTML.pm.src: Rewrote end tag handling in |head| area (cf. HTML5
9     revision 2730, but it was entirely broken, maybe I missed some
10     spec changes before rev.2730).
11    
12     2009-09-06 Wakaba <wakaba@suika.fam.cx>
13    
14 wakaba 1.397 * HTML.pm.src: Added the |hgroup| element (HTML5 revision 3039 and
15     HTML5 revision 3040).
16    
17     2009-09-06 Wakaba <wakaba@suika.fam.cx>
18    
19 wakaba 1.396 * HTML.pm.src: Added |xmp| to the list of Special elements (HTML5
20     revision 3689). It should make no difference since the |xmp|
21     element has the PCDATA content and cannot be the non-bottommost
22     element in the stack of open elements.
23    
24     2009-09-06 Wakaba <wakaba@suika.fam.cx>
25    
26 wakaba 1.395 * HTML.pm.src: Set the |confidence| flag to |irrlevant| when the
27     fragment parsing algorithm is invoked (HTML5 revision 2699).
28    
29     2009-09-06 Wakaba <wakaba@suika.fam.cx>
30    
31 wakaba 1.394 * HTML.pm.src: Simplified foster parenting (HTML5 revision 2732).
32    
33     2009-09-06 Wakaba <wakaba@suika.fam.cx>
34    
35 wakaba 1.393 * HTML.pm.src: No longer does foster parenting |style|, |script|,
36     and |input type=hidden| elements (HTML5 revision 2988).
37    
38     2009-09-06 Wakaba <wakaba@suika.fam.cx>
39    
40 wakaba 1.392 * HTML.pm.src: Dropped the |eventsource| element (HTML5 revision
41     2870).
42    
43     2009-09-06 Wakaba <wakaba@suika.fam.cx>
44    
45 wakaba 1.391 * HTML.pm.src (_tree_construction_main): Treate |keygen| as a void
46     element (HTML5 revision 2960).
47    
48 wakaba 1.389 2009-09-05 Wakaba <wakaba@suika.fam.cx>
49    
50 wakaba 1.390 * HTML.pm.src: Moved the "in table text" insertion mode code up
51     such that it does also work as specced in HTML5 in the remaining
52     cases (HTML5 revision 3382).
53    
54     2009-09-05 Wakaba <wakaba@suika.fam.cx>
55    
56 wakaba 1.389 * HTML.pm.src: Implementing the "in table text" insertion mode,
57     first half (HTML5 revision 3382).
58    
59 wakaba 1.388 2009-08-30 Wakaba <wakaba@suika.fam.cx>
60    
61     * ContentChecker.pm: Allow |xml:lang| in HTML document if the
62     owner element is not an HTML element (HTML5 revision 3697).
63    
64 wakaba 1.387 2009-08-23 Wakaba <wakaba@suika.fam.cx>
65    
66     * ContentChecker.pm: Allow <input type=hidden> in context where
67     interactive content is disallowed (HTML5 revision 2392).
68    
69 wakaba 1.380 2009-08-16 Wakaba <wakaba@suika.fam.cx>
70    
71 wakaba 1.386 * HTML.pm.src: Dropped support for "XSLT-compat" and added
72     "about:legacy-compat" (HTML5 revision 2725).
73    
74     2009-08-16 Wakaba <wakaba@suika.fam.cx>
75    
76 wakaba 1.385 * HTML.pm.src: "<!DOCTYPE>" is now explicitly supported in the
77     spec (HTML5 revision 2700).
78    
79     2009-08-16 Wakaba <wakaba@suika.fam.cx>
80    
81 wakaba 1.384 * HTML.pm.src: Added support for <script> element in the "in
82     select" and "in select in table" insertion modes (HTML5 revision
83     2500).
84    
85     2009-08-16 Wakaba <wakaba@suika.fam.cx>
86    
87 wakaba 1.383 * HTML.pm.src: An <hn> start tag in an <hn> element should close
88     the earlier element (HTML5 revision 2501).
89    
90     2009-08-16 Wakaba <wakaba@suika.fam.cx>
91    
92 wakaba 1.382 * HTML.pm.src: The </br> end tags before the <body> start tag
93     should equally survive as the first br element child of the body
94     element (HTML5 revision 2586).
95    
96     2009-08-16 Wakaba <wakaba@suika.fam.cx>
97    
98 wakaba 1.381 * HTML.pm.src: <font> in SVG land is handled as an SVG element as
99     long as it did not have HTML attributes (HTML5 revision 2590).
100    
101     2009-08-16 Wakaba <wakaba@suika.fam.cx>
102    
103 wakaba 1.380 * HTML.pm.src (_tree_construction_main): Improved error reporting
104     on "select" start tag in "in select" and "in select in table"
105     parsing modes (c.f. HTML5 revision 3604).
106    
107 wakaba 1.378 2009-07-26 Wakaba <wakaba@suika.fam.cx>
108    
109 wakaba 1.379 * mkhtmlclass.pl: Made xmlns:xlink with wrong namespace URL
110     invalid (HTML5 revision 2701).
111    
112     2009-07-26 Wakaba <wakaba@suika.fam.cx>
113    
114 wakaba 1.378 * HTML.pm.src: </rt>, </rp>, </option>, </optgroup> become
115     optional before </body> (HTML5 revision 2806).
116    
117 wakaba 1.376 2009-07-25 Wakaba <wakaba@suika.fam.cx>
118    
119 wakaba 1.377 * HTML.pm.src: Marked where SVG |script| execution should be
120     implemented (HTML5 revision 2904).
121    
122     2009-07-25 Wakaba <wakaba@suika.fam.cx>
123    
124 wakaba 1.376 * HTML.pm.src (set_inner_html): The fragment parsing algorithm
125     inherits the document quirksness flag of the main document (HTML5
126     revision 2935).
127    
128 wakaba 1.375 2009-07-05 Wakaba <wakaba@suika.fam.cx>
129    
130     * HTML.pm.src (_tree_construction_main): Put |table| in |p| in
131     quirks mode (The Hixie Quirk, HTML5 revision 3132).
132    
133 wakaba 1.372 2009-07-03 Wakaba <wakaba@suika.fam.cx>
134    
135 wakaba 1.374 * HTML.pm.src: <kegten> breaks out of <select> (HTML5 revision
136     3163).
137    
138     2009-07-03 Wakaba <wakaba@suika.fam.cx>
139    
140 wakaba 1.373 * HTML.pm.src: Don't change encoding to UTF-8 if the input
141     encoding is UTF-16 (HTML5 revision 3204).
142    
143     2009-07-03 Wakaba <wakaba@suika.fam.cx>
144    
145 wakaba 1.372 * HTML.pm.src (parse_byte_stream): Don't change to another
146     encoding if the input is UTF-16 (HTML5 revision 3205).
147    
148 wakaba 1.371 2009-06-28 Wakaba <wakaba@suika.fam.cx>
149    
150     * HTML.pm.src: |isindex| should no longer generate |p| element
151     node (HTML5 revision 3236).
152    
153 wakaba 1.368 2008-12-12 Wakaba <wakaba@suika.fam.cx>
154    
155 wakaba 1.370 * ContentChecker.pm: Introduced new flag |has_autofocus|. Added
156     support for new |id_type| value of |any|, which means any kind of
157     element is allowed as a referenced element.
158    
159     2008-12-12 Wakaba <wakaba@suika.fam.cx>
160    
161 wakaba 1.369 * ContentChecker.pm: Introduced |check_attrs2| method for the ease
162     of defining codes for checking required attribute and as such.
163    
164     2008-12-12 Wakaba <wakaba@suika.fam.cx>
165    
166 wakaba 1.368 * IMTChecker.pm: Added more definitions for subtypes.
167    
168 wakaba 1.367 2008-12-11 Wakaba <wakaba@suika.fam.cx>
169    
170     * URIChecker.pm: Some of |pos_end| values were wrong.
171    
172 wakaba 1.364 2008-12-06 Wakaba <wakaba@suika.fam.cx>
173    
174 wakaba 1.366 * ContentChecker.pm (check_element): Added support for "no
175     referenced datalist" error.
176    
177     2008-12-06 Wakaba <wakaba@suika.fam.cx>
178    
179 wakaba 1.365 * URIChecker.pm: Bug fix: It did not work unless
180     Message::DOM::DOMImplementation has been |require|d.
181    
182     2008-12-06 Wakaba <wakaba@suika.fam.cx>
183    
184 wakaba 1.364 * NanoDOM.pm (document_uri): New attribute.
185    
186     * ContentChecker.pm: Don't use methods not implemented by NanoDOM.
187    
188 wakaba 1.363 2008-11-07 Wakaba <wakaba@suika.fam.cx>
189    
190     * NanoDOM.pm (text_content): Don't create a Text node if the new
191     value is empty.
192    
193 wakaba 1.362 2008-11-06 Wakaba <wakaba@suika.fam.cx>
194    
195     * SWML/: New directory.
196    
197 wakaba 1.361 2008-10-20 Wakaba <wakaba@suika.fam.cx>
198    
199     * NanoDOM.pm (specified, all_declarations_processed,
200     manakai_attribute_type): New attributes.
201    
202 wakaba 1.359 2008-10-19 Wakaba <wakaba@suika.fam.cx>
203    
204 wakaba 1.360 * NanoDOM.pm (Entity->new): Initialize ->child_nodes as an empty
205     array.
206    
207     2008-10-19 Wakaba <wakaba@suika.fam.cx>
208    
209 wakaba 1.359 * NanoDOM.pm (notation_name): New attribute.
210    
211 wakaba 1.357 2008-10-18 Wakaba <wakaba@suika.fam.cx>
212    
213 wakaba 1.359 * NanoDOM.pm (public_id, system_id): New attributes.a
214 wakaba 1.358
215     2008-10-18 Wakaba <wakaba@suika.fam.cx>
216    
217 wakaba 1.357 * NanoDOM.pm (text_content): Moved to Node from Element. Setter
218     implemented.
219     (allowed_tokens, default_type, declared_type): Implemented.
220    
221 wakaba 1.356 2008-10-17 Wakaba <wakaba@suika.fam.cx>
222    
223     * NanoDOM.pm (node_name): New attribute.
224     (ELEMENT_TYPE_DEFINITION_NODE, ATTRIBUTE_DEFINITION_NODE): New
225     constants.
226     (create_element_type_definition_node, create_attribute_definition,
227     create_notation, create_general_entity,
228     get_element_type_definition_node,
229     set_element_type_definition_node, get_general_entity_node,
230     set_general_entity_node, get_notation_node, set_notation_node,
231     get_attribute_definition_node, set_attribute_definition_node): New
232     methods.
233     (element_types, entities, notations, attribute_definitions): New
234     attributes.
235     (DocumentType): Support for child nodes, entities, notations, and
236     element types.
237     (Entity, Notation, ElementTypeDefinition, AttributeDefinition):
238     New classes.
239    
240     * Dumper.pm: Support for general entities, notations, element type
241     definitions, and attribute definitions.
242    
243 wakaba 1.355 2008-10-15 Wakaba <wakaba@suika.fam.cx>
244    
245     * NanoDOM.pm (create_processing_instruction): New method.
246     (xml_version, xml_encoding, xml_standalone): New attributes.
247     (ProcessingInstruction): New class.
248    
249 wakaba 1.352 2008-10-14 Wakaba <wakaba@suika.fam.cx>
250    
251 wakaba 1.354 * HTML.pm.src: Handling of end tags in the foreign content
252     insertion mode was partially wrong, because of wrong bit
253     operations.
254    
255     2008-10-14 Wakaba <wakaba@suika.fam.cx>
256    
257 wakaba 1.353 * NanoDOM.pm (dom_config): New attribute (do nothing), for
258     Whatpm::XML::Parser support.
259    
260     2008-10-14 Wakaba <wakaba@suika.fam.cx>
261    
262 wakaba 1.352 * Makefile: New rule to make HTML/Tokenizer.pm is added.
263    
264     * HTML.pm.src: Tokenizer part moved to another file.
265    
266 wakaba 1.349 2008-10-13 Wakaba <wakaba@suika.fam.cx>
267    
268 wakaba 1.351 * HTML.pm.src: Merge |DT_EL| and |DD_EL| as |DTDD_EL|.
269    
270     2008-10-13 Wakaba <wakaba@suika.fam.cx>
271    
272 wakaba 1.350 * HTML.pm.src: Element category constants redefined.
273    
274     2008-10-13 Wakaba <wakaba@suika.fam.cx>
275    
276 wakaba 1.349 * HTML.pm.src: Steps for CDATA/RCDATA elements in tree
277     construction stage synced with the spec (HTML5 revisions 2139 and
278     2302).
279    
280 wakaba 1.348 2008-10-07 Wakaba <wakaba@suika.fam.cx>
281    
282     * ContentChecker.pm: New error level "html5_fact" added, which
283     should be tentatively used until all of requirements are properly
284     specced as RFC 2119 "MUST" in HTML5.
285    
286 wakaba 1.346 2008-10-05 Wakaba <wakaba@suika.fam.cx>
287    
288 wakaba 1.347 * ContentChecker.pod: Note on internal flags is added.
289    
290     2008-10-05 Wakaba <wakaba@suika.fam.cx>
291    
292 wakaba 1.346 * HTML.pm.src: An AAA bug fixed.
293    
294 wakaba 1.336 2008-10-04 Wakaba <wakaba@suika.fam.cx>
295    
296 wakaba 1.345 * HTML.pm.src: If another node is inserted by the parser, don't
297     reuse existing Text node to append a character (HTML5 revision
298     2124).
299    
300     2008-10-04 Wakaba <wakaba@suika.fam.cx>
301    
302 wakaba 1.344 * HTML.pm.src: Support for <option> and <optgroup> in body (HTML5
303     revisions 1731 and 2128).
304    
305     2008-10-04 Wakaba <wakaba@suika.fam.cx>
306    
307 wakaba 1.343 * HTML.pm.src: Make <foreignObject> scoping (HTML5 revision 1837).
308     Support for end tags of camelCase SVG elements were broken. A
309     wrong error type text fixed.
310    
311     2008-10-04 Wakaba <wakaba@suika.fam.cx>
312    
313 wakaba 1.342 * HTML.pm.src: Drop redundant code (HTML5 revision 1731).
314    
315     2008-10-04 Wakaba <wakaba@suika.fam.cx>
316    
317 wakaba 1.341 * HTML.pm.src: Support for new definition of |param| and |source|
318     start tag parsing (HTML5 revision 1731).
319    
320     2008-10-04 Wakaba <wakaba@suika.fam.cx>
321    
322 wakaba 1.340 * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
323    
324     2008-10-04 Wakaba <wakaba@suika.fam.cx>
325    
326 wakaba 1.339 * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
327     revisions 1731 and 1831).
328    
329     2008-10-04 Wakaba <wakaba@suika.fam.cx>
330    
331 wakaba 1.338 * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
332     revisions 1731 and 1778). Support for the </sarcasm> end tag (HTML5
333     revision 1731).
334    
335     2008-10-04 Wakaba <wakaba@suika.fam.cx>
336    
337 wakaba 1.337 * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
338     revision 1731). End tags of |option| and |optgroup| elements are
339     now optional (HTML5 revision 1731).
340    
341     2008-10-04 Wakaba <wakaba@suika.fam.cx>
342    
343 wakaba 1.336 * HTML.pm.src: New "special" elements added to the list (HTML5
344     revision 1778). "strile" -> "strike".
345    
346 wakaba 1.335 2008-10-02 Wakaba <wakaba@suika.fam.cx>
347    
348     * ContentType.pm (get_sniffed_type): Support for the "better"
349     content sniffing (HTML5 revision 1927). In a case the official
350     type was not returned when the method is invoked in the list
351     context.
352    
353 wakaba 1.334 2008-09-22 Wakaba <wakaba@suika.fam.cx>
354    
355     * HTML.pm.src: Character references for non-space C0 characters,
356     including U+000B VT, DEL character, noncharacter code points, are
357     now converted to the U+FFFD character (cf. HTML5 revision 2138).
358    
359 wakaba 1.330 2008-09-21 Wakaba <wakaba@suika.fam.cx>
360    
361 wakaba 1.333 * ContentChecker.pm: |form=""| check support added.
362    
363     2008-09-21 Wakaba <wakaba@suika.fam.cx>
364    
365 wakaba 1.332 * ContentChecker.pm: |contextmenu| validness is now checked using
366     |id| and |id_type| properties, and |menu| property is removed.
367    
368     2008-09-21 Wakaba <wakaba@suika.fam.cx>
369    
370 wakaba 1.331 * ContentChecker.pm: Prepare for |form| |name| attribute's
371     duplication checking.
372    
373     2008-09-21 Wakaba <wakaba@suika.fam.cx>
374    
375 wakaba 1.330 * HTML.pm.src (parse_byte_stream): Support (or non-support) for
376     unsupported charset="" parameter value (HTML5 revision 2131).
377    
378 wakaba 1.325 2008-09-20 Wakaba <wakaba@suika.fam.cx>
379    
380 wakaba 1.329 * HTML.pm.src: Reminding places where U+000B is allowed as a space
381     character is fixed (cf. HTML5 revision 1738).
382    
383     * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
384     space characters (HTML5 revision 1738).
385    
386     2008-09-20 Wakaba <wakaba@suika.fam.cx>
387    
388 wakaba 1.328 * HTML.pm.src: The "anything else" case for the "after after body"
389     insertion mode was not updated to swtich to the "in body"
390     insertion mode. U+000B is no longer a space character for the
391     purpose of tree construction phase (HTML5 revision 1738).
392    
393     2008-09-20 Wakaba <wakaba@suika.fam.cx>
394    
395 wakaba 1.327 * HTML.pm.src: U+000B is no longer a space character (HTML5
396     revision 1738).
397    
398     2008-09-20 Wakaba <wakaba@suika.fam.cx>
399    
400 wakaba 1.326 * ContentType.pm: 0x0B is no longer a space character (HTML5
401     revision 1738).
402    
403     * HTML.pm.src: U+000B is no longer a space character for the
404     algorithm for extracting an encoding from a Content-Type (HTML5
405     revision 1738).
406    
407     2008-09-20 Wakaba <wakaba@suika.fam.cx>
408    
409 wakaba 1.325 * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
410    
411 wakaba 1.320 2008-09-18 Wakaba <wakaba@suika.fam.cx>
412    
413 wakaba 1.324 * LangTag.pm: Add checks for remaining requirements from RFC 4646.
414    
415     * mklangreg.pl: Sort 'Prefix' values by their length, to ease
416     matching.
417    
418     2008-09-18 Wakaba <wakaba@suika.fam.cx>
419    
420 wakaba 1.323 * LangTag.pm: Warn for private use language subtags. Error level
421     typos fixed. Support for Suppress-Script field.
422    
423     * mklangreg.pl: Support for dumping of nested structure.
424    
425     2008-09-18 Wakaba <wakaba@suika.fam.cx>
426    
427 wakaba 1.322 * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
428     recommended case as per RFC 4646.
429    
430     2008-09-18 Wakaba <wakaba@suika.fam.cx>
431    
432 wakaba 1.321 * LangTag.pm (check_rfc4646_langtag): New method.
433    
434     2008-09-18 Wakaba <wakaba@suika.fam.cx>
435    
436 wakaba 1.320 * mklangreg.pl: New script.
437    
438     * Makefile: Updated for creation of the module for language subtag
439     registry.
440    
441 wakaba 1.315 2008-09-16 Wakaba <wakaba@suika.fam.cx>
442    
443 wakaba 1.319 * Makefile: WebIDL.html added.
444    
445     * WebIDL.pod: New documentation.
446    
447     2008-09-16 Wakaba <wakaba@suika.fam.cx>
448    
449 wakaba 1.318 * WebIDL.pm: Checker's error types are redefined.
450    
451     2008-09-16 Wakaba <wakaba@suika.fam.cx>
452    
453 wakaba 1.317 * WebIDL.pm: Parser's error types are redefined. Some forward
454     compatible parsing bugs are fixed. Some unreachable codes are
455     commented out.
456    
457     2008-09-16 Wakaba <wakaba@suika.fam.cx>
458    
459 wakaba 1.316 * WebIDL.pm: Support for the reminding extended attributes are
460     added. It does not satisfy the definition that a forward
461     interface declaration has an extended attribute. It seems that
462     unless explicitly allowed multiple extended attributes with the
463     same name is not allowed, though it is not explicitly mentioned in
464     the spec.
465    
466     2008-09-16 Wakaba <wakaba@suika.fam.cx>
467    
468 wakaba 1.315 * WebIDL.pm: Unescapes extended attribute names and extended
469     attribute identifiers. Preserve whether an extended attribute has
470     an argument list of not. Support for extended attributes:
471     Constructor, ExceptionConsts, IndexGetter, IndexSetter,
472     NameGetter, NameSetter, and Null.
473     (has_argument_list): New attribute.
474     (idl_text): Stringifies argument lists, if any, even if it is
475     empty.
476    
477 wakaba 1.310 2008-09-15 Wakaba <wakaba@suika.fam.cx>
478    
479 wakaba 1.314 * HTML.pm.src: New state |PCDATA_STATE|. Use an empty string for
480     |{s_kwd}| in DATA_STATE as default.
481    
482     2008-09-15 Wakaba <wakaba@suika.fam.cx>
483    
484 wakaba 1.313 * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
485     by |{s_kwd}| in DATA_STATE.
486    
487     2008-09-15 Wakaba <wakaba@suika.fam.cx>
488    
489 wakaba 1.312 * HTML.pm.src: Shorten keys.
490    
491     2008-09-15 Wakaba <wakaba@suika.fam.cx>
492    
493 wakaba 1.311 * HTML.pm.src: Remove checking for control character, surrogate
494     pair, or noncharacter code points and non-Unicode code
495     points (they should be handled by Whatpm::Charset::UnicodeChecker).
496     (parse_char_stream): Support for the |$get_wrapper| argument and
497     character stream error handlers.
498    
499     2008-09-15 Wakaba <wakaba@suika.fam.cx>
500    
501 wakaba 1.310 * ContentChecker.pm: Don't call |loda_ns_module|
502     for null-namespace elements/attributes.
503    
504     * HTML.pm.src: Fact out $disallowed_control_chars
505     as a hash.
506    
507 wakaba 1.300 2008-09-14 Wakaba <wakaba@suika.fam.cx>
508    
509 wakaba 1.309 * HTML.pm.src: Regexp typo fixed. |{prev_char}|
510     and |{next_char}| initializations are moved to initialization
511     method. |{read_until}| now supports buffering. Sync |set_inner_html|
512     with |parse_char_stream|.
513    
514     2008-09-14 Wakaba <wakaba@suika.fam.cx>
515    
516 wakaba 1.308 * HTML.pm.src (parse_char_stream): Make |set_next_char|
517     invoke |manakai_read_until|, not only |read|, where
518     possible, to decrease the number of |read| method calls.
519    
520     * mkhtmlparser.pl: Related changes to the aforementioned
521     modification.
522    
523     2008-09-14 Wakaba <wakaba@suika.fam.cx>
524    
525 wakaba 1.307 * HTML.pm.src: Use |read| instead of |getc|. |set_inner_html|
526     would report character error from now.
527    
528     2008-09-14 Wakaba <wakaba@suika.fam.cx>
529    
530 wakaba 1.306 * HTML.pm.src: White-space-leaded non-white-space character
531     tokens in "before head insertion mode" was not
532     correctly handled.
533     (set_inner_html): Reimplemented using CharString decodehandle
534     class. Support for $get_wrapper argument. Support
535     for |{read_until}| feature.
536    
537     2008-09-14 Wakaba <wakaba@suika.fam.cx>
538    
539 wakaba 1.305 * HTML.pm.src: Make a "bare ero" error for unknown
540     entities point the "&" character.
541    
542     2008-09-14 Wakaba <wakaba@suika.fam.cx>
543    
544 wakaba 1.304 * HTML.pm.src: It turns out that U+FFFD don't have to
545     be added to the list of excluded characters.
546    
547     2008-09-14 Wakaba <wakaba@suika.fam.cx>
548    
549 wakaba 1.303 * HTML.pm.src ($char_onerror): Have character decoder's |line|
550     and |column| a higher priority than the one set by the
551     tokenizer's input handler.
552     ($self->{read_until}): Exclude U+FFFD (but this might
553     not be necessary, since now we do line/column fixup in
554     the character decode handle).
555    
556     2008-09-14 Wakaba <wakaba@suika.fam.cx>
557    
558 wakaba 1.302 * HTML.pm.src: Use |{read_until}| where possible.
559    
560     2008-09-14 Wakaba <wakaba@suika.fam.cx>
561    
562 wakaba 1.301 * HTML.pm.src: Change |{getc_until}| to |{read_until}|
563     and |manakai_getc_until| to |manakai_read_until| to
564     reduce the number of string copies.
565    
566     2008-09-14 Wakaba <wakaba@suika.fam.cx>
567    
568 wakaba 1.300 * HTML.pm.src (parse_char_string): Use newly created
569     |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
570     standard feature to |open| a string as a filehandle,
571     since Perl's string filehandle seems not supporting |ungetc|
572     method correctly.
573     (parse_char_stream): Define |{getc_until}| method.
574     (DATA_STATE): Experimental support for |getc_until| feature.
575    
576 wakaba 1.292 2008-09-13 Wakaba <wakaba@suika.fam.cx>
577    
578 wakaba 1.299 * HTML.pm.src: Check points added to newly added branches.
579    
580     2008-09-13 Wakaba <wakaba@suika.fam.cx>
581    
582 wakaba 1.298 * HTML.pm.src: Remove |{char}|, which is no longer used.
583     Remove |{entity_in_attr}| and |{last_attribute_value_state}|
584     and replaced by |{prev_state}|.
585    
586     * mkhtmlparser.pl: Remove |{char}| feature.
587     Remove |!!!back-next-input-character;| macro.
588    
589     2008-09-13 Wakaba <wakaba@suika.fam.cx>
590    
591 wakaba 1.297 * HTML.pm.src: Finally we get rid of all the inner loops. Remove
592     entity related tokenizer states in favor of new states
593     implementing the consume character reference algorithm.
594    
595     2008-09-13 Wakaba <wakaba@suika.fam.cx>
596    
597     * HTML.pm.src: "Consume a character reference" algorithm is
598 wakaba 1.296 now implemented as a tokenizer's state, rather than
599     a method, with minimum changes (more changes will
600     be made, in due course). "Bogus comment state"'s inner
601     loop gets removed.
602    
603     2008-09-13 Wakaba <wakaba@suika.fam.cx>
604    
605 wakaba 1.297 * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
606 wakaba 1.295 into their own tokenizer states.
607    
608     2008-09-13 Wakaba <wakaba@suika.fam.cx>
609    
610 wakaba 1.294 * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
611     is split into three states.
612    
613     2008-09-13 Wakaba <wakaba@suika.fam.cx>
614    
615 wakaba 1.293 * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
616     itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
617     no longer does the tokenizer have to push back next input
618     characters in those states.
619    
620     2008-09-13 Wakaba <wakaba@suika.fam.cx>
621    
622 wakaba 1.292 * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
623     into four states so that no longer does the tokenizer have to push
624     back next input characters in that state.
625    
626 wakaba 1.291 2008-09-11 Wakaba <wakaba@suika.fam.cx>
627    
628     * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
629     which can be used to insert some wrapper between the character
630     stream handle and the tokenizer. (It is currently not supported
631     for |set_inner_html| for |Element|s).
632    
633 wakaba 1.289 2008-09-10 Wakaba <wakaba@suika.fam.cx>
634    
635 wakaba 1.290 * HTML.pm.src: Ignore punctuations in charset names.
636    
637     2008-09-10 Wakaba <wakaba@suika.fam.cx>
638    
639 wakaba 1.289 * ContentChecker.pm: Support for charset-layer error levels.
640    
641     * HTML.pm.src: Don't specify |text| argument for the
642     |chardecode:fallback| error, since it is not the encoding
643     being used alternatively.
644    
645 wakaba 1.288 2008-09-06 Wakaba <wakaba@suika.fam.cx>
646    
647     * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
648    
649 wakaba 1.286 2008-08-31 Wakaba <wakaba@suika.fam.cx>
650    
651 wakaba 1.287 * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
652    
653     2008-08-31 Wakaba <wakaba@suika.fam.cx>
654    
655 wakaba 1.286 * HTML.pm.src: Bug fix and sync with the spec with regard
656     to after after frameset insertion mode processing (HTML5
657     revision 1909). Note that the implementation was wrong
658     per the old spec before the r1909 changes.
659    
660 wakaba 1.278 2008-08-30 Wakaba <wakaba@suika.fam.cx>
661    
662 wakaba 1.285 * HTMLTable.pm: scope=auto algorithm fix synced with the
663     spec (HTML5 revision 2093).
664     ($process_row): Algorithm step numbers synced with the
665     spec (HTML5 revision 2092).
666    
667     2008-08-30 Wakaba <wakaba@suika.fam.cx>
668    
669 wakaba 1.284 * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
670     revision 2094).
671    
672     2008-08-30 Wakaba <wakaba@suika.fam.cx>
673    
674     * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
675    
676     2008-08-30 Wakaba <wakaba@suika.fam.cx>
677    
678 wakaba 1.283 * HTML.pm.src: '"' and "'" at the end of attribute
679     name (after another attribute) now raise parse error (HTML5
680     revision 2123). Empty unquoted attribute values are no
681     longer allowed (HTML5 revision 2122).
682    
683     2008-08-30 Wakaba <wakaba@suika.fam.cx>
684    
685 wakaba 1.282 * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
686     revision 2130).
687    
688     2008-08-30 Wakaba <wakaba@suika.fam.cx>
689    
690 wakaba 1.281 * ContentChecker.pm: |xml:lang| attribute value must be same
691     as |lang| attribute value for HTML elements (HTML5 revision 2062
692     and so on).
693    
694     2008-08-30 Wakaba <wakaba@suika.fam.cx>
695    
696 wakaba 1.280 * ContentChecker.pm: Error level definition for |xml_id_error|
697     was missing.
698    
699     * URIChecker.pm: The end of the URL should be marked as the
700     error location for an empty path error. The position
701     between the userinfo and the port components should be
702     marked as the error location for an empty host error.
703    
704     2008-08-30 Wakaba <wakaba@suika.fam.cx>
705    
706 wakaba 1.279 * URIChecker.pm: Set parameters representing where in the
707     value the error occurs for errors. Report unknown
708     address format error in warning level, since address
709     formats are rarely added. Path segments starting with "/.."
710     were misinterpreted as a dot-segment.
711    
712     2008-08-30 Wakaba <wakaba@suika.fam.cx>
713    
714 wakaba 1.278 * URIChecker.pm (check_iri_reference): Requires
715     |Message::DOM::DOMImplementation|.
716    
717 wakaba 1.277 2008-08-29 Wakaba <wakaba@suika.fam.cx>
718    
719     * IMTChecker.pm: Updated for the new error reporting architecture.
720    
721     * ContentChecker.pm: Error levels for IMTs are added.
722    
723 wakaba 1.276 2008-08-17 Wakaba <wakaba@suika.fam.cx>
724    
725     * H2H.pm (_shift_token): Support for unquoted HTML attribute
726     values.
727    
728 wakaba 1.274 2008-08-16 Wakaba <wakaba@suika.fam.cx>
729    
730 wakaba 1.275 * CacheManifest.pm: Support for new style of error
731     reports.
732    
733     * HTML.pm.src: Set line=1, column=1 to the document node.
734    
735     2008-08-16 Wakaba <wakaba@suika.fam.cx>
736    
737 wakaba 1.274 * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
738     and URL checkers. Support for more error levels for bogus
739     langauge tag and URL "standards".
740    
741     * LangTag.pm, URIChecker.pm: Support for new style error
742     level reporting.
743    
744 wakaba 1.271 2008-08-15 Wakaba <wakaba@suika.fam.cx>
745    
746 wakaba 1.273 * ContentChecker.pm: Support for RDF/XML error levels.
747    
748     * HTMLTable.pm, RDFXML.pm: Support for new style of error level
749     specifying. Error types are revised.
750    
751     2008-08-15 Wakaba <wakaba@suika.fam.cx>
752    
753 wakaba 1.272 * ContentChecker.pm: All error reporting method calls are
754     renewed.
755    
756     2008-08-15 Wakaba <wakaba@suika.fam.cx>
757    
758 wakaba 1.271 * HTML.pm.src: All error type names and "text" parameters
759     are revised. Use new style for "level" specification.
760    
761     * mkhtmlparser.pl: Use new style for "level" specification.
762    
763 wakaba 1.268 2008-08-03 Wakaba <wakaba@suika.fam.cx>
764    
765 wakaba 1.270 * WebIDL.pm (parse_char_string): Simplified error
766     reporting process for broken ignored valuetype definition.
767     (Valuetype idl_text): Support for special "DOMString" name.
768    
769     2008-08-03 Wakaba <wakaba@suika.fam.cx>
770    
771 wakaba 1.269 * WebIDL.pm ($get_scoped_name): Append "::::" if the last
772     terminal of the ScopedName is "DOMString", such that whether
773     the last part of the scoped name is "DOMString" or "_DOMString"
774     later. It is necessary to determine whether a |typedef|
775     definition should be ignored or not.
776     (parse_char_string): Unescape the identifier of
777     exception members.
778     ($resolve): Return undef for builtin types and sequence<T>
779     types (we might not have to do this, however...).
780     (check): Support checking for Exceptions, Valuetypes,
781     and Typedefs.
782     ($serialize_type): Support for "DOMString::::" syntax.
783     (Typedef idl_text): Output Type as "DOMString" if it
784     is really "DOMString" (i.e. its internal representation
785     is "::DOMString::").
786    
787     2008-08-03 Wakaba <wakaba@suika.fam.cx>
788    
789 wakaba 1.268 * WebIDL.pm ($resolve): New code, based on resolve code
790     for constant types in the |check| method.
791     (check): Support for checking of attributes, operations, and
792     arguments.
793     (Attribute/Operation idl_text): Exception names in getraises,
794     setraises, and raises clauses is serizlied by |$serialize_type|
795     code.
796    
797 wakaba 1.264 2008-08-02 Wakaba <wakaba@suika.fam.cx>
798    
799 wakaba 1.267 * WebIDL.pm ($integer): Order of selections are changed to match
800     hexadecimal numbers (the original pattern, taken from the spec,
801     was not work for hexadecimal numbers, because the "0" prefix
802     matches to the [0-7]* part (as an empty string) and therefore
803     it does not match with remaining "x..." part of a "0x..." integer
804     literal.
805     ($get_type): It now returns a string, not an array reference,
806     for regular types and |sequence| types (i.e. it in any case
807     returns a string).
808     ($get_next_token): The second item in the array that represents
809     a integer or float token is now a Perl number value, not the
810     original string representation of the number.
811     (check): Support for const value consistency checking.
812     No extended attribute is defined for constants.
813     (Node subclasses): Use simple strings rather than array references
814     for default data type values.
815     ($serialize_type): Type values are now simple strings.
816     (value): If the new attribute value is a false value, then
817     a FALSE value is set to the attribute.
818    
819     2008-08-02 Wakaba <wakaba@suika.fam.cx>
820    
821 wakaba 1.266 * WebIDL.pm ($get_scoped_name): Now scoped names are stored
822     in its stringified format ("scoped name" as defined in the
823     spec). Note that future version of this module should not use
824     array references for type values and the |type_text| attribute
825     should be made obsolete.
826     (parse_char_string): Unescape attribute names.
827     (check): Support for checking of whether inherited interfaces
828     are actually defined or not. Support for checking of whether
829     interface member identifiers are duplicated or not.
830     ($serialize_type): Scoped names are returned as is. A future
831     version of this code should escape identifiers other than "DOMString",
832     otherwise the idl_text would be non-conforming.
833    
834     2008-08-02 Wakaba <wakaba@suika.fam.cx>
835    
836 wakaba 1.264 * WebIDL.pm (parse_char_string): Set line/column numbers
837 wakaba 1.265 to generated nodes. Unescape identifiers. Extended attributes
838     for Definition's were ignored.
839     (append_child): Set |parent_node| attribute.
840     (parent_node): New attribute.
841     (check): Support interface/exception members. Support
842     extended attributes. Support definition identifier uniqueness
843     constraint.
844     (qualified_name): New attribute.
845     (Interface/Exception idl_text): Extended attributes were
846     not prepended to the returned text.
847    
848     2008-08-02 Wakaba <wakaba@suika.fam.cx>
849    
850     * WebIDL.pm (parse_char_string): Set line/column numbers
851 wakaba 1.264 to interface object experimentally. s/shift/pop/g, shift
852     would make things wrong. Support for interface forward
853     declarations was missing. Broken interface declarations
854     with no block were not ignored entirely.
855     (Whatpm::WebIDL::Node): New abstract class. This class
856     makes things easier.
857     (child_nodes): New attribute. Unlike DOM's attribute with
858     same name, this attribute returns a dead list of nodes for
859     simplicity.
860     (get_user_data, set_user_data): New methods.
861     (Module idl_text): A SPACE character should be inserted
862     before the |{| character.
863     (Interface idl_text): Support for interface forward declarations.
864     (is_forward_declaration): New attribute.
865    
866 wakaba 1.259 2008-07-19 Wakaba <wakaba@suika.fam.cx>
867    
868 wakaba 1.263 * WebIDL.pm (type_text): Better serializer.
869    
870     2008-07-19 Wakaba <wakaba@suika.fam.cx>
871    
872 wakaba 1.262 * WebIDL.pm: Revise forward-compatible parsing so that
873     it now can handle broken extended attributes and as such.
874    
875     2008-07-19 Wakaba <wakaba@suika.fam.cx>
876    
877 wakaba 1.261 * WebIDL.pm: Real support for extended attributes.
878     Support for extended attributes with arguments.
879    
880     2008-07-19 Wakaba <wakaba@suika.fam.cx>
881    
882 wakaba 1.260 * WebIDL.pm: Support for |exception| syntax.
883     (Interface->idl_text): Tentative support for inheritances.
884    
885     2008-07-19 Wakaba <wakaba@suika.fam.cx>
886    
887 wakaba 1.259 * WebIDL.pm: Hierarchical scoped name support was broken.
888     Support for raises, setraises, and getraises syntaxes.
889    
890 wakaba 1.258 2008-07-18 Wakaba <wakaba@suika.fam.cx>
891    
892     * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
893     proper support for types, extended attributes, and exceptions yet).
894     WebIDL parser, version 1 (no support for exceptions yet,
895     no proper support for extended attributes yet).
896    
897 wakaba 1.257 2008-07-09 Wakaba <wakaba@suika.fam.cx>
898    
899     * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
900    
901 wakaba 1.255 2008-06-29 Wakaba <wakaba@suika.fam.cx>
902    
903 wakaba 1.256 * WebIDL.pm: Support for valuetype and const.
904    
905     2008-06-29 Wakaba <wakaba@suika.fam.cx>
906    
907 wakaba 1.255 * WebIDL.pm: New module.
908    
909 wakaba 1.254 2008-06-15 Wakaba <wakaba@suika.fam.cx>
910    
911     * Makefile (Entities.html): URI changed.
912    
913 wakaba 1.253 2008-06-08 Wakaba <wakaba@suika.fam.cx>
914    
915     * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
916    
917 wakaba 1.251 2008-06-01 Wakaba <wakaba@suika.fam.cx>
918    
919 wakaba 1.252 * HTML.pm.src (_get_next_token): A parse error was missing.
920    
921     2008-06-01 Wakaba <wakaba@suika.fam.cx>
922    
923 wakaba 1.251 * mklinktypelist.pl: rel=contact is no longer part of the HTML5
924     spec (commented out). (HTML5 revision 1711).
925    
926 wakaba 1.248 2008-05-25 Wakaba <wakaba@suika.fam.cx>
927    
928 wakaba 1.250 * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
929    
930     * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
931     as UTF-16 (HTML5 revision 1701).
932    
933     2008-05-25 Wakaba <wakaba@suika.fam.cx>
934    
935 wakaba 1.249 * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
936     1692).
937    
938     2008-05-25 Wakaba <wakaba@suika.fam.cx>
939    
940     * HTML.pm.src: The secondary insertion mode used when switching
941     to foreign content is the "in body" insertion mode (HTML5 revision
942     1696).
943    
944     2008-05-25 Wakaba <wakaba@suika.fam.cx>
945    
946 wakaba 1.248 * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
947     revision 1697).
948    
949 wakaba 1.239 2008-05-24 Wakaba <wakaba@suika.fam.cx>
950    
951 wakaba 1.247 * HTML.pm.src: Support for end-of-file token in foreign content
952     insertion mode (HTML5 revision 1693). Update SVG camelCase
953     attribute list (HTML5 revision 1700). <textarea> closes
954     </select> (HTML5 revision 1699). More start tags close in
955     foreign content insertion mode (HTML5 revision 1698).
956    
957     2008-05-24 Wakaba <wakaba@suika.fam.cx>
958    
959 wakaba 1.246 * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
960    
961     2008-05-24 Wakaba <wakaba@suika.fam.cx>
962    
963 wakaba 1.245 * HTML.pm.src: More robust charset parameter detection (HTML5
964     revision 1674).
965    
966     2008-05-24 Wakaba <wakaba@suika.fam.cx>
967    
968 wakaba 1.244 * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
969     revision 1676).
970    
971     2008-05-24 Wakaba <wakaba@suika.fam.cx>
972    
973 wakaba 1.243 * HTML.pm.src: Ignore language part of public identifiers for
974     quriks mode detection (HTML5 revision 1679).
975    
976     2008-05-24 Wakaba <wakaba@suika.fam.cx>
977    
978 wakaba 1.242 * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
979     revision 1685).
980    
981     2008-05-24 Wakaba <wakaba@suika.fam.cx>
982    
983 wakaba 1.241 * HTML.pm.src: Support for EOF in new states for tags (HTML5
984     revision 1684).
985    
986     2008-05-24 Wakaba <wakaba@suika.fam.cx>
987    
988 wakaba 1.240 * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
989     work (HTML5 revision 1690).
990    
991     2008-05-24 Wakaba <wakaba@suika.fam.cx>
992    
993 wakaba 1.239 * HTML.pm.src (_tree_construction_main): Change handling of
994     end tags in head insertion modes (HTML5 revision 1686).
995     (parse_char_string): Bug fix for non-utf8 character string handlings.
996     (parse_char_stream): |ungetc| does not work well for this context.
997    
998 wakaba 1.237 2008-05-18 Wakaba <wakaba@suika.fam.cx>
999    
1000 wakaba 1.238 * HTML.pm.src (parse_byte_string): Redefined to invoke
1001     |parse_byte_stream|.
1002     (parse_byte_stream): New method.
1003    
1004     2008-05-18 Wakaba <wakaba@suika.fam.cx>
1005    
1006 wakaba 1.237 * HTML.pm.src (parse_byte_string): Fix the column number reported
1007     by encoding layer error reporter.
1008    
1009 wakaba 1.233 2008-05-17 Wakaba <wakaba@suika.fam.cx>
1010    
1011 wakaba 1.236 * HTML.pm.src (parse_byte_string): Use streaming decoder
1012     rather than converting the whole byte string and then parsing.
1013     Propagate errors in character encoding layer.
1014     (get_next_token): Precise error reporting for |bare stago| error.
1015    
1016     2008-05-17 Wakaba <wakaba@suika.fam.cx>
1017    
1018 wakaba 1.235 * HTML.pm.src (parse_char_stream): New method.
1019     (parse_char_string): This method is now defined as an invocation
1020     of the |parse_char_stream| method.
1021    
1022     2008-05-17 Wakaba <wakaba@suika.fam.cx>
1023    
1024 wakaba 1.234 * HTML.pm.src (parse_byte_string): Report various status
1025     of the sniffing as info-level errors. Support for new
1026     decoding framework in parser resestting.
1027     (new): Various default error levels were not set.
1028    
1029     2008-05-17 Wakaba <wakaba@suika.fam.cx>
1030    
1031 wakaba 1.233 * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
1032     algorithm, except for the actual sniffing, is implemented
1033     with new framework with Message::Charset::Info.
1034    
1035 wakaba 1.232 2008-05-16 Wakaba <wakaba@suika.fam.cx>
1036    
1037     * CacheManifest.pm (_parse): Drop fragment identifiers from
1038     URIs in fallback section (HTML5 revision 1596).
1039    
1040 wakaba 1.230 2008-05-10 Wakaba <wakaba@suika.fam.cx>
1041    
1042 wakaba 1.231 * Makefile (Entities.html): URI has changed.
1043    
1044     2008-05-10 Wakaba <wakaba@suika.fam.cx>
1045    
1046 wakaba 1.230 * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
1047     1553).
1048    
1049 wakaba 1.228 2008-05-06 Wakaba <wakaba@suika.fam.cx>
1050    
1051 wakaba 1.229 * ContentChecker.pm: Noted that those returned in |table| are
1052     no longer table elements, but table objects returned
1053     by Whatpm::HTMLTable.
1054    
1055     * HTMLTable.pm (form_table): Return table element node
1056     as |$table->{element}|.
1057     (assign_header): Support for the |headers=""| attribute.
1058    
1059     2008-05-06 Wakaba <wakaba@suika.fam.cx>
1060    
1061 wakaba 1.228 * HTMLTable.pm (assign_header): New function; first version
1062     with no support for headers="".
1063     (form_table): Include table width and height to the returned
1064     table object for covenience. Indexing in column assignement was wrong.
1065     Set whether a data cell is empty or not for convenience.
1066    
1067 wakaba 1.222 2008-05-05 Wakaba <wakaba@suika.fam.cx>
1068    
1069 wakaba 1.227 * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
1070    
1071     2008-05-05 Wakaba <wakaba@suika.fam.cx>
1072    
1073 wakaba 1.226 * HTMLTable.pm: How table model errors are detected is
1074     changed (HTML5 revision 1387).
1075    
1076     2008-05-05 Wakaba <wakaba@suika.fam.cx>
1077    
1078 wakaba 1.225 * HTMLTable.pm: The algorithm now moves |tfoot| elements
1079     to the end of the table (HTML5 revision 1380).
1080    
1081     2008-05-05 Wakaba <wakaba@suika.fam.cx>
1082    
1083 wakaba 1.224 * HTMLTable.pm: The algorithm is now 0-based indexing, instead
1084     of 1-based (HTML5 revision 1376).
1085    
1086     2008-05-05 Wakaba <wakaba@suika.fam.cx>
1087    
1088 wakaba 1.223 * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
1089     and "Content-Encoding" no longer prevent sniffing (HTML5
1090     revision 1288).
1091    
1092     2008-05-05 Wakaba <wakaba@suika.fam.cx>
1093    
1094 wakaba 1.222 * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
1095     1282).
1096    
1097 wakaba 1.220 2008-05-03 Wakaba <wakaba@suika.fam.cx>
1098    
1099 wakaba 1.221 * ContentChecker.pm: Support for global attributes.
1100     Status of XML specs are added.
1101    
1102     2008-05-03 Wakaba <wakaba@suika.fam.cx>
1103    
1104 wakaba 1.220 * ContentChecker.pm (check_element): Support for |template=""|
1105     and |ref=""| attribute (referent element type checking).
1106    
1107 wakaba 1.218 2008-04-29 Wakaba <wakaba@suika.fam.cx>
1108    
1109 wakaba 1.219 * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
1110     1500) is implemented (except for IDNA part and URI-scheme-specific
1111 wakaba 1.218 knowledge). Line number counting was wrong for LF-only
1112     documents.
1113    
1114 wakaba 1.211 2008-04-13 Wakaba <wakaba@suika.fam.cx>
1115    
1116 wakaba 1.217 * HTML.pm.src: Raise an parse error for any disallowed
1117     character (HTML5 revision 1263).
1118    
1119     2008-04-13 Wakaba <wakaba@suika.fam.cx>
1120    
1121 wakaba 1.216 * mkentitylist.pl: Support for new HTML5 entity table format (the
1122     definition for |AElig;| was missing).
1123    
1124     2008-04-13 Wakaba <wakaba@suika.fam.cx>
1125    
1126 wakaba 1.215 * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
1127     name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
1128    
1129     2008-04-13 Wakaba <wakaba@suika.fam.cx>
1130    
1131 wakaba 1.214 * HTML.pm.src: List of element names that close foreign content
1132     insertion mode is added (HTML5 revisions 1412 and 1418).
1133    
1134     2008-04-13 Wakaba <wakaba@suika.fam.cx>
1135    
1136 wakaba 1.213 * HTML.pm.src: Support for |mglyph| and |malignmark|
1137     elements (HTML5 revision 1410).
1138    
1139     2008-04-13 Wakaba <wakaba@suika.fam.cx>
1140    
1141 wakaba 1.212 * HTML.pm.src: Support for new long MathML entities (HTML5
1142     revision 1406).
1143    
1144     2008-04-13 Wakaba <wakaba@suika.fam.cx>
1145    
1146 wakaba 1.211 * HTML.pm.src: CDATA section support for MathML and SVG
1147     elements (HTML5 revision 1404 and 1420).
1148    
1149 wakaba 1.209 2008-04-12 Wakaba <wakaba@suika.fam.cx>
1150    
1151 wakaba 1.210 * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
1152     elements (HTML5 revision 1404). Unused !!!macro definitions
1153     are removed.
1154    
1155     2008-04-12 Wakaba <wakaba@suika.fam.cx>
1156    
1157 wakaba 1.209 * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
1158     are raised is changed (HTML5 revision 1404).
1159    
1160 wakaba 1.206 2008-04-06 Wakaba <wakaba@suika.fam.cx>
1161    
1162 wakaba 1.208 * HTML.pm.src: Category sets that are no longer used are removed.
1163    
1164     2008-04-06 Wakaba <wakaba@suika.fam.cx>
1165    
1166 wakaba 1.207 * HTML.pm.src: The ->[1] property of stack entries are now
1167     replaced by constants representing element category.
1168    
1169     2008-04-06 Wakaba <wakaba@suika.fam.cx>
1170    
1171 wakaba 1.206 * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
1172     for error reporting. (This is a preparation for using constant
1173     value for ->[1].)
1174    
1175 wakaba 1.204 2008-03-22 Wakaba <wakaba@suika.fam.cx>
1176    
1177 wakaba 1.205 * RDFXML.pm: Typo fixed.
1178    
1179     2008-03-22 Wakaba <wakaba@suika.fam.cx>
1180    
1181 wakaba 1.204 * ContentChecker.pm: |fact_level| is now treated
1182     as same as |must_level|, i.e. level = |m|.
1183     (check_element): Make list of URIs in the DOM.
1184    
1185 wakaba 1.197 2008-03-21 Wakaba <wakaba@suika.fam.cx>
1186    
1187 wakaba 1.203 * RDFXML.pm: Language accessor implemented.
1188     Local (null-namespace) attribute support.
1189    
1190     2008-03-21 Wakaba <wakaba@suika.fam.cx>
1191    
1192 wakaba 1.202 * RDFXML.pm: Factored out ID checking code.
1193    
1194     2008-03-21 Wakaba <wakaba@suika.fam.cx>
1195    
1196 wakaba 1.201 * RDFXML.pm: TODO items noted. Validation of ID
1197     and URI attributes is implemented. Warn if unknown
1198     value is used in rdf:parseType="" attribute.
1199    
1200     * URIChecker.pm (check_rdf_uri_reference): New function.
1201    
1202     2008-03-21 Wakaba <wakaba@suika.fam.cx>
1203    
1204 wakaba 1.200 * RDFXML.pm: bnodeid implemented. Relative references
1205     are now resolved.
1206    
1207     2008-03-21 Wakaba <wakaba@suika.fam.cx>
1208    
1209 wakaba 1.199 * ContentChecker.pm: RDF reification implemented.
1210    
1211     * RDFXML.pm: undef vs false bug fixed.
1212     Reification implemented.
1213    
1214     2008-03-21 Wakaba <wakaba@suika.fam.cx>
1215    
1216 wakaba 1.198 * RDFXML.pm: s/id/ID/ for attribute name.
1217     The |node| arguments are added for |ontriple| calls.
1218     Too many "attribute not allowed" errors were raised.
1219    
1220     * ContentChecker.pm: Initial experimental support for rdf:RDF
1221     element.
1222    
1223     2008-03-21 Wakaba <wakaba@suika.fam.cx>
1224    
1225 wakaba 1.197 * RDFXML.pm: New module.
1226    
1227 wakaba 1.192 2008-03-20 Wakaba <wakaba@suika.fam.cx>
1228    
1229 wakaba 1.196 * HTML.pm.src (set_inner_html): Line/column number
1230     code was old one yet.
1231    
1232     2008-03-20 Wakaba <wakaba@suika.fam.cx>
1233    
1234 wakaba 1.195 * HTML.pm.src: Better line/column reporting for "duplicate attribute"
1235     errors. Line/column markings for DOCTYPE, comment, and
1236     character tokens are reintroduced; otherwise,
1237     error location for "not HTML5" error and errors
1238     for implied elements are not attached.
1239    
1240     2008-03-20 Wakaba <wakaba@suika.fam.cx>
1241    
1242 wakaba 1.194 * HTML.pm.src: Set line/column numbers to attributes.
1243    
1244     * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
1245     (value): Setter implemented.
1246    
1247     * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
1248    
1249     2008-03-20 Wakaba <wakaba@suika.fam.cx>
1250    
1251 wakaba 1.193 * HTML.pm.src: Unused line/column markings are removed.
1252    
1253     2008-03-20 Wakaba <wakaba@suika.fam.cx>
1254    
1255 wakaba 1.192 * HTML.pm.src (_get_next_token): Remove |first_start_tag|
1256     flag, which is no longer used.
1257    
1258 wakaba 1.190 2008-03-17 Wakaba <wakaba@suika.fam.cx>
1259    
1260 wakaba 1.191 * HTML.pm.src: Set line/column information to element nodes.
1261    
1262     * mkhtmlparser.pl (!!!create-element, !!!insert-element,
1263     and !!!insert-element-t): Set line/column information to
1264     element nodes.
1265    
1266     2008-03-17 Wakaba <wakaba@suika.fam.cx>
1267    
1268 wakaba 1.190 * HTML.pm.src (_get_next_token): The first "<" character
1269     in "<?", "<>", or "</>" should be the error point.
1270    
1271 wakaba 1.187 2008-03-16 Wakaba <wakaba@suika.fam.cx>
1272    
1273 wakaba 1.189 * HTML.pm.src: Some more fixes on error position reporting.
1274    
1275     2008-03-16 Wakaba <wakaba@suika.fam.cx>
1276    
1277 wakaba 1.188 * HTML.pm.src: Token-level precious error reporting.
1278    
1279     2008-03-16 Wakaba <wakaba@suika.fam.cx>
1280    
1281 wakaba 1.187 * HTML.pm.src: Preparation for more precious error point
1282     reporting.
1283    
1284 wakaba 1.184 2008-03-11 Wakaba <wakaba@suika.fam.cx>
1285    
1286 wakaba 1.186 * HTML.pm.src: Error type revised.
1287    
1288     2008-03-11 Wakaba <wakaba@suika.fam.cx>
1289    
1290 wakaba 1.185 * HTML.pm.src: Similar codes are merged together, again.
1291    
1292     2008-03-11 Wakaba <wakaba@suika.fam.cx>
1293    
1294 wakaba 1.184 * HTML.pm.src: Similar codes are merged together.
1295    
1296 wakaba 1.183 2008-03-10 Wakaba <wakaba@suika.fam.cx>
1297    
1298     * mkhtmlparser.pl: Set "level" parameter to parse errors.
1299    
1300     * HTML.pm.src: Code refined.
1301    
1302 wakaba 1.175 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1303    
1304 wakaba 1.182 * HTML.pm.src: |</body>| treatement has been changed (HTML5
1305     revision 1348). Note that I really don't know this makes
1306     any difference in the black-box behavior of the parser.
1307    
1308     2008-03-09 Wakaba <wakaba@suika.fam.cx>
1309    
1310 wakaba 1.181 * HTML.pm.src: New end-of-file token implementation (HTML5
1311     revision 1348).
1312    
1313     2008-03-09 Wakaba <wakaba@suika.fam.cx>
1314    
1315 wakaba 1.180 * HTML.pm.src: |applet| support (HTML5 revision 1347).
1316    
1317     2008-03-09 Wakaba <wakaba@suika.fam.cx>
1318    
1319 wakaba 1.179 * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
1320    
1321     2008-03-09 Wakaba <wakaba@suika.fam.cx>
1322    
1323 wakaba 1.178 * HTML.pm.src: Support for |<input>| in the "in select" insertion
1324     mode and support for the "in select in table" insertion mode (HTML5
1325     revision 1342).
1326    
1327     2008-03-09 Wakaba <wakaba@suika.fam.cx>
1328    
1329 wakaba 1.177 * HTML.pm.src: No foster parenting for <script> and <script>
1330     in non-tainted <table>s (HTML5 revision 1336).
1331    
1332     2008-03-09 Wakaba <wakaba@suika.fam.cx>
1333    
1334 wakaba 1.176 * HTML.pm.src: Ignore white space characters between <html>
1335     and <head> (HTML5 revision 1332).
1336    
1337     2008-03-09 Wakaba <wakaba@suika.fam.cx>
1338    
1339 wakaba 1.175 * HTML.pm.src: Treat <input type=hidden> as if it were a
1340     white space (HTML5 revision 1331).
1341    
1342 wakaba 1.164 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1343    
1344 wakaba 1.174 * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
1345     element (HTML5 revision 1330).
1346    
1347     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1348    
1349 wakaba 1.173 * HTML.pm.src: <title> is always appended to the current
1350     element (HTML5 revision 1328).
1351    
1352     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1353    
1354 wakaba 1.172 * HTML.pm.src: White space in tainted tables are moved
1355     into foster parents (HTML5 revision 1326).
1356    
1357     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1358    
1359 wakaba 1.171 * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
1360     revision 1321).
1361    
1362     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1363    
1364 wakaba 1.170 * HTML.pm.src: |</h/n/>| case code rearranged to align with
1365     the spec (HTML5 revision 1320). Note that we finally complete
1366     all of HTML5 revision 1320 changes.
1367    
1368     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1369    
1370 wakaba 1.169 * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
1371     tags (HTML5 revision 1320).
1372    
1373     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1374    
1375 wakaba 1.168 * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
1376    
1377     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1378    
1379 wakaba 1.167 * HTML.pm.src: A "generate implied end tags" code (t409.1)
1380     could not be reached so that it is now removed (HTML5 revision
1381     1320).
1382    
1383     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1384    
1385 wakaba 1.166 * HTML.pm.src: Code for the case of |</div>| and so on are revised
1386     to align with new spec text (HTML5 revision 1320).
1387    
1388     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1389    
1390 wakaba 1.165 * HTML.pm.src: Remove strange |if| condition; however, it should
1391     have had no harm in theory.
1392    
1393     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1394    
1395 wakaba 1.164 * HTML.pm.src (_tree_construction_main): '</p>' in body
1396     case is split from other end tags for the preparation
1397     of implementing HTML revision 1320.
1398    
1399 wakaba 1.162 2008-03-07 Wakaba <wakaba@suika.fam.cx>
1400    
1401 wakaba 1.163 * HTML.pm.src: Simplified "generate implied end tag" (HTML5
1402     revision 1320).
1403    
1404     2008-03-07 Wakaba <wakaba@suika.fam.cx>
1405    
1406 wakaba 1.162 * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
1407     and "div" (HTML5 revision 1318). Add comments to where
1408     |form| pointer association codes should be inserted (HTML5 revision
1409     1319).
1410    
1411 wakaba 1.161 2008-03-06 Wakaba <wakaba@suika.fam.cx>
1412    
1413     * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
1414    
1415 wakaba 1.159 2008-03-05 Wakaba <wakaba@suika.fam.cx>
1416    
1417 wakaba 1.160 * HTML.pm.src: Since the case t268 should never be reached (no
1418     other token type, there are), it is replaced by a |die| statement.
1419    
1420     2008-03-05 Wakaba <wakaba@suika.fam.cx>
1421    
1422 wakaba 1.159 * HTML.pm.src: Typo fixed.
1423    
1424 wakaba 1.158 2008-03-04 Wakaba <wakaba@suika.fam.cx>
1425    
1426     * HTML.pm.src (_tree_construction_initial): Some limited quirks
1427     doctypes were not uppercased for comparison.
1428    
1429 wakaba 1.152 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1430    
1431 wakaba 1.157 * HTML.pm.src (tree construction and set_inner_html): Checkpoints
1432     are added.
1433    
1434     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1435    
1436 wakaba 1.156 * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
1437     are set. Cases that are unlikely reached are noted as so.
1438    
1439     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1440    
1441 wakaba 1.155 * HTML.pm.src: Checkpoints for debugging are added.
1442    
1443     * mkhtmlparser.pl: Support for |!!!cp| syntax.
1444    
1445     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1446    
1447 wakaba 1.154 * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
1448     for simplicity.
1449    
1450     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1451    
1452 wakaba 1.153 * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
1453     revision 1307).
1454    
1455     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1456    
1457 wakaba 1.152 * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
1458     is set are changed (HTML5 revision 1305).
1459    
1460 wakaba 1.149 2008-03-02 Wakaba <wakaba@suika.fam.cx>
1461    
1462 wakaba 1.151 * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
1463     1292). Entities are not parsed in comment-like part in RCDATA
1464     elements (HTML5 revision 1294). Allow bare & at the end
1465     of attribute value literals (HTML5 revision 1296). More
1466     quirks mode doctypes (HTML5 revision 1302). Requires spaces
1467     between attributes and ban attribute names or unquoted
1468     attribute values containing single or double quotes (HTML5
1469     revision 1303).
1470    
1471     2008-03-02 Wakaba <wakaba@suika.fam.cx>
1472    
1473 wakaba 1.150 * ContentChecker.pm: Typo fixed. Don't raise "character encoding"
1474     and related errors unless it is an HTML document (though the spec
1475     is unclear on whether it is applied to XHTML document).
1476    
1477     * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
1478     are added.
1479    
1480     2008-03-02 Wakaba <wakaba@suika.fam.cx>
1481    
1482 wakaba 1.149 * HTML.pm.src: s/local_name/manakai_local_name/g.
1483    
1484 wakaba 1.148 2008-03-01 Wakaba <wakaba@suika.fam.cx>
1485    
1486     * _NamedEntityList.pm: Updated (HTML5 revision 1286).
1487    
1488     * HTML.pm.src: |charset| in |content| attribute is
1489     case-insensitive (HTML5 revision 1270).
1490    
1491 wakaba 1.147 2008-02-26 Wakaba <wakaba@suika.fam.cx>
1492    
1493     * ContentChecker.pm: New status constants are added.
1494     ($ElementDefault): |status| added.
1495     (check_element): Err for non-standard or deprecated elements.
1496     (_attr_status_info): For non-standard or deprecated attributes.
1497    
1498 wakaba 1.142 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1499    
1500 wakaba 1.146 * ContentChecker.pm (_attr_status_info): New internal method.
1501    
1502     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1503    
1504 wakaba 1.145 * ContentChecker.pm (check_element): Element standardized
1505     status information is now dispatched.
1506    
1507     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1508    
1509 wakaba 1.144 * ContentChecker.pm (check_element): Fix |del|-and-significant
1510     problem by adding some more arguments.
1511    
1512     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1513    
1514 wakaba 1.143 * ContentChecker.pm (check_element): Use context of
1515     container-for-the-purpose-of-content-model element (not transparent
1516     element) for |check_child_element| calling and significant
1517     text flag marking. This reintroduces |<del>|-and-significant
1518     problem again.
1519    
1520     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1521    
1522 wakaba 1.142 * ContentChecker.pm (check_element): Make semi-transparent
1523     elements ignored for the purpose of phase changes in
1524     content model checking.
1525    
1526 wakaba 1.138 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1527    
1528 wakaba 1.141 * ContentChecker.pm (check_element): In-element state
1529     was not properly managed for transparent cases.
1530    
1531     2008-02-23 Wakaba <wakaba@suika.fam.cx>
1532    
1533 wakaba 1.140 * ContentChecker.pm (check_element): Support for |video|
1534     and |audio| as semi-transparent elements.
1535    
1536     2008-02-23 Wakaba <wakaba@suika.fam.cx>
1537    
1538 wakaba 1.139 * ContentChecker.pm ($HTMLSemiTransparentElements): New.
1539     (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
1540     Support for |html:object| as a semi-transparent element.
1541    
1542     2008-02-23 Wakaba <wakaba@suika.fam.cx>
1543    
1544 wakaba 1.138 * ContentChecker.pm (check_element): The way to traverse
1545     the tree is entirely revised to make it easier to track
1546     the state of ancestors/descendants. As a result of this
1547     revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1548     support for content model checking for HTML elements |figure|,
1549     |object|, |video|, and |audio| and checking for XML elements (and
1550     some XMLNS checkings) are dropped for now. They will be
1551     reimplemented in due cource.
1552    
1553 wakaba 1.134 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1554    
1555 wakaba 1.137 * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1556     token (HTML5 revision 1225).
1557    
1558     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1559    
1560 wakaba 1.136 * ContentChecker.pm ({unsupported_level}): New value.
1561    
1562     * HTML.pm.src: Save whether |meta| |content| attribute
1563     contains character references or not.
1564    
1565     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1566    
1567 wakaba 1.135 * ContentChecker.pm (_get_children): (Incompleted) attempt to
1568     imlement significant content checking for contents
1569     with |del| elements.
1570    
1571     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1572    
1573 wakaba 1.134 * ContenteChecker.pm ($HTMLTransparentElements): More
1574     elements are added.
1575     (_get_children): HTML |object| elements are now semi-transparent.
1576    
1577     * NanoDOM.pm (manakai_html, manakai_head): New methods.
1578    
1579 wakaba 1.133 2008-02-16 Wakaba <wakaba@suika.fam.cx>
1580    
1581     * CacheManifest.pm: HTML5 revision 1211 implemented.
1582    
1583     * CacheManifest.pod: Updated.
1584    
1585 wakaba 1.132 2008-02-10 Wakaba <wakaba@suika.fam.cx>
1586    
1587     * ContentChecker.pm (check_document, check_element): Support
1588     for second argument ($onsubdoc).
1589     (_get_css_parser): Removed (now it is part of WDCC).
1590    
1591 wakaba 1.131 2008-02-09 Wakaba <wakaba@suika.fam.cx>
1592    
1593     * ContentChecker.pm (_get_css_parser): New.
1594    
1595 wakaba 1.129 2007-11-25 Wakaba <wakaba@suika.fam.cx>
1596    
1597 wakaba 1.130 * ContentChecker.pm ($AnyChecker): Old way to add child elements
1598     for checking had been used.
1599    
1600     2007-11-25 Wakaba <wakaba@suika.fam.cx>
1601    
1602 wakaba 1.129 * ContentChecker.pm (check_element): New todo item type |descendant|.
1603    
1604 wakaba 1.125 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1605    
1606 wakaba 1.128 * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1607     specced in RFC 2046 and RFC 4288.
1608     (application/atom+xml): Definition added.
1609    
1610     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1611    
1612 wakaba 1.127 * URIChecker.pm: Make RFC 3986 should-level errors
1613     warnings (rather than SHOULD-level errors).
1614    
1615     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1616    
1617 wakaba 1.126 * NanoDOM.pm (get_user_data, set_user_data): New methods.
1618    
1619     * HTML.pm.src: A flag for character references in attribute
1620     values are added. Set |manakai_has_reference| user data
1621     to |charset| attribute.
1622    
1623     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1624    
1625 wakaba 1.125 * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1626     attributes.
1627    
1628     * ContentChecker.pm (check_document): Warn if charset requirements
1629     cannot be tested.
1630    
1631 wakaba 1.124 2007-11-19 Wakaba <wakaba@suika.fam.cx>
1632    
1633     * HTML.pm.src (parse_byte_string): Detect charset
1634     by universalchardet if charset parameter is not specified.
1635    
1636     * Makefile (Charset-all, Charset-clean): New rules.
1637    
1638 wakaba 1.120 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1639    
1640 wakaba 1.123 * ContentChecker.pm (check_document): Check the existence
1641     of character encoding declaration and selection of encoding
1642     for HTML document.
1643    
1644     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1645    
1646 wakaba 1.122 * ContentType.pm (get_sniffed_type): Return also the official
1647     type in list context.
1648    
1649     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1650    
1651 wakaba 1.121 * ContentType.pm: Sniffing with leading white space
1652     ignoring (HTML5 revisions 1013 and 1016).
1653    
1654     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1655    
1656 wakaba 1.120 * ContentType.pm: HTML5 revision 1013 changes, except for leading
1657     white spaces, are implemented.
1658    
1659 wakaba 1.117 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1660    
1661 wakaba 1.119 * HTML.pm.src (parse_char_string): Set |inner_encoding|
1662     attribute if possible.
1663    
1664     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1665    
1666 wakaba 1.118 * HTML.pm.src (parse_byte_string): New method.
1667     (parse_char_string): New alias for |parse_string|.
1668     (main phase): Invoking "change the encoding" algorithm if desired.
1669    
1670     * HTML.pod: Updated.
1671    
1672     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1673    
1674 wakaba 1.117 * HTML.pod (get_inner_html): Removed.
1675    
1676     * Makefile (HTML-all, HTML-clean): New.
1677    
1678     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1679    
1680     * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1681    
1682 wakaba 1.116 2007-11-08 Wakaba <wakaba@suika.fam.cx>
1683    
1684     * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1685    
1686 wakaba 1.112 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1687    
1688 wakaba 1.115 * Makefile: |CacheManifest.html| is added.
1689    
1690     * CacheManifest.pod: New file.
1691    
1692     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1693    
1694 wakaba 1.114 * CacheManifest.pm: New module.
1695    
1696     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1697    
1698 wakaba 1.113 * HTML.pm.src: Support for application cache selection algorithm
1699     callback.
1700    
1701     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1702    
1703 wakaba 1.112 * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1704    
1705 wakaba 1.110 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1706    
1707 wakaba 1.111 * Makefile (clean): New rule.
1708    
1709     * NanoDOM.pm (public_id, system_id): New attributes.
1710    
1711     2007-10-17 Wakaba <wakaba@suika.fam.cx>
1712    
1713 wakaba 1.110 * Makefile (CSS-all, CSS-clean, clean): New rules.
1714    
1715 wakaba 1.109 2007-10-14 Wakaba <wakaba@suika.fam.cx>
1716    
1717     * ContentChecker.pm (check_document): Support for
1718     new |is_xml_root| flag.
1719     (check_element): Support for new |pluses| state.
1720     (_add_pluses): New method.
1721     (_remove_minuses): Support for new |minus| item.
1722    
1723 wakaba 1.106 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1724    
1725 wakaba 1.107 * ContentChecker.pm: Raise specific error for invalid
1726     root element.
1727    
1728     2007-09-24 Wakaba <wakaba@suika.fam.cx>
1729    
1730 wakaba 1.106 * ContentChecker.pm: Set level values for later uses.
1731    
1732 wakaba 1.103 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1733    
1734 wakaba 1.105 * ContentChecker.pm: Support for language tag validation.
1735    
1736     2007-09-09 Wakaba <wakaba@suika.fam.cx>
1737    
1738 wakaba 1.104 * LangTag.pm (check_rfc3066_language_tag): New method.
1739    
1740     2007-09-09 Wakaba <wakaba@suika.fam.cx>
1741    
1742 wakaba 1.103 * LangTag.pm: New module.
1743    
1744 wakaba 1.102 2007-09-04 Wakaba <wakaba@suika.fam.cx>
1745    
1746     * HTML.pm.src: Some error types were wrong.
1747    
1748 wakaba 1.100 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1749    
1750 wakaba 1.101 * CSS/: New directory.
1751    
1752     2007-08-17 Wakaba <wakaba@suika.fam.cx>
1753    
1754 wakaba 1.100 * ContentChecker.pm (_check_get_children): Support
1755     for |noscript| in |head|.
1756    
1757 wakaba 1.99 2007-08-12 Wakaba <wakaba@suika.fam.cx>
1758    
1759     * URI/: New directory.
1760    
1761 wakaba 1.94 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1762    
1763 wakaba 1.98 * HTML.pm.src: Tokenizer's states are now represented in
1764     number.
1765    
1766     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1767    
1768 wakaba 1.97 * HTML.pm.src: |or|s for insertion modes are replaced
1769     by |&|s.
1770    
1771     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1772    
1773 wakaba 1.96 * HTML.pm.src: Token types are now represented in number.
1774    
1775     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1776    
1777 wakaba 1.95 * ContentType.pm (SEE ALSO): Updated.
1778    
1779     * HTML.pm.src: Insertion modes are now represented in number.
1780    
1781     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1782    
1783 wakaba 1.94 * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1784     999) is implemented.
1785    
1786 wakaba 1.93 2007-08-08 Wakaba <wakaba@suika.fam.cx>
1787    
1788     * ContentType.pm: Sniffing for |<script| (HTML5 revision
1789     983) is implemented.
1790    
1791 wakaba 1.92 2007-08-06 Wakaba <wakaba@suika.fam.cx>
1792    
1793     * ContentChecker.pod: New documentation.
1794    
1795     * Makefile: A rule for |ContentChecker.html| is added.
1796    
1797     * ContentChecker.pm: A pod "LICENSE" section is added.
1798    
1799     * NanoDOM.pm ($VERSION): New variable.
1800    
1801 wakaba 1.90 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1802    
1803 wakaba 1.91 * H2H.pm: |b|, |i|, and |sub| are added to the
1804     list of allowed HTML elements.
1805    
1806     2007-08-05 Wakaba <wakaba@suika.fam.cx>
1807    
1808 wakaba 1.90 * H2H.pm: |samp| is added to the list of allowed
1809     HTML elements.
1810    
1811     * URIChecker.pm (check_iri): New.
1812     (check_iri_reference): Error type for IRI reference
1813     syntax error is changed.
1814    
1815 wakaba 1.87 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1816    
1817 wakaba 1.89 * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1818     is added.
1819     (check_document): Load appropriate module before validation.
1820    
1821     2007-08-04 Wakaba <wakaba@suika.fam.cx>
1822    
1823 wakaba 1.88 * ContentChecker/: New directory.
1824    
1825     2007-08-04 Wakaba <wakaba@suika.fam.cx>
1826    
1827 wakaba 1.87 * ContentChecker.pm: HTML |time| element is implemented.
1828    
1829     * HTMLTable.pm: Comments are updated as HTML5 is revised.
1830    
1831 wakaba 1.71 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1832    
1833 wakaba 1.86 * ContentChecker.pm (check_document): Return value
1834     even if no document element is found.
1835    
1836     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1837    
1838 wakaba 1.85 * HTML.pm.src: |$in_body| is no longer a function.
1839    
1840     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1841    
1842 wakaba 1.84 * HTML.pm.src: The |$in_body| code has been moved down.
1843    
1844     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1845    
1846 wakaba 1.83 * HTML.pm.src: The "trailing end" insertion mode
1847     is split into "after html body" and "after html frameset"
1848     insertion modes. Their codes are merged with "after body"
1849     and "after frameset" codes. |$previous_insertion_mode|
1850     has been removed. "after frameset" code is
1851     merged with "in frameset" code.
1852    
1853     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1854    
1855 wakaba 1.82 * HTML.pm.src: The "before head" insertion mode is
1856     merged with the "in head" insertion mode.
1857    
1858     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1859    
1860 wakaba 1.81 * HTML.pm.src: Cases in "in head" insertion mode are
1861     reorganized.
1862    
1863     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1864    
1865 wakaba 1.80 * HTML.pm.src: Some cases in "in table" insertion mode
1866     are merged.
1867    
1868     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1869    
1870 wakaba 1.79 * HTML.pm.src: The "in row" insertion mode is merged
1871     with "in table" insertion mode.
1872    
1873     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1874    
1875 wakaba 1.78 * HTML.pm.src: The "in table" and "in table body" insertion
1876     modes are merged.
1877    
1878     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1879    
1880 wakaba 1.77 * HTML.pm.src: There is no "in table head" or "in table foot"
1881     insertion mode!
1882    
1883     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1884    
1885 wakaba 1.76 * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
1886     now directly invoke the handler.
1887    
1888     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1889    
1890 wakaba 1.75 * HTML.pm.src: Codes for "in cell" insertion mode
1891     is merged to the "in body" insertion mode code.
1892    
1893     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1894    
1895 wakaba 1.74 * HTML.pm.src: Codes for "in body" and "in caption"
1896     insertion modes are merged.
1897    
1898     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1899    
1900 wakaba 1.73 * HTML.pm.src: Two |!!!next-token|s were missing.
1901    
1902     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1903    
1904 wakaba 1.72 * HTML.pm.src: Use numeric constant for |{content_mode}|
1905     instead of string constant for |{content_model_flag}|.
1906    
1907     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1908    
1909 wakaba 1.71 * HTML.pm.src: Add the name of the attribute
1910     to the "duplicate attribute" error.
1911    
1912 wakaba 1.69 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1913    
1914 wakaba 1.70 * ContentChecker.pm: Return the |class| node list.
1915    
1916     2007-07-17 Wakaba <wakaba@suika.fam.cx>
1917    
1918 wakaba 1.69 * ContentChecker.pm: Return the |id| node list.
1919    
1920     * HTML.pm.src: A typo is fixed.
1921    
1922 wakaba 1.63 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1923    
1924 wakaba 1.68 * ContentChecker.pm: Drop wrong |level => 'error'| specification
1925     from "in HTML:xml:lang" error. Character position
1926     is now the last part of the error type in the URI error
1927     description. Report "unsupported" status for language
1928     tags, media queries, script codes, and style sheets.
1929    
1930     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1931    
1932 wakaba 1.67 * ContentChecker.pm: Report error if |xml:lang|
1933     in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
1934     in XML.
1935    
1936     * NanoDOM.pm (Attr.owner_document): New attribute.
1937    
1938     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1939    
1940 wakaba 1.66 * HTML.pm.src: The character immediately following
1941     a bare |hcro| was discarded. Fix handling of
1942     entity references in attribute values.
1943    
1944     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1945    
1946 wakaba 1.65 * HTML.pm.src (main and trailin end phases): Token
1947     types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
1948     factored out. Error types |in html:#DOCTYPE|
1949     and |after html:#DOCTYPE| are merged
1950     into |DOCTYPE in the middle|. |</frameset>|
1951     in fragment parsing mode changed the
1952     insertion mode.
1953    
1954     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1955    
1956 wakaba 1.64 * HTML.pm.src: |$phase| has been removed; The |trailing end|
1957     phase is now an insertion mode. Treatments for white
1958     space character tokens were incorrect for some
1959     insertion modes. An old |meta| case was not removed.
1960    
1961     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1962    
1963 wakaba 1.63 * HTML.pm.src: |meta| charset declaration extraction
1964     implemented (but changing the encoding is not yet:-).
1965    
1966 wakaba 1.60 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1967    
1968 wakaba 1.62 * Charset/: New directory.
1969    
1970     2007-07-15 Wakaba <wakaba@suika.fam.cx>
1971    
1972 wakaba 1.61 * H2H.pm: New Perl module (created from
1973     manakai's H2H.dis).
1974    
1975     2007-07-15 Wakaba <wakaba@suika.fam.cx>
1976    
1977 wakaba 1.60 * XMLSerializer.pm: New Perl module (created from
1978     manakai's SimpleLS.dis).
1979    
1980 wakaba 1.59 2007-07-07 Wakaba <wakaba@suika.fam.cx>
1981    
1982     * HTML.pm.src: |<!---x-->| was not processed correctly.
1983    
1984 wakaba 1.57 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1985    
1986 wakaba 1.58 * HTML.pm.src: Report correct error message
1987     for |<body></div></body>|.
1988    
1989     2007-07-01 Wakaba <wakaba@suika.fam.cx>
1990    
1991 wakaba 1.57 * HTMLTable.pm: An error description was incorrect.
1992    
1993     2007-06-30 Wakaba <wakaba@suika.fam.cx>
1994    
1995     * ContentChecker.pm: Return |{term}| list.
1996    
1997 wakaba 1.55 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1998    
1999 wakaba 1.56 * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
2000     nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
2001     and <title> outside of head).
2002    
2003     2007-06-30 Wakaba <wakaba@suika.fam.cx>
2004    
2005 wakaba 1.55 * IMTChecker.pm: Report warning for unregistered
2006     and private types/subtypes.
2007    
2008     * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
2009     URIChecker.pm, HTMLTable.pm: Error messages are now
2010     consistent; they are all listed in
2011     <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
2012    
2013 wakaba 1.52 2007-06-25 Wakaba <wakaba@suika.fam.cx>
2014    
2015 wakaba 1.54 * ContentChecker.pm: |<img ismap>| not in |<a></a>|
2016     is now erred. |<datalist>| is implemented.
2017     Attribute checker for |<command>| and |<menu>| are
2018     added. Support for |contextmenu| global attribute
2019     is added.
2020    
2021     2007-06-25 Wakaba <wakaba@suika.fam.cx>
2022    
2023 wakaba 1.53 * HTML.pm.src (_reset_insertion_mode): Interpretation
2024     of Step 3 has been changed.
2025    
2026     2007-06-25 Wakaba <wakaba@suika.fam.cx>
2027    
2028 wakaba 1.52 * HTML.pm.src: Late |<html>| parse error is implemented.
2029    
2030 wakaba 1.47 2007-06-24 Wakaba <wakaba@suika.fam.cx>
2031    
2032 wakaba 1.51 * URIChecker.pm (check_iri_reference): A |decode| method name was
2033     incorrect.
2034    
2035     * ContentChecker.pm: Support for the |footer| element.
2036     Check URI syntax for space-separated URI attributes.
2037     Support for the |tabindex| attribute. Support
2038     for |datetime| attribute.
2039    
2040     2007-06-24 Wakaba <wakaba@suika.fam.cx>
2041    
2042 wakaba 1.50 * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
2043     character references). HTML5 revision 1.146 (white space
2044     characters before root start tag). HTML5 revision
2045     1.148 (named character references in attribute values).
2046     HTML5 revision 1.152 (<plaintext>.innerHTML get).
2047    
2048     2007-06-24 Wakaba <wakaba@suika.fam.cx>
2049    
2050     * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
2051 wakaba 1.49 in <head>).
2052    
2053     2007-06-24 Wakaba <wakaba@suika.fam.cx>
2054    
2055 wakaba 1.48 * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>
2056     in body).
2057    
2058     * ContentChecker.pm: HTML5 revision 938 (scoped="").
2059    
2060     2007-06-24 Wakaba <wakaba@suika.fam.cx>
2061    
2062 wakaba 1.47 * HTML.pm.src: HTML5 revision 923 (matching end tag in CDATA
2063     or RCDATA in fragment parsing mode). HTML5 revision
2064     924 (<!--> and <!--->). HTML5 revision 926 (hn in hn).
2065    
2066     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2067    
2068     * HTML.pm.src (get_inner_html): HTML5 revision 922 (inner_html
2069     for <pre> and <textarea>).
2070    
2071 wakaba 1.33 2007-06-23 Wakaba <wakaba@suika.fam.cx>
2072    
2073 wakaba 1.46 * HTML.pm.src: HTML5 revision 920 (<isindex>).
2074    
2075     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2076    
2077 wakaba 1.45 * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).
2078    
2079     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2080    
2081 wakaba 1.44 * HTML.pm.src: HTML5 revision 916 (</body>).
2082     HTML5 revision 917 (conforming bare &).
2083    
2084     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2085    
2086 wakaba 1.43 * NanoDOM.pm (manakai_is_html): Setting to false did not work.
2087    
2088     * HTML.pm.src: HTML5 revision 914 (</ in CDATA, RCDATA).
2089     HTML5 revision 915 (<nobr>).
2090    
2091     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2092    
2093 wakaba 1.42 * HTML.pm.src: HTML5 revisions 908, 909, 912, and 913 (quirks mode).
2094    
2095     * NanoDOM.pm (manakai_is_html, manakai_compat_mode, compat_mode):
2096     New attributes.
2097    
2098     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2099    
2100 wakaba 1.41 * HTML.pm.src: HTML5 revisions 900, 901, 902, and 911 (<
2101     in tags).
2102    
2103     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2104    
2105 wakaba 1.40 * .cvsignore: |Entities.html| is added.
2106    
2107     * HTML.pm.src: |$entity_char| is removed and
2108     requires |Whatpm::_NamedEntityList| instead.
2109     HTML5 revision 898 (refc), except that lack of refc
2110     is parse error.
2111    
2112     * mkentitylist.pl: New script.
2113    
2114     * Makefile (all): |_NamedEntityList.pm| is added.
2115     (_NamedEntityList.pm, Entities.html): New rules.
2116    
2117     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2118    
2119 wakaba 1.39 * HTML.pm.src: Parse errors immediately after U+000D
2120     were ignored and U+000D immediately following another
2121     U+000D was not converted to U+000A.
2122    
2123     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2124    
2125 wakaba 1.38 * HTML.pm.src (set_inner_html): HTML5 revision 892 (adopt
2126     nodes before appended). Parser was not ready for NULL
2127     parse error and escape flag.
2128    
2129     * NanoDOM.pm (adopt_node): New.
2130    
2131     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2132    
2133 wakaba 1.37 * HTML.pm.src: HTML5 revision 886 (insane comment in
2134     CDATA and RCDATA). Note that current implementation
2135     is simply repeating what the spec says and it is maybe not
2136     a best way to do it.
2137    
2138     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2139    
2140 wakaba 1.36 * HTML.pm.src: HTML5 revision 884 (</form> don't close
2141     the form element if a descendant element without implied end tag has
2142     still been open).
2143    
2144     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2145    
2146 wakaba 1.35 * ContentChecker.pm: HTML5 revision 881 (Make |id| attribute with
2147     space characters non-conforming).
2148    
2149     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2150    
2151 wakaba 1.34 * HTML.pm.src: An error message was incorrect.
2152     HTML5 revision 869 (C1 character references).
2153    
2154     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2155    
2156 wakaba 1.33 * HTML.pm.src: HTML5 revision 867 (a LF at the beginning of
2157     a |textarea| is removed).
2158    
2159 wakaba 1.32 2007-06-05 Wakaba <wakaba@suika.fam.cx>
2160    
2161     * NanoDOM.pm (get_attribute_node_ns): New method.
2162    
2163     * ContentChecker.pm: |script| |async| and |defer|
2164     no longer require |src|. |async| MUST NOT be
2165     specified if |defer|. (HTML5 revision 858).
2166    
2167 wakaba 1.31 2007-05-30 Wakaba <wakaba@suika.fam.cx>
2168    
2169     * HTML.pm.src: |<form><form>| went to inifinite loop.
2170    
2171 wakaba 1.27 2007-05-27 Wakaba <wakaba@suika.fam.cx>
2172    
2173 wakaba 1.30 * ContentChecker.pm (html): Set |is_root| (allowed
2174     as a document element) flag on.
2175     (new): Removed.
2176     (check_document): New method.
2177    
2178     2007-05-27 Wakaba <wakaba@suika.fam.cx>
2179    
2180 wakaba 1.29 * ContentChecker.pm (thead, tfoot): Checker specifications
2181     were incorrect.
2182    
2183     2007-05-27 Wakaba <wakaba@suika.fam.cx>
2184    
2185 wakaba 1.28 * ContentChecker.pm ($HTMLURIAttrChecker): Include
2186     error position in the |type| option of the error.
2187    
2188     * HTMLTable.pm (form_table): The |$onerror| parameter
2189     is now optional. Some bugs are fixed.
2190    
2191     2007-05-27 Wakaba <wakaba@suika.fam.cx>
2192    
2193 wakaba 1.27 * HTMLTable.pm: New module.
2194    
2195     * ContentChecker.pm (table): Invoke table model error checker.
2196    
2197     * NanoDOM.pm (first_child, get_attribute_ns): New.
2198    
2199 wakaba 1.25 2007-05-26 Wakaba <wakaba@suika.fam.cx>
2200    
2201 wakaba 1.26 * ContentChecker.pm ($HTMLLinkTypesAttrChecker): New checker.
2202     (link/@rel, a/@rel, area/@rel): Use new checker.
2203    
2204     * Makefile (_LinkTypeList.pm, RelExtensions.html): New rules.
2205    
2206     * _LinkTypeList.pm: New file.
2207    
2208     * mklinktypelist.pl: New file.
2209    
2210     * .cvsignore: |RelExtensions.html| added.
2211    
2212     * NanoDOM.pm (child_nodes): Returns an empty array
2213     for non-child-containing node types.
2214     (text_content): New attribute.
2215    
2216     2007-05-26 Wakaba <wakaba@suika.fam.cx>
2217    
2218 wakaba 1.25 * IMTChecker.pm: New module.
2219    
2220     * ContentChecker.pm ($HTMLIMTAttrChecker): Call IMTChecker
2221     to test parameter value validity.
2222    
2223     * HTML.pm.src ($style_start_tag): Attributes were
2224     discarded.
2225    
2226 wakaba 1.22 2007-05-25 Wakaba <wakaba@suika.fam.cx>
2227    
2228 wakaba 1.24 * ContentChecker.pm ($HTMLURIAttrChecker): Implemented.
2229    
2230     2007-05-25 Wakaba <wakaba@suika.fam.cx>
2231    
2232 wakaba 1.23 * URIChecker.pm: All recommendations from RFC 3986
2233     and RFC 3987 are listed (not all testable items are checked yet).
2234    
2235     2007-05-25 Wakaba <wakaba@suika.fam.cx>
2236    
2237 wakaba 1.22 * URIChecker.pm: New module.
2238    
2239 wakaba 1.18 2007-05-20 Wakaba <wakaba@suika.fam.cx>
2240    
2241 wakaba 1.21 * ContentChecker.pm: Now most attributes are implemented
2242     or associated to some placeholder.
2243     ($ElementDefault): Warn unknown attributes
2244     for unknown elements as "attribute not supported".
2245     ($HTMLLanguageTagAttrChecker, $HTMLMQAttrChecker): New placeholders.
2246     ($HTMLUsemapAttrChecker, $HTMLTargetAttrChecker): New checkers.
2247     (|a| attribute checker): Reimplemented.
2248    
2249     2007-05-20 Wakaba <wakaba@suika.fam.cx>
2250    
2251 wakaba 1.20 * ContentChecker.pm ($HTMLEventHandlerAttrChecker): New placeholder.
2252     ($HTMLAttrChecker): Event handler content attributes
2253     are added.
2254     (link, embed): Required attribute is now checked.
2255     (embed): Unknown local attributes are no longer warned.
2256    
2257     2007-05-20 Wakaba <wakaba@suika.fam.cx>
2258    
2259 wakaba 1.19 * ContentChecker.pm ($HTMLSpaceURIsAttrChecker): New placeholder.
2260     ($HTMLIMTAttrChecker): New checker.
2261     (link@rel, link@href, link@type, style@type,
2262     a@href, a@ping, a@ping, a@type, embed@src, embed@type,
2263     object@data, object@type, source@src, source@type, area@alt,
2264     area@shape, area@coords, area@href,
2265     area@ping, area@rel, area@type, script@src,
2266     script@defer, script@async, script@type): Checkers added.
2267    
2268     2007-05-20 Wakaba <wakaba@suika.fam.cx>
2269    
2270 wakaba 1.18 * ContentChecker.pm: Descendant checking was incorrect.
2271    
2272 wakaba 1.12 2007-05-19 Wakaba <wakaba@suika.fam.cx>
2273    
2274 wakaba 1.17 * ContentChecker.pm: Support |xml:*| and |xmlns:*|
2275     attributes. Report an error if |Element.prefix|
2276     is |xmlns|.
2277    
2278     * NanoDOM.pm (prefix): New attribute.
2279    
2280     2007-05-19 Wakaba <wakaba@suika.fam.cx>
2281    
2282 wakaba 1.16 * HTML.pm.src: In |main| phase, |in body| insertion
2283     mode, action for |<iframe>| was missing.
2284    
2285     2007-05-19 Wakaba <wakaba@suika.fam.cx>
2286    
2287 wakaba 1.15 * ContentChecker.pm: Support for many of HTML5 elements.
2288     ($GetHTMLNonNegativeIntegerAttrChecker): New.
2289    
2290     2007-05-19 Wakaba <wakaba@suika.fam.cx>
2291    
2292 wakaba 1.14 * ContentChecker.pm: Support for most elements up to |progress|.
2293     ($HTMLURIAttrChecker): Placeholder.
2294     ($HTMLIntegerAttrChecker, $GetHTMLFloatingPointNumberAttrChecker):
2295     New.
2296    
2297     2007-05-19 Wakaba <wakaba@suika.fam.cx>
2298    
2299 wakaba 1.13 * ContentChecker.pm: Attribute checkers
2300     for global attributes, |html|, |base|, |style|, and |meta|.
2301    
2302     * NanoDOM.pm (insert_before): Weaken reference
2303     to the parent node.
2304     (Attr::new): Set |owner_element| attribute.
2305     (namespace_uri, manakai_local_name): New attribute implementations.
2306     (owner_element): New attribute.
2307    
2308     2007-05-19 Wakaba <wakaba@suika.fam.cx>
2309    
2310 wakaba 1.12 * ContentChecker.pm ($AttrChecker, $HTMLAttrChecker,
2311     $AnyChecker->{attr_checker}, $HTMLAttrsChecker,
2312     $Element->{$HTML_NS}->{''}): New.
2313     (check_element): Invoke attrs_checker for each element.
2314    
2315 wakaba 1.6 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2316    
2317 wakaba 1.11 * ContentChecker.pm: Don't use |manakai_element_type_match|.
2318    
2319     2007-05-13 Wakaba <wakaba@suika.fam.cx>
2320    
2321 wakaba 1.10 * ContentChecker.pm: Use hashs rather than lists for
2322     element type testings.
2323    
2324     2007-05-13 Wakaba <wakaba@suika.fam.cx>
2325    
2326 wakaba 1.9 * ContentChecker.pm: Don't generate duplicate
2327     error when an element type is put in the "minus" list
2328     and the element type is not allowed explicitly in the particular
2329     element content model.
2330     (html:a checker): New checker.
2331     (html:details, html:datagrid): New checkers.
2332     (html:legend): New checker.
2333    
2334     2007-05-13 Wakaba <wakaba@suika.fam.cx>
2335    
2336 wakaba 1.8 * ContentChecker.pm (html:li checker): Implemented.
2337    
2338     2007-05-13 Wakaba <wakaba@suika.fam.cx>
2339    
2340 wakaba 1.7 * ContentChecker.pm ($HTMLInlineOrStriclyInlineChecker): New
2341     checker.
2342     (html:dd checker): New checker.
2343     (html:q, html:em, html:strong, html:small,
2344     html:m, html:dfn, html:code, html:samp, html:span): New checkers.
2345    
2346     2007-05-13 Wakaba <wakaba@suika.fam.cx>
2347    
2348 wakaba 1.6 * ContentChecker.pm ($AnyChecker): Renamed
2349     from |$ElementDefault->{checker}|.
2350     ($ElementDefault->{checker}): Throw an error that
2351     the element type is not supported by the checker.
2352     ($HTMLMetadataElement): |html:base| was missing.
2353     ($HTMLEmptyChecker): Don't throw an error
2354     for inter-element whitespace nodes.
2355     (html:html checker): Errors were not
2356     thrown even if |html:head| and/or |html:body|
2357     children were missing.
2358     (html:head checker): An error was not
2359     thrown if <meta charset> appered after other
2360     elements.
2361    
2362 wakaba 1.5 2007-05-05 Wakaba <wakaba@suika.fam.cx>
2363    
2364     * ContentChecker.pm: |footer|, |video|, |audio|, |script|,
2365     and |noscript| elements are implemented.
2366     (new): New method.
2367    
2368 wakaba 1.3 2007-05-04 Wakaba <wakaba@suika.fam.cx>
2369    
2370 wakaba 1.4 * ContentChecker.pm: New module.
2371    
2372     2007-05-04 Wakaba <wakaba@suika.fam.cx>
2373    
2374 wakaba 1.3 * NanoDOM.pm (manakai_parent_element,
2375     document_element, manakai_local_name,
2376     manakai_element_type_match): New method.
2377    
2378     2007-05-03 Wakaba <wakaba@suika.fam.cx>
2379    
2380     * HTML.pm.src: Replace decimal and hexadecimal numeric
2381     entities in C1 range using Windows-1252 mapping. Bare LF
2382     did not count as new line for error reporting.
2383    
2384 wakaba 1.2 2007-05-02 Wakaba <wakaba@suika.fam.cx>
2385    
2386     * NanoDOM.pm (DOMImplementation): New class.
2387     (append_child): Weaken the |parent_node| reference.
2388     (create_element_ns, Element new): Set the |owner_document|
2389     reference.
2390     (implementation): New attribute.
2391     (owner_document, local_name, namespace_uri): New attributes.
2392    
2393     * HTML.pm.src (parse_string): Line and column numbers
2394     are now provided to error handler.
2395     (!!!parse-error): Short descriptions are added.
2396     (_construct_tree): Split into three methods; support
2397     for innerHTML mode.
2398     (set_inner_html): New method.
2399    
2400 wakaba 1.1 2007-05-01 Wakaba <wakaba@suika.fam.cx>
2401    
2402     * NanoDOM.html: Documentation is added.
2403    
2404     * HTML.pod, ContentType.html: Documentation is revised.
2405    
2406     * .cvsignore: Pod2html temporary files are added.
2407    
2408     * Makefile: Make |NanoDOM.html|.
2409    
2410     2007-05-01 Wakaba <wakaba@suika.fam.cx>
2411    
2412     * HTML.pm.src (parse_string): New method.
2413     (get_inner_html): Renamed from |inner_html|.
2414    
2415     * Makefile: A rule for |HTML.html| is added.
2416    
2417     * HTML.pod: New documentation.
2418    
2419     2007-05-01 Wakaba <wakaba@suika.fam.cx>
2420    
2421     * NanoDOM.pm (last_child, previous_sibling): New attributes.
2422     (clone_node): Attribute nodes were not completely copied.
2423    
2424     * HTML.pm.src: Many bugs are fixed.
2425    
2426     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2427    
2428     * HTML.pm.src: Some typos are fixed.
2429    
2430     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2431    
2432     * mkhtmlparser.pl, Makefile: References to the |HTML-consume-entity.src|
2433     are removed.
2434    
2435     * HTML.pm.src: Tokenizer's handling on named entities are rewritten.
2436    
2437     * HTML-consume-entity.src: Removed.
2438    
2439     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2440    
2441     * HTML.pm.src: Tokenizer's handling on hexadecimal
2442     numeric entities are rewritten.
2443    
2444     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2445    
2446     * HTML.pm.src: Some tokenizer bugs are fixed.
2447    
2448     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2449    
2450     * HTML.pm.src: The tree construction stage is implemented.
2451    
2452     * mkhtmlparser.pl: New macros are added.
2453    
2454     2007-04-28 Wakaba <wakaba@suika.fam.cx>
2455    
2456     * ContentType.pm: A note on bug in the specification
2457     is removed since it's been now fixed.
2458    
2459     * .cvsignore: New file.
2460    
2461     2007-04-28 Wakaba <wakaba@suika.fam.cx>
2462    
2463     * HTML.pm.src, HTML-consume-entity.src: New files.
2464    
2465     * Makefile (HTML.pm): New rule.
2466    
2467     * mkhtmlparser.pl: New script.
2468    
2469     2007-04-25 Wakaba <wakaba@suika.fam.cx>
2470    
2471     * Makefile: New file.
2472    
2473     2007-04-24 Wakaba <wakaba@suika.fam.cx>
2474    
2475     * ContentType.pm: An error in pod is fixed.
2476    
2477     2007-04-24 Wakaba <wakaba@suika.fam.cx>
2478    
2479     * ContentType.pm: Documentation is added.
2480    
2481     2007-04-24 Wakaba <wakaba@suika.fam.cx>
2482    
2483     * ContentType.pm: New Perl module.
2484    
2485     * ChangeLog: New file.
2486    
2487    

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24