/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Contents of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.402 - (hide annotations) (download)
Sun Sep 6 13:02:20 2009 UTC (15 years, 10 months ago) by wakaba
Branch: MAIN
Changes since 1.401: +4 -0 lines
++ whatpm/t/ChangeLog	6 Sep 2009 13:01:39 -0000
	* tree-test-frames.dat: Added more tests.

2009-09-06  Wakaba  <wakaba@suika.fam.cx>

++ whatpm/Whatpm/ChangeLog	6 Sep 2009 13:01:57 -0000
	* HTML.pm.src: Fixed typos on |frameset-ok| implementation.

2009-09-06  Wakaba  <wakaba@suika.fam.cx>

1 wakaba 1.391 2009-09-06 Wakaba <wakaba@suika.fam.cx>
2    
3 wakaba 1.402 * HTML.pm.src: Fixed typos on |frameset-ok| implementation.
4    
5     2009-09-06 Wakaba <wakaba@suika.fam.cx>
6    
7 wakaba 1.401 * HTML.pm.src: The first cut to implement the |frameset-ok| flag
8     concept (HTML5 revision 2703).
9    
10     2009-09-06 Wakaba <wakaba@suika.fam.cx>
11    
12 wakaba 1.400 * HTML.pm.src: |<hr/>| was incorrectly marked as an error.
13    
14     2009-09-06 Wakaba <wakaba@suika.fam.cx>
15    
16 wakaba 1.399 * HTML.pm.src: Changed the handling of </html> end tag of "in
17     body" insertion mode according to the latest spec definition.
18    
19     2009-09-06 Wakaba <wakaba@suika.fam.cx>
20    
21 wakaba 1.398 * HTML.pm.src: Rewrote end tag handling in |head| area (cf. HTML5
22     revision 2730, but it was entirely broken, maybe I missed some
23     spec changes before rev.2730).
24    
25     2009-09-06 Wakaba <wakaba@suika.fam.cx>
26    
27 wakaba 1.397 * HTML.pm.src: Added the |hgroup| element (HTML5 revision 3039 and
28     HTML5 revision 3040).
29    
30     2009-09-06 Wakaba <wakaba@suika.fam.cx>
31    
32 wakaba 1.396 * HTML.pm.src: Added |xmp| to the list of Special elements (HTML5
33     revision 3689). It should make no difference since the |xmp|
34     element has the PCDATA content and cannot be the non-bottommost
35     element in the stack of open elements.
36    
37     2009-09-06 Wakaba <wakaba@suika.fam.cx>
38    
39 wakaba 1.395 * HTML.pm.src: Set the |confidence| flag to |irrlevant| when the
40     fragment parsing algorithm is invoked (HTML5 revision 2699).
41    
42     2009-09-06 Wakaba <wakaba@suika.fam.cx>
43    
44 wakaba 1.394 * HTML.pm.src: Simplified foster parenting (HTML5 revision 2732).
45    
46     2009-09-06 Wakaba <wakaba@suika.fam.cx>
47    
48 wakaba 1.393 * HTML.pm.src: No longer does foster parenting |style|, |script|,
49     and |input type=hidden| elements (HTML5 revision 2988).
50    
51     2009-09-06 Wakaba <wakaba@suika.fam.cx>
52    
53 wakaba 1.392 * HTML.pm.src: Dropped the |eventsource| element (HTML5 revision
54     2870).
55    
56     2009-09-06 Wakaba <wakaba@suika.fam.cx>
57    
58 wakaba 1.391 * HTML.pm.src (_tree_construction_main): Treate |keygen| as a void
59     element (HTML5 revision 2960).
60    
61 wakaba 1.389 2009-09-05 Wakaba <wakaba@suika.fam.cx>
62    
63 wakaba 1.390 * HTML.pm.src: Moved the "in table text" insertion mode code up
64     such that it does also work as specced in HTML5 in the remaining
65     cases (HTML5 revision 3382).
66    
67     2009-09-05 Wakaba <wakaba@suika.fam.cx>
68    
69 wakaba 1.389 * HTML.pm.src: Implementing the "in table text" insertion mode,
70     first half (HTML5 revision 3382).
71    
72 wakaba 1.388 2009-08-30 Wakaba <wakaba@suika.fam.cx>
73    
74     * ContentChecker.pm: Allow |xml:lang| in HTML document if the
75     owner element is not an HTML element (HTML5 revision 3697).
76    
77 wakaba 1.387 2009-08-23 Wakaba <wakaba@suika.fam.cx>
78    
79     * ContentChecker.pm: Allow <input type=hidden> in context where
80     interactive content is disallowed (HTML5 revision 2392).
81    
82 wakaba 1.380 2009-08-16 Wakaba <wakaba@suika.fam.cx>
83    
84 wakaba 1.386 * HTML.pm.src: Dropped support for "XSLT-compat" and added
85     "about:legacy-compat" (HTML5 revision 2725).
86    
87     2009-08-16 Wakaba <wakaba@suika.fam.cx>
88    
89 wakaba 1.385 * HTML.pm.src: "<!DOCTYPE>" is now explicitly supported in the
90     spec (HTML5 revision 2700).
91    
92     2009-08-16 Wakaba <wakaba@suika.fam.cx>
93    
94 wakaba 1.384 * HTML.pm.src: Added support for <script> element in the "in
95     select" and "in select in table" insertion modes (HTML5 revision
96     2500).
97    
98     2009-08-16 Wakaba <wakaba@suika.fam.cx>
99    
100 wakaba 1.383 * HTML.pm.src: An <hn> start tag in an <hn> element should close
101     the earlier element (HTML5 revision 2501).
102    
103     2009-08-16 Wakaba <wakaba@suika.fam.cx>
104    
105 wakaba 1.382 * HTML.pm.src: The </br> end tags before the <body> start tag
106     should equally survive as the first br element child of the body
107     element (HTML5 revision 2586).
108    
109     2009-08-16 Wakaba <wakaba@suika.fam.cx>
110    
111 wakaba 1.381 * HTML.pm.src: <font> in SVG land is handled as an SVG element as
112     long as it did not have HTML attributes (HTML5 revision 2590).
113    
114     2009-08-16 Wakaba <wakaba@suika.fam.cx>
115    
116 wakaba 1.380 * HTML.pm.src (_tree_construction_main): Improved error reporting
117     on "select" start tag in "in select" and "in select in table"
118     parsing modes (c.f. HTML5 revision 3604).
119    
120 wakaba 1.378 2009-07-26 Wakaba <wakaba@suika.fam.cx>
121    
122 wakaba 1.379 * mkhtmlclass.pl: Made xmlns:xlink with wrong namespace URL
123     invalid (HTML5 revision 2701).
124    
125     2009-07-26 Wakaba <wakaba@suika.fam.cx>
126    
127 wakaba 1.378 * HTML.pm.src: </rt>, </rp>, </option>, </optgroup> become
128     optional before </body> (HTML5 revision 2806).
129    
130 wakaba 1.376 2009-07-25 Wakaba <wakaba@suika.fam.cx>
131    
132 wakaba 1.377 * HTML.pm.src: Marked where SVG |script| execution should be
133     implemented (HTML5 revision 2904).
134    
135     2009-07-25 Wakaba <wakaba@suika.fam.cx>
136    
137 wakaba 1.376 * HTML.pm.src (set_inner_html): The fragment parsing algorithm
138     inherits the document quirksness flag of the main document (HTML5
139     revision 2935).
140    
141 wakaba 1.375 2009-07-05 Wakaba <wakaba@suika.fam.cx>
142    
143     * HTML.pm.src (_tree_construction_main): Put |table| in |p| in
144     quirks mode (The Hixie Quirk, HTML5 revision 3132).
145    
146 wakaba 1.372 2009-07-03 Wakaba <wakaba@suika.fam.cx>
147    
148 wakaba 1.374 * HTML.pm.src: <kegten> breaks out of <select> (HTML5 revision
149     3163).
150    
151     2009-07-03 Wakaba <wakaba@suika.fam.cx>
152    
153 wakaba 1.373 * HTML.pm.src: Don't change encoding to UTF-8 if the input
154     encoding is UTF-16 (HTML5 revision 3204).
155    
156     2009-07-03 Wakaba <wakaba@suika.fam.cx>
157    
158 wakaba 1.372 * HTML.pm.src (parse_byte_stream): Don't change to another
159     encoding if the input is UTF-16 (HTML5 revision 3205).
160    
161 wakaba 1.371 2009-06-28 Wakaba <wakaba@suika.fam.cx>
162    
163     * HTML.pm.src: |isindex| should no longer generate |p| element
164     node (HTML5 revision 3236).
165    
166 wakaba 1.368 2008-12-12 Wakaba <wakaba@suika.fam.cx>
167    
168 wakaba 1.370 * ContentChecker.pm: Introduced new flag |has_autofocus|. Added
169     support for new |id_type| value of |any|, which means any kind of
170     element is allowed as a referenced element.
171    
172     2008-12-12 Wakaba <wakaba@suika.fam.cx>
173    
174 wakaba 1.369 * ContentChecker.pm: Introduced |check_attrs2| method for the ease
175     of defining codes for checking required attribute and as such.
176    
177     2008-12-12 Wakaba <wakaba@suika.fam.cx>
178    
179 wakaba 1.368 * IMTChecker.pm: Added more definitions for subtypes.
180    
181 wakaba 1.367 2008-12-11 Wakaba <wakaba@suika.fam.cx>
182    
183     * URIChecker.pm: Some of |pos_end| values were wrong.
184    
185 wakaba 1.364 2008-12-06 Wakaba <wakaba@suika.fam.cx>
186    
187 wakaba 1.366 * ContentChecker.pm (check_element): Added support for "no
188     referenced datalist" error.
189    
190     2008-12-06 Wakaba <wakaba@suika.fam.cx>
191    
192 wakaba 1.365 * URIChecker.pm: Bug fix: It did not work unless
193     Message::DOM::DOMImplementation has been |require|d.
194    
195     2008-12-06 Wakaba <wakaba@suika.fam.cx>
196    
197 wakaba 1.364 * NanoDOM.pm (document_uri): New attribute.
198    
199     * ContentChecker.pm: Don't use methods not implemented by NanoDOM.
200    
201 wakaba 1.363 2008-11-07 Wakaba <wakaba@suika.fam.cx>
202    
203     * NanoDOM.pm (text_content): Don't create a Text node if the new
204     value is empty.
205    
206 wakaba 1.362 2008-11-06 Wakaba <wakaba@suika.fam.cx>
207    
208     * SWML/: New directory.
209    
210 wakaba 1.361 2008-10-20 Wakaba <wakaba@suika.fam.cx>
211    
212     * NanoDOM.pm (specified, all_declarations_processed,
213     manakai_attribute_type): New attributes.
214    
215 wakaba 1.359 2008-10-19 Wakaba <wakaba@suika.fam.cx>
216    
217 wakaba 1.360 * NanoDOM.pm (Entity->new): Initialize ->child_nodes as an empty
218     array.
219    
220     2008-10-19 Wakaba <wakaba@suika.fam.cx>
221    
222 wakaba 1.359 * NanoDOM.pm (notation_name): New attribute.
223    
224 wakaba 1.357 2008-10-18 Wakaba <wakaba@suika.fam.cx>
225    
226 wakaba 1.359 * NanoDOM.pm (public_id, system_id): New attributes.a
227 wakaba 1.358
228     2008-10-18 Wakaba <wakaba@suika.fam.cx>
229    
230 wakaba 1.357 * NanoDOM.pm (text_content): Moved to Node from Element. Setter
231     implemented.
232     (allowed_tokens, default_type, declared_type): Implemented.
233    
234 wakaba 1.356 2008-10-17 Wakaba <wakaba@suika.fam.cx>
235    
236     * NanoDOM.pm (node_name): New attribute.
237     (ELEMENT_TYPE_DEFINITION_NODE, ATTRIBUTE_DEFINITION_NODE): New
238     constants.
239     (create_element_type_definition_node, create_attribute_definition,
240     create_notation, create_general_entity,
241     get_element_type_definition_node,
242     set_element_type_definition_node, get_general_entity_node,
243     set_general_entity_node, get_notation_node, set_notation_node,
244     get_attribute_definition_node, set_attribute_definition_node): New
245     methods.
246     (element_types, entities, notations, attribute_definitions): New
247     attributes.
248     (DocumentType): Support for child nodes, entities, notations, and
249     element types.
250     (Entity, Notation, ElementTypeDefinition, AttributeDefinition):
251     New classes.
252    
253     * Dumper.pm: Support for general entities, notations, element type
254     definitions, and attribute definitions.
255    
256 wakaba 1.355 2008-10-15 Wakaba <wakaba@suika.fam.cx>
257    
258     * NanoDOM.pm (create_processing_instruction): New method.
259     (xml_version, xml_encoding, xml_standalone): New attributes.
260     (ProcessingInstruction): New class.
261    
262 wakaba 1.352 2008-10-14 Wakaba <wakaba@suika.fam.cx>
263    
264 wakaba 1.354 * HTML.pm.src: Handling of end tags in the foreign content
265     insertion mode was partially wrong, because of wrong bit
266     operations.
267    
268     2008-10-14 Wakaba <wakaba@suika.fam.cx>
269    
270 wakaba 1.353 * NanoDOM.pm (dom_config): New attribute (do nothing), for
271     Whatpm::XML::Parser support.
272    
273     2008-10-14 Wakaba <wakaba@suika.fam.cx>
274    
275 wakaba 1.352 * Makefile: New rule to make HTML/Tokenizer.pm is added.
276    
277     * HTML.pm.src: Tokenizer part moved to another file.
278    
279 wakaba 1.349 2008-10-13 Wakaba <wakaba@suika.fam.cx>
280    
281 wakaba 1.351 * HTML.pm.src: Merge |DT_EL| and |DD_EL| as |DTDD_EL|.
282    
283     2008-10-13 Wakaba <wakaba@suika.fam.cx>
284    
285 wakaba 1.350 * HTML.pm.src: Element category constants redefined.
286    
287     2008-10-13 Wakaba <wakaba@suika.fam.cx>
288    
289 wakaba 1.349 * HTML.pm.src: Steps for CDATA/RCDATA elements in tree
290     construction stage synced with the spec (HTML5 revisions 2139 and
291     2302).
292    
293 wakaba 1.348 2008-10-07 Wakaba <wakaba@suika.fam.cx>
294    
295     * ContentChecker.pm: New error level "html5_fact" added, which
296     should be tentatively used until all of requirements are properly
297     specced as RFC 2119 "MUST" in HTML5.
298    
299 wakaba 1.346 2008-10-05 Wakaba <wakaba@suika.fam.cx>
300    
301 wakaba 1.347 * ContentChecker.pod: Note on internal flags is added.
302    
303     2008-10-05 Wakaba <wakaba@suika.fam.cx>
304    
305 wakaba 1.346 * HTML.pm.src: An AAA bug fixed.
306    
307 wakaba 1.336 2008-10-04 Wakaba <wakaba@suika.fam.cx>
308    
309 wakaba 1.345 * HTML.pm.src: If another node is inserted by the parser, don't
310     reuse existing Text node to append a character (HTML5 revision
311     2124).
312    
313     2008-10-04 Wakaba <wakaba@suika.fam.cx>
314    
315 wakaba 1.344 * HTML.pm.src: Support for <option> and <optgroup> in body (HTML5
316     revisions 1731 and 2128).
317    
318     2008-10-04 Wakaba <wakaba@suika.fam.cx>
319    
320 wakaba 1.343 * HTML.pm.src: Make <foreignObject> scoping (HTML5 revision 1837).
321     Support for end tags of camelCase SVG elements were broken. A
322     wrong error type text fixed.
323    
324     2008-10-04 Wakaba <wakaba@suika.fam.cx>
325    
326 wakaba 1.342 * HTML.pm.src: Drop redundant code (HTML5 revision 1731).
327    
328     2008-10-04 Wakaba <wakaba@suika.fam.cx>
329    
330 wakaba 1.341 * HTML.pm.src: Support for new definition of |param| and |source|
331     start tag parsing (HTML5 revision 1731).
332    
333     2008-10-04 Wakaba <wakaba@suika.fam.cx>
334    
335 wakaba 1.340 * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
336    
337     2008-10-04 Wakaba <wakaba@suika.fam.cx>
338    
339 wakaba 1.339 * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
340     revisions 1731 and 1831).
341    
342     2008-10-04 Wakaba <wakaba@suika.fam.cx>
343    
344 wakaba 1.338 * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
345     revisions 1731 and 1778). Support for the </sarcasm> end tag (HTML5
346     revision 1731).
347    
348     2008-10-04 Wakaba <wakaba@suika.fam.cx>
349    
350 wakaba 1.337 * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
351     revision 1731). End tags of |option| and |optgroup| elements are
352     now optional (HTML5 revision 1731).
353    
354     2008-10-04 Wakaba <wakaba@suika.fam.cx>
355    
356 wakaba 1.336 * HTML.pm.src: New "special" elements added to the list (HTML5
357     revision 1778). "strile" -> "strike".
358    
359 wakaba 1.335 2008-10-02 Wakaba <wakaba@suika.fam.cx>
360    
361     * ContentType.pm (get_sniffed_type): Support for the "better"
362     content sniffing (HTML5 revision 1927). In a case the official
363     type was not returned when the method is invoked in the list
364     context.
365    
366 wakaba 1.334 2008-09-22 Wakaba <wakaba@suika.fam.cx>
367    
368     * HTML.pm.src: Character references for non-space C0 characters,
369     including U+000B VT, DEL character, noncharacter code points, are
370     now converted to the U+FFFD character (cf. HTML5 revision 2138).
371    
372 wakaba 1.330 2008-09-21 Wakaba <wakaba@suika.fam.cx>
373    
374 wakaba 1.333 * ContentChecker.pm: |form=""| check support added.
375    
376     2008-09-21 Wakaba <wakaba@suika.fam.cx>
377    
378 wakaba 1.332 * ContentChecker.pm: |contextmenu| validness is now checked using
379     |id| and |id_type| properties, and |menu| property is removed.
380    
381     2008-09-21 Wakaba <wakaba@suika.fam.cx>
382    
383 wakaba 1.331 * ContentChecker.pm: Prepare for |form| |name| attribute's
384     duplication checking.
385    
386     2008-09-21 Wakaba <wakaba@suika.fam.cx>
387    
388 wakaba 1.330 * HTML.pm.src (parse_byte_stream): Support (or non-support) for
389     unsupported charset="" parameter value (HTML5 revision 2131).
390    
391 wakaba 1.325 2008-09-20 Wakaba <wakaba@suika.fam.cx>
392    
393 wakaba 1.329 * HTML.pm.src: Reminding places where U+000B is allowed as a space
394     character is fixed (cf. HTML5 revision 1738).
395    
396     * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
397     space characters (HTML5 revision 1738).
398    
399     2008-09-20 Wakaba <wakaba@suika.fam.cx>
400    
401 wakaba 1.328 * HTML.pm.src: The "anything else" case for the "after after body"
402     insertion mode was not updated to swtich to the "in body"
403     insertion mode. U+000B is no longer a space character for the
404     purpose of tree construction phase (HTML5 revision 1738).
405    
406     2008-09-20 Wakaba <wakaba@suika.fam.cx>
407    
408 wakaba 1.327 * HTML.pm.src: U+000B is no longer a space character (HTML5
409     revision 1738).
410    
411     2008-09-20 Wakaba <wakaba@suika.fam.cx>
412    
413 wakaba 1.326 * ContentType.pm: 0x0B is no longer a space character (HTML5
414     revision 1738).
415    
416     * HTML.pm.src: U+000B is no longer a space character for the
417     algorithm for extracting an encoding from a Content-Type (HTML5
418     revision 1738).
419    
420     2008-09-20 Wakaba <wakaba@suika.fam.cx>
421    
422 wakaba 1.325 * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
423    
424 wakaba 1.320 2008-09-18 Wakaba <wakaba@suika.fam.cx>
425    
426 wakaba 1.324 * LangTag.pm: Add checks for remaining requirements from RFC 4646.
427    
428     * mklangreg.pl: Sort 'Prefix' values by their length, to ease
429     matching.
430    
431     2008-09-18 Wakaba <wakaba@suika.fam.cx>
432    
433 wakaba 1.323 * LangTag.pm: Warn for private use language subtags. Error level
434     typos fixed. Support for Suppress-Script field.
435    
436     * mklangreg.pl: Support for dumping of nested structure.
437    
438     2008-09-18 Wakaba <wakaba@suika.fam.cx>
439    
440 wakaba 1.322 * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
441     recommended case as per RFC 4646.
442    
443     2008-09-18 Wakaba <wakaba@suika.fam.cx>
444    
445 wakaba 1.321 * LangTag.pm (check_rfc4646_langtag): New method.
446    
447     2008-09-18 Wakaba <wakaba@suika.fam.cx>
448    
449 wakaba 1.320 * mklangreg.pl: New script.
450    
451     * Makefile: Updated for creation of the module for language subtag
452     registry.
453    
454 wakaba 1.315 2008-09-16 Wakaba <wakaba@suika.fam.cx>
455    
456 wakaba 1.319 * Makefile: WebIDL.html added.
457    
458     * WebIDL.pod: New documentation.
459    
460     2008-09-16 Wakaba <wakaba@suika.fam.cx>
461    
462 wakaba 1.318 * WebIDL.pm: Checker's error types are redefined.
463    
464     2008-09-16 Wakaba <wakaba@suika.fam.cx>
465    
466 wakaba 1.317 * WebIDL.pm: Parser's error types are redefined. Some forward
467     compatible parsing bugs are fixed. Some unreachable codes are
468     commented out.
469    
470     2008-09-16 Wakaba <wakaba@suika.fam.cx>
471    
472 wakaba 1.316 * WebIDL.pm: Support for the reminding extended attributes are
473     added. It does not satisfy the definition that a forward
474     interface declaration has an extended attribute. It seems that
475     unless explicitly allowed multiple extended attributes with the
476     same name is not allowed, though it is not explicitly mentioned in
477     the spec.
478    
479     2008-09-16 Wakaba <wakaba@suika.fam.cx>
480    
481 wakaba 1.315 * WebIDL.pm: Unescapes extended attribute names and extended
482     attribute identifiers. Preserve whether an extended attribute has
483     an argument list of not. Support for extended attributes:
484     Constructor, ExceptionConsts, IndexGetter, IndexSetter,
485     NameGetter, NameSetter, and Null.
486     (has_argument_list): New attribute.
487     (idl_text): Stringifies argument lists, if any, even if it is
488     empty.
489    
490 wakaba 1.310 2008-09-15 Wakaba <wakaba@suika.fam.cx>
491    
492 wakaba 1.314 * HTML.pm.src: New state |PCDATA_STATE|. Use an empty string for
493     |{s_kwd}| in DATA_STATE as default.
494    
495     2008-09-15 Wakaba <wakaba@suika.fam.cx>
496    
497 wakaba 1.313 * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
498     by |{s_kwd}| in DATA_STATE.
499    
500     2008-09-15 Wakaba <wakaba@suika.fam.cx>
501    
502 wakaba 1.312 * HTML.pm.src: Shorten keys.
503    
504     2008-09-15 Wakaba <wakaba@suika.fam.cx>
505    
506 wakaba 1.311 * HTML.pm.src: Remove checking for control character, surrogate
507     pair, or noncharacter code points and non-Unicode code
508     points (they should be handled by Whatpm::Charset::UnicodeChecker).
509     (parse_char_stream): Support for the |$get_wrapper| argument and
510     character stream error handlers.
511    
512     2008-09-15 Wakaba <wakaba@suika.fam.cx>
513    
514 wakaba 1.310 * ContentChecker.pm: Don't call |loda_ns_module|
515     for null-namespace elements/attributes.
516    
517     * HTML.pm.src: Fact out $disallowed_control_chars
518     as a hash.
519    
520 wakaba 1.300 2008-09-14 Wakaba <wakaba@suika.fam.cx>
521    
522 wakaba 1.309 * HTML.pm.src: Regexp typo fixed. |{prev_char}|
523     and |{next_char}| initializations are moved to initialization
524     method. |{read_until}| now supports buffering. Sync |set_inner_html|
525     with |parse_char_stream|.
526    
527     2008-09-14 Wakaba <wakaba@suika.fam.cx>
528    
529 wakaba 1.308 * HTML.pm.src (parse_char_stream): Make |set_next_char|
530     invoke |manakai_read_until|, not only |read|, where
531     possible, to decrease the number of |read| method calls.
532    
533     * mkhtmlparser.pl: Related changes to the aforementioned
534     modification.
535    
536     2008-09-14 Wakaba <wakaba@suika.fam.cx>
537    
538 wakaba 1.307 * HTML.pm.src: Use |read| instead of |getc|. |set_inner_html|
539     would report character error from now.
540    
541     2008-09-14 Wakaba <wakaba@suika.fam.cx>
542    
543 wakaba 1.306 * HTML.pm.src: White-space-leaded non-white-space character
544     tokens in "before head insertion mode" was not
545     correctly handled.
546     (set_inner_html): Reimplemented using CharString decodehandle
547     class. Support for $get_wrapper argument. Support
548     for |{read_until}| feature.
549    
550     2008-09-14 Wakaba <wakaba@suika.fam.cx>
551    
552 wakaba 1.305 * HTML.pm.src: Make a "bare ero" error for unknown
553     entities point the "&" character.
554    
555     2008-09-14 Wakaba <wakaba@suika.fam.cx>
556    
557 wakaba 1.304 * HTML.pm.src: It turns out that U+FFFD don't have to
558     be added to the list of excluded characters.
559    
560     2008-09-14 Wakaba <wakaba@suika.fam.cx>
561    
562 wakaba 1.303 * HTML.pm.src ($char_onerror): Have character decoder's |line|
563     and |column| a higher priority than the one set by the
564     tokenizer's input handler.
565     ($self->{read_until}): Exclude U+FFFD (but this might
566     not be necessary, since now we do line/column fixup in
567     the character decode handle).
568    
569     2008-09-14 Wakaba <wakaba@suika.fam.cx>
570    
571 wakaba 1.302 * HTML.pm.src: Use |{read_until}| where possible.
572    
573     2008-09-14 Wakaba <wakaba@suika.fam.cx>
574    
575 wakaba 1.301 * HTML.pm.src: Change |{getc_until}| to |{read_until}|
576     and |manakai_getc_until| to |manakai_read_until| to
577     reduce the number of string copies.
578    
579     2008-09-14 Wakaba <wakaba@suika.fam.cx>
580    
581 wakaba 1.300 * HTML.pm.src (parse_char_string): Use newly created
582     |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
583     standard feature to |open| a string as a filehandle,
584     since Perl's string filehandle seems not supporting |ungetc|
585     method correctly.
586     (parse_char_stream): Define |{getc_until}| method.
587     (DATA_STATE): Experimental support for |getc_until| feature.
588    
589 wakaba 1.292 2008-09-13 Wakaba <wakaba@suika.fam.cx>
590    
591 wakaba 1.299 * HTML.pm.src: Check points added to newly added branches.
592    
593     2008-09-13 Wakaba <wakaba@suika.fam.cx>
594    
595 wakaba 1.298 * HTML.pm.src: Remove |{char}|, which is no longer used.
596     Remove |{entity_in_attr}| and |{last_attribute_value_state}|
597     and replaced by |{prev_state}|.
598    
599     * mkhtmlparser.pl: Remove |{char}| feature.
600     Remove |!!!back-next-input-character;| macro.
601    
602     2008-09-13 Wakaba <wakaba@suika.fam.cx>
603    
604 wakaba 1.297 * HTML.pm.src: Finally we get rid of all the inner loops. Remove
605     entity related tokenizer states in favor of new states
606     implementing the consume character reference algorithm.
607    
608     2008-09-13 Wakaba <wakaba@suika.fam.cx>
609    
610     * HTML.pm.src: "Consume a character reference" algorithm is
611 wakaba 1.296 now implemented as a tokenizer's state, rather than
612     a method, with minimum changes (more changes will
613     be made, in due course). "Bogus comment state"'s inner
614     loop gets removed.
615    
616     2008-09-13 Wakaba <wakaba@suika.fam.cx>
617    
618 wakaba 1.297 * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
619 wakaba 1.295 into their own tokenizer states.
620    
621     2008-09-13 Wakaba <wakaba@suika.fam.cx>
622    
623 wakaba 1.294 * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
624     is split into three states.
625    
626     2008-09-13 Wakaba <wakaba@suika.fam.cx>
627    
628 wakaba 1.293 * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
629     itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
630     no longer does the tokenizer have to push back next input
631     characters in those states.
632    
633     2008-09-13 Wakaba <wakaba@suika.fam.cx>
634    
635 wakaba 1.292 * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
636     into four states so that no longer does the tokenizer have to push
637     back next input characters in that state.
638    
639 wakaba 1.291 2008-09-11 Wakaba <wakaba@suika.fam.cx>
640    
641     * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
642     which can be used to insert some wrapper between the character
643     stream handle and the tokenizer. (It is currently not supported
644     for |set_inner_html| for |Element|s).
645    
646 wakaba 1.289 2008-09-10 Wakaba <wakaba@suika.fam.cx>
647    
648 wakaba 1.290 * HTML.pm.src: Ignore punctuations in charset names.
649    
650     2008-09-10 Wakaba <wakaba@suika.fam.cx>
651    
652 wakaba 1.289 * ContentChecker.pm: Support for charset-layer error levels.
653    
654     * HTML.pm.src: Don't specify |text| argument for the
655     |chardecode:fallback| error, since it is not the encoding
656     being used alternatively.
657    
658 wakaba 1.288 2008-09-06 Wakaba <wakaba@suika.fam.cx>
659    
660     * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
661    
662 wakaba 1.286 2008-08-31 Wakaba <wakaba@suika.fam.cx>
663    
664 wakaba 1.287 * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
665    
666     2008-08-31 Wakaba <wakaba@suika.fam.cx>
667    
668 wakaba 1.286 * HTML.pm.src: Bug fix and sync with the spec with regard
669     to after after frameset insertion mode processing (HTML5
670     revision 1909). Note that the implementation was wrong
671     per the old spec before the r1909 changes.
672    
673 wakaba 1.278 2008-08-30 Wakaba <wakaba@suika.fam.cx>
674    
675 wakaba 1.285 * HTMLTable.pm: scope=auto algorithm fix synced with the
676     spec (HTML5 revision 2093).
677     ($process_row): Algorithm step numbers synced with the
678     spec (HTML5 revision 2092).
679    
680     2008-08-30 Wakaba <wakaba@suika.fam.cx>
681    
682 wakaba 1.284 * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
683     revision 2094).
684    
685     2008-08-30 Wakaba <wakaba@suika.fam.cx>
686    
687     * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
688    
689     2008-08-30 Wakaba <wakaba@suika.fam.cx>
690    
691 wakaba 1.283 * HTML.pm.src: '"' and "'" at the end of attribute
692     name (after another attribute) now raise parse error (HTML5
693     revision 2123). Empty unquoted attribute values are no
694     longer allowed (HTML5 revision 2122).
695    
696     2008-08-30 Wakaba <wakaba@suika.fam.cx>
697    
698 wakaba 1.282 * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
699     revision 2130).
700    
701     2008-08-30 Wakaba <wakaba@suika.fam.cx>
702    
703 wakaba 1.281 * ContentChecker.pm: |xml:lang| attribute value must be same
704     as |lang| attribute value for HTML elements (HTML5 revision 2062
705     and so on).
706    
707     2008-08-30 Wakaba <wakaba@suika.fam.cx>
708    
709 wakaba 1.280 * ContentChecker.pm: Error level definition for |xml_id_error|
710     was missing.
711    
712     * URIChecker.pm: The end of the URL should be marked as the
713     error location for an empty path error. The position
714     between the userinfo and the port components should be
715     marked as the error location for an empty host error.
716    
717     2008-08-30 Wakaba <wakaba@suika.fam.cx>
718    
719 wakaba 1.279 * URIChecker.pm: Set parameters representing where in the
720     value the error occurs for errors. Report unknown
721     address format error in warning level, since address
722     formats are rarely added. Path segments starting with "/.."
723     were misinterpreted as a dot-segment.
724    
725     2008-08-30 Wakaba <wakaba@suika.fam.cx>
726    
727 wakaba 1.278 * URIChecker.pm (check_iri_reference): Requires
728     |Message::DOM::DOMImplementation|.
729    
730 wakaba 1.277 2008-08-29 Wakaba <wakaba@suika.fam.cx>
731    
732     * IMTChecker.pm: Updated for the new error reporting architecture.
733    
734     * ContentChecker.pm: Error levels for IMTs are added.
735    
736 wakaba 1.276 2008-08-17 Wakaba <wakaba@suika.fam.cx>
737    
738     * H2H.pm (_shift_token): Support for unquoted HTML attribute
739     values.
740    
741 wakaba 1.274 2008-08-16 Wakaba <wakaba@suika.fam.cx>
742    
743 wakaba 1.275 * CacheManifest.pm: Support for new style of error
744     reports.
745    
746     * HTML.pm.src: Set line=1, column=1 to the document node.
747    
748     2008-08-16 Wakaba <wakaba@suika.fam.cx>
749    
750 wakaba 1.274 * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
751     and URL checkers. Support for more error levels for bogus
752     langauge tag and URL "standards".
753    
754     * LangTag.pm, URIChecker.pm: Support for new style error
755     level reporting.
756    
757 wakaba 1.271 2008-08-15 Wakaba <wakaba@suika.fam.cx>
758    
759 wakaba 1.273 * ContentChecker.pm: Support for RDF/XML error levels.
760    
761     * HTMLTable.pm, RDFXML.pm: Support for new style of error level
762     specifying. Error types are revised.
763    
764     2008-08-15 Wakaba <wakaba@suika.fam.cx>
765    
766 wakaba 1.272 * ContentChecker.pm: All error reporting method calls are
767     renewed.
768    
769     2008-08-15 Wakaba <wakaba@suika.fam.cx>
770    
771 wakaba 1.271 * HTML.pm.src: All error type names and "text" parameters
772     are revised. Use new style for "level" specification.
773    
774     * mkhtmlparser.pl: Use new style for "level" specification.
775    
776 wakaba 1.268 2008-08-03 Wakaba <wakaba@suika.fam.cx>
777    
778 wakaba 1.270 * WebIDL.pm (parse_char_string): Simplified error
779     reporting process for broken ignored valuetype definition.
780     (Valuetype idl_text): Support for special "DOMString" name.
781    
782     2008-08-03 Wakaba <wakaba@suika.fam.cx>
783    
784 wakaba 1.269 * WebIDL.pm ($get_scoped_name): Append "::::" if the last
785     terminal of the ScopedName is "DOMString", such that whether
786     the last part of the scoped name is "DOMString" or "_DOMString"
787     later. It is necessary to determine whether a |typedef|
788     definition should be ignored or not.
789     (parse_char_string): Unescape the identifier of
790     exception members.
791     ($resolve): Return undef for builtin types and sequence<T>
792     types (we might not have to do this, however...).
793     (check): Support checking for Exceptions, Valuetypes,
794     and Typedefs.
795     ($serialize_type): Support for "DOMString::::" syntax.
796     (Typedef idl_text): Output Type as "DOMString" if it
797     is really "DOMString" (i.e. its internal representation
798     is "::DOMString::").
799    
800     2008-08-03 Wakaba <wakaba@suika.fam.cx>
801    
802 wakaba 1.268 * WebIDL.pm ($resolve): New code, based on resolve code
803     for constant types in the |check| method.
804     (check): Support for checking of attributes, operations, and
805     arguments.
806     (Attribute/Operation idl_text): Exception names in getraises,
807     setraises, and raises clauses is serizlied by |$serialize_type|
808     code.
809    
810 wakaba 1.264 2008-08-02 Wakaba <wakaba@suika.fam.cx>
811    
812 wakaba 1.267 * WebIDL.pm ($integer): Order of selections are changed to match
813     hexadecimal numbers (the original pattern, taken from the spec,
814     was not work for hexadecimal numbers, because the "0" prefix
815     matches to the [0-7]* part (as an empty string) and therefore
816     it does not match with remaining "x..." part of a "0x..." integer
817     literal.
818     ($get_type): It now returns a string, not an array reference,
819     for regular types and |sequence| types (i.e. it in any case
820     returns a string).
821     ($get_next_token): The second item in the array that represents
822     a integer or float token is now a Perl number value, not the
823     original string representation of the number.
824     (check): Support for const value consistency checking.
825     No extended attribute is defined for constants.
826     (Node subclasses): Use simple strings rather than array references
827     for default data type values.
828     ($serialize_type): Type values are now simple strings.
829     (value): If the new attribute value is a false value, then
830     a FALSE value is set to the attribute.
831    
832     2008-08-02 Wakaba <wakaba@suika.fam.cx>
833    
834 wakaba 1.266 * WebIDL.pm ($get_scoped_name): Now scoped names are stored
835     in its stringified format ("scoped name" as defined in the
836     spec). Note that future version of this module should not use
837     array references for type values and the |type_text| attribute
838     should be made obsolete.
839     (parse_char_string): Unescape attribute names.
840     (check): Support for checking of whether inherited interfaces
841     are actually defined or not. Support for checking of whether
842     interface member identifiers are duplicated or not.
843     ($serialize_type): Scoped names are returned as is. A future
844     version of this code should escape identifiers other than "DOMString",
845     otherwise the idl_text would be non-conforming.
846    
847     2008-08-02 Wakaba <wakaba@suika.fam.cx>
848    
849 wakaba 1.264 * WebIDL.pm (parse_char_string): Set line/column numbers
850 wakaba 1.265 to generated nodes. Unescape identifiers. Extended attributes
851     for Definition's were ignored.
852     (append_child): Set |parent_node| attribute.
853     (parent_node): New attribute.
854     (check): Support interface/exception members. Support
855     extended attributes. Support definition identifier uniqueness
856     constraint.
857     (qualified_name): New attribute.
858     (Interface/Exception idl_text): Extended attributes were
859     not prepended to the returned text.
860    
861     2008-08-02 Wakaba <wakaba@suika.fam.cx>
862    
863     * WebIDL.pm (parse_char_string): Set line/column numbers
864 wakaba 1.264 to interface object experimentally. s/shift/pop/g, shift
865     would make things wrong. Support for interface forward
866     declarations was missing. Broken interface declarations
867     with no block were not ignored entirely.
868     (Whatpm::WebIDL::Node): New abstract class. This class
869     makes things easier.
870     (child_nodes): New attribute. Unlike DOM's attribute with
871     same name, this attribute returns a dead list of nodes for
872     simplicity.
873     (get_user_data, set_user_data): New methods.
874     (Module idl_text): A SPACE character should be inserted
875     before the |{| character.
876     (Interface idl_text): Support for interface forward declarations.
877     (is_forward_declaration): New attribute.
878    
879 wakaba 1.259 2008-07-19 Wakaba <wakaba@suika.fam.cx>
880    
881 wakaba 1.263 * WebIDL.pm (type_text): Better serializer.
882    
883     2008-07-19 Wakaba <wakaba@suika.fam.cx>
884    
885 wakaba 1.262 * WebIDL.pm: Revise forward-compatible parsing so that
886     it now can handle broken extended attributes and as such.
887    
888     2008-07-19 Wakaba <wakaba@suika.fam.cx>
889    
890 wakaba 1.261 * WebIDL.pm: Real support for extended attributes.
891     Support for extended attributes with arguments.
892    
893     2008-07-19 Wakaba <wakaba@suika.fam.cx>
894    
895 wakaba 1.260 * WebIDL.pm: Support for |exception| syntax.
896     (Interface->idl_text): Tentative support for inheritances.
897    
898     2008-07-19 Wakaba <wakaba@suika.fam.cx>
899    
900 wakaba 1.259 * WebIDL.pm: Hierarchical scoped name support was broken.
901     Support for raises, setraises, and getraises syntaxes.
902    
903 wakaba 1.258 2008-07-18 Wakaba <wakaba@suika.fam.cx>
904    
905     * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
906     proper support for types, extended attributes, and exceptions yet).
907     WebIDL parser, version 1 (no support for exceptions yet,
908     no proper support for extended attributes yet).
909    
910 wakaba 1.257 2008-07-09 Wakaba <wakaba@suika.fam.cx>
911    
912     * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
913    
914 wakaba 1.255 2008-06-29 Wakaba <wakaba@suika.fam.cx>
915    
916 wakaba 1.256 * WebIDL.pm: Support for valuetype and const.
917    
918     2008-06-29 Wakaba <wakaba@suika.fam.cx>
919    
920 wakaba 1.255 * WebIDL.pm: New module.
921    
922 wakaba 1.254 2008-06-15 Wakaba <wakaba@suika.fam.cx>
923    
924     * Makefile (Entities.html): URI changed.
925    
926 wakaba 1.253 2008-06-08 Wakaba <wakaba@suika.fam.cx>
927    
928     * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
929    
930 wakaba 1.251 2008-06-01 Wakaba <wakaba@suika.fam.cx>
931    
932 wakaba 1.252 * HTML.pm.src (_get_next_token): A parse error was missing.
933    
934     2008-06-01 Wakaba <wakaba@suika.fam.cx>
935    
936 wakaba 1.251 * mklinktypelist.pl: rel=contact is no longer part of the HTML5
937     spec (commented out). (HTML5 revision 1711).
938    
939 wakaba 1.248 2008-05-25 Wakaba <wakaba@suika.fam.cx>
940    
941 wakaba 1.250 * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
942    
943     * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
944     as UTF-16 (HTML5 revision 1701).
945    
946     2008-05-25 Wakaba <wakaba@suika.fam.cx>
947    
948 wakaba 1.249 * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
949     1692).
950    
951     2008-05-25 Wakaba <wakaba@suika.fam.cx>
952    
953     * HTML.pm.src: The secondary insertion mode used when switching
954     to foreign content is the "in body" insertion mode (HTML5 revision
955     1696).
956    
957     2008-05-25 Wakaba <wakaba@suika.fam.cx>
958    
959 wakaba 1.248 * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
960     revision 1697).
961    
962 wakaba 1.239 2008-05-24 Wakaba <wakaba@suika.fam.cx>
963    
964 wakaba 1.247 * HTML.pm.src: Support for end-of-file token in foreign content
965     insertion mode (HTML5 revision 1693). Update SVG camelCase
966     attribute list (HTML5 revision 1700). <textarea> closes
967     </select> (HTML5 revision 1699). More start tags close in
968     foreign content insertion mode (HTML5 revision 1698).
969    
970     2008-05-24 Wakaba <wakaba@suika.fam.cx>
971    
972 wakaba 1.246 * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
973    
974     2008-05-24 Wakaba <wakaba@suika.fam.cx>
975    
976 wakaba 1.245 * HTML.pm.src: More robust charset parameter detection (HTML5
977     revision 1674).
978    
979     2008-05-24 Wakaba <wakaba@suika.fam.cx>
980    
981 wakaba 1.244 * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
982     revision 1676).
983    
984     2008-05-24 Wakaba <wakaba@suika.fam.cx>
985    
986 wakaba 1.243 * HTML.pm.src: Ignore language part of public identifiers for
987     quriks mode detection (HTML5 revision 1679).
988    
989     2008-05-24 Wakaba <wakaba@suika.fam.cx>
990    
991 wakaba 1.242 * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
992     revision 1685).
993    
994     2008-05-24 Wakaba <wakaba@suika.fam.cx>
995    
996 wakaba 1.241 * HTML.pm.src: Support for EOF in new states for tags (HTML5
997     revision 1684).
998    
999     2008-05-24 Wakaba <wakaba@suika.fam.cx>
1000    
1001 wakaba 1.240 * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
1002     work (HTML5 revision 1690).
1003    
1004     2008-05-24 Wakaba <wakaba@suika.fam.cx>
1005    
1006 wakaba 1.239 * HTML.pm.src (_tree_construction_main): Change handling of
1007     end tags in head insertion modes (HTML5 revision 1686).
1008     (parse_char_string): Bug fix for non-utf8 character string handlings.
1009     (parse_char_stream): |ungetc| does not work well for this context.
1010    
1011 wakaba 1.237 2008-05-18 Wakaba <wakaba@suika.fam.cx>
1012    
1013 wakaba 1.238 * HTML.pm.src (parse_byte_string): Redefined to invoke
1014     |parse_byte_stream|.
1015     (parse_byte_stream): New method.
1016    
1017     2008-05-18 Wakaba <wakaba@suika.fam.cx>
1018    
1019 wakaba 1.237 * HTML.pm.src (parse_byte_string): Fix the column number reported
1020     by encoding layer error reporter.
1021    
1022 wakaba 1.233 2008-05-17 Wakaba <wakaba@suika.fam.cx>
1023    
1024 wakaba 1.236 * HTML.pm.src (parse_byte_string): Use streaming decoder
1025     rather than converting the whole byte string and then parsing.
1026     Propagate errors in character encoding layer.
1027     (get_next_token): Precise error reporting for |bare stago| error.
1028    
1029     2008-05-17 Wakaba <wakaba@suika.fam.cx>
1030    
1031 wakaba 1.235 * HTML.pm.src (parse_char_stream): New method.
1032     (parse_char_string): This method is now defined as an invocation
1033     of the |parse_char_stream| method.
1034    
1035     2008-05-17 Wakaba <wakaba@suika.fam.cx>
1036    
1037 wakaba 1.234 * HTML.pm.src (parse_byte_string): Report various status
1038     of the sniffing as info-level errors. Support for new
1039     decoding framework in parser resestting.
1040     (new): Various default error levels were not set.
1041    
1042     2008-05-17 Wakaba <wakaba@suika.fam.cx>
1043    
1044 wakaba 1.233 * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
1045     algorithm, except for the actual sniffing, is implemented
1046     with new framework with Message::Charset::Info.
1047    
1048 wakaba 1.232 2008-05-16 Wakaba <wakaba@suika.fam.cx>
1049    
1050     * CacheManifest.pm (_parse): Drop fragment identifiers from
1051     URIs in fallback section (HTML5 revision 1596).
1052    
1053 wakaba 1.230 2008-05-10 Wakaba <wakaba@suika.fam.cx>
1054    
1055 wakaba 1.231 * Makefile (Entities.html): URI has changed.
1056    
1057     2008-05-10 Wakaba <wakaba@suika.fam.cx>
1058    
1059 wakaba 1.230 * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
1060     1553).
1061    
1062 wakaba 1.228 2008-05-06 Wakaba <wakaba@suika.fam.cx>
1063    
1064 wakaba 1.229 * ContentChecker.pm: Noted that those returned in |table| are
1065     no longer table elements, but table objects returned
1066     by Whatpm::HTMLTable.
1067    
1068     * HTMLTable.pm (form_table): Return table element node
1069     as |$table->{element}|.
1070     (assign_header): Support for the |headers=""| attribute.
1071    
1072     2008-05-06 Wakaba <wakaba@suika.fam.cx>
1073    
1074 wakaba 1.228 * HTMLTable.pm (assign_header): New function; first version
1075     with no support for headers="".
1076     (form_table): Include table width and height to the returned
1077     table object for covenience. Indexing in column assignement was wrong.
1078     Set whether a data cell is empty or not for convenience.
1079    
1080 wakaba 1.222 2008-05-05 Wakaba <wakaba@suika.fam.cx>
1081    
1082 wakaba 1.227 * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
1083    
1084     2008-05-05 Wakaba <wakaba@suika.fam.cx>
1085    
1086 wakaba 1.226 * HTMLTable.pm: How table model errors are detected is
1087     changed (HTML5 revision 1387).
1088    
1089     2008-05-05 Wakaba <wakaba@suika.fam.cx>
1090    
1091 wakaba 1.225 * HTMLTable.pm: The algorithm now moves |tfoot| elements
1092     to the end of the table (HTML5 revision 1380).
1093    
1094     2008-05-05 Wakaba <wakaba@suika.fam.cx>
1095    
1096 wakaba 1.224 * HTMLTable.pm: The algorithm is now 0-based indexing, instead
1097     of 1-based (HTML5 revision 1376).
1098    
1099     2008-05-05 Wakaba <wakaba@suika.fam.cx>
1100    
1101 wakaba 1.223 * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
1102     and "Content-Encoding" no longer prevent sniffing (HTML5
1103     revision 1288).
1104    
1105     2008-05-05 Wakaba <wakaba@suika.fam.cx>
1106    
1107 wakaba 1.222 * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
1108     1282).
1109    
1110 wakaba 1.220 2008-05-03 Wakaba <wakaba@suika.fam.cx>
1111    
1112 wakaba 1.221 * ContentChecker.pm: Support for global attributes.
1113     Status of XML specs are added.
1114    
1115     2008-05-03 Wakaba <wakaba@suika.fam.cx>
1116    
1117 wakaba 1.220 * ContentChecker.pm (check_element): Support for |template=""|
1118     and |ref=""| attribute (referent element type checking).
1119    
1120 wakaba 1.218 2008-04-29 Wakaba <wakaba@suika.fam.cx>
1121    
1122 wakaba 1.219 * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
1123     1500) is implemented (except for IDNA part and URI-scheme-specific
1124 wakaba 1.218 knowledge). Line number counting was wrong for LF-only
1125     documents.
1126    
1127 wakaba 1.211 2008-04-13 Wakaba <wakaba@suika.fam.cx>
1128    
1129 wakaba 1.217 * HTML.pm.src: Raise an parse error for any disallowed
1130     character (HTML5 revision 1263).
1131    
1132     2008-04-13 Wakaba <wakaba@suika.fam.cx>
1133    
1134 wakaba 1.216 * mkentitylist.pl: Support for new HTML5 entity table format (the
1135     definition for |AElig;| was missing).
1136    
1137     2008-04-13 Wakaba <wakaba@suika.fam.cx>
1138    
1139 wakaba 1.215 * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
1140     name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
1141    
1142     2008-04-13 Wakaba <wakaba@suika.fam.cx>
1143    
1144 wakaba 1.214 * HTML.pm.src: List of element names that close foreign content
1145     insertion mode is added (HTML5 revisions 1412 and 1418).
1146    
1147     2008-04-13 Wakaba <wakaba@suika.fam.cx>
1148    
1149 wakaba 1.213 * HTML.pm.src: Support for |mglyph| and |malignmark|
1150     elements (HTML5 revision 1410).
1151    
1152     2008-04-13 Wakaba <wakaba@suika.fam.cx>
1153    
1154 wakaba 1.212 * HTML.pm.src: Support for new long MathML entities (HTML5
1155     revision 1406).
1156    
1157     2008-04-13 Wakaba <wakaba@suika.fam.cx>
1158    
1159 wakaba 1.211 * HTML.pm.src: CDATA section support for MathML and SVG
1160     elements (HTML5 revision 1404 and 1420).
1161    
1162 wakaba 1.209 2008-04-12 Wakaba <wakaba@suika.fam.cx>
1163    
1164 wakaba 1.210 * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
1165     elements (HTML5 revision 1404). Unused !!!macro definitions
1166     are removed.
1167    
1168     2008-04-12 Wakaba <wakaba@suika.fam.cx>
1169    
1170 wakaba 1.209 * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
1171     are raised is changed (HTML5 revision 1404).
1172    
1173 wakaba 1.206 2008-04-06 Wakaba <wakaba@suika.fam.cx>
1174    
1175 wakaba 1.208 * HTML.pm.src: Category sets that are no longer used are removed.
1176    
1177     2008-04-06 Wakaba <wakaba@suika.fam.cx>
1178    
1179 wakaba 1.207 * HTML.pm.src: The ->[1] property of stack entries are now
1180     replaced by constants representing element category.
1181    
1182     2008-04-06 Wakaba <wakaba@suika.fam.cx>
1183    
1184 wakaba 1.206 * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
1185     for error reporting. (This is a preparation for using constant
1186     value for ->[1].)
1187    
1188 wakaba 1.204 2008-03-22 Wakaba <wakaba@suika.fam.cx>
1189    
1190 wakaba 1.205 * RDFXML.pm: Typo fixed.
1191    
1192     2008-03-22 Wakaba <wakaba@suika.fam.cx>
1193    
1194 wakaba 1.204 * ContentChecker.pm: |fact_level| is now treated
1195     as same as |must_level|, i.e. level = |m|.
1196     (check_element): Make list of URIs in the DOM.
1197    
1198 wakaba 1.197 2008-03-21 Wakaba <wakaba@suika.fam.cx>
1199    
1200 wakaba 1.203 * RDFXML.pm: Language accessor implemented.
1201     Local (null-namespace) attribute support.
1202    
1203     2008-03-21 Wakaba <wakaba@suika.fam.cx>
1204    
1205 wakaba 1.202 * RDFXML.pm: Factored out ID checking code.
1206    
1207     2008-03-21 Wakaba <wakaba@suika.fam.cx>
1208    
1209 wakaba 1.201 * RDFXML.pm: TODO items noted. Validation of ID
1210     and URI attributes is implemented. Warn if unknown
1211     value is used in rdf:parseType="" attribute.
1212    
1213     * URIChecker.pm (check_rdf_uri_reference): New function.
1214    
1215     2008-03-21 Wakaba <wakaba@suika.fam.cx>
1216    
1217 wakaba 1.200 * RDFXML.pm: bnodeid implemented. Relative references
1218     are now resolved.
1219    
1220     2008-03-21 Wakaba <wakaba@suika.fam.cx>
1221    
1222 wakaba 1.199 * ContentChecker.pm: RDF reification implemented.
1223    
1224     * RDFXML.pm: undef vs false bug fixed.
1225     Reification implemented.
1226    
1227     2008-03-21 Wakaba <wakaba@suika.fam.cx>
1228    
1229 wakaba 1.198 * RDFXML.pm: s/id/ID/ for attribute name.
1230     The |node| arguments are added for |ontriple| calls.
1231     Too many "attribute not allowed" errors were raised.
1232    
1233     * ContentChecker.pm: Initial experimental support for rdf:RDF
1234     element.
1235    
1236     2008-03-21 Wakaba <wakaba@suika.fam.cx>
1237    
1238 wakaba 1.197 * RDFXML.pm: New module.
1239    
1240 wakaba 1.192 2008-03-20 Wakaba <wakaba@suika.fam.cx>
1241    
1242 wakaba 1.196 * HTML.pm.src (set_inner_html): Line/column number
1243     code was old one yet.
1244    
1245     2008-03-20 Wakaba <wakaba@suika.fam.cx>
1246    
1247 wakaba 1.195 * HTML.pm.src: Better line/column reporting for "duplicate attribute"
1248     errors. Line/column markings for DOCTYPE, comment, and
1249     character tokens are reintroduced; otherwise,
1250     error location for "not HTML5" error and errors
1251     for implied elements are not attached.
1252    
1253     2008-03-20 Wakaba <wakaba@suika.fam.cx>
1254    
1255 wakaba 1.194 * HTML.pm.src: Set line/column numbers to attributes.
1256    
1257     * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
1258     (value): Setter implemented.
1259    
1260     * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
1261    
1262     2008-03-20 Wakaba <wakaba@suika.fam.cx>
1263    
1264 wakaba 1.193 * HTML.pm.src: Unused line/column markings are removed.
1265    
1266     2008-03-20 Wakaba <wakaba@suika.fam.cx>
1267    
1268 wakaba 1.192 * HTML.pm.src (_get_next_token): Remove |first_start_tag|
1269     flag, which is no longer used.
1270    
1271 wakaba 1.190 2008-03-17 Wakaba <wakaba@suika.fam.cx>
1272    
1273 wakaba 1.191 * HTML.pm.src: Set line/column information to element nodes.
1274    
1275     * mkhtmlparser.pl (!!!create-element, !!!insert-element,
1276     and !!!insert-element-t): Set line/column information to
1277     element nodes.
1278    
1279     2008-03-17 Wakaba <wakaba@suika.fam.cx>
1280    
1281 wakaba 1.190 * HTML.pm.src (_get_next_token): The first "<" character
1282     in "<?", "<>", or "</>" should be the error point.
1283    
1284 wakaba 1.187 2008-03-16 Wakaba <wakaba@suika.fam.cx>
1285    
1286 wakaba 1.189 * HTML.pm.src: Some more fixes on error position reporting.
1287    
1288     2008-03-16 Wakaba <wakaba@suika.fam.cx>
1289    
1290 wakaba 1.188 * HTML.pm.src: Token-level precious error reporting.
1291    
1292     2008-03-16 Wakaba <wakaba@suika.fam.cx>
1293    
1294 wakaba 1.187 * HTML.pm.src: Preparation for more precious error point
1295     reporting.
1296    
1297 wakaba 1.184 2008-03-11 Wakaba <wakaba@suika.fam.cx>
1298    
1299 wakaba 1.186 * HTML.pm.src: Error type revised.
1300    
1301     2008-03-11 Wakaba <wakaba@suika.fam.cx>
1302    
1303 wakaba 1.185 * HTML.pm.src: Similar codes are merged together, again.
1304    
1305     2008-03-11 Wakaba <wakaba@suika.fam.cx>
1306    
1307 wakaba 1.184 * HTML.pm.src: Similar codes are merged together.
1308    
1309 wakaba 1.183 2008-03-10 Wakaba <wakaba@suika.fam.cx>
1310    
1311     * mkhtmlparser.pl: Set "level" parameter to parse errors.
1312    
1313     * HTML.pm.src: Code refined.
1314    
1315 wakaba 1.175 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1316    
1317 wakaba 1.182 * HTML.pm.src: |</body>| treatement has been changed (HTML5
1318     revision 1348). Note that I really don't know this makes
1319     any difference in the black-box behavior of the parser.
1320    
1321     2008-03-09 Wakaba <wakaba@suika.fam.cx>
1322    
1323 wakaba 1.181 * HTML.pm.src: New end-of-file token implementation (HTML5
1324     revision 1348).
1325    
1326     2008-03-09 Wakaba <wakaba@suika.fam.cx>
1327    
1328 wakaba 1.180 * HTML.pm.src: |applet| support (HTML5 revision 1347).
1329    
1330     2008-03-09 Wakaba <wakaba@suika.fam.cx>
1331    
1332 wakaba 1.179 * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
1333    
1334     2008-03-09 Wakaba <wakaba@suika.fam.cx>
1335    
1336 wakaba 1.178 * HTML.pm.src: Support for |<input>| in the "in select" insertion
1337     mode and support for the "in select in table" insertion mode (HTML5
1338     revision 1342).
1339    
1340     2008-03-09 Wakaba <wakaba@suika.fam.cx>
1341    
1342 wakaba 1.177 * HTML.pm.src: No foster parenting for <script> and <script>
1343     in non-tainted <table>s (HTML5 revision 1336).
1344    
1345     2008-03-09 Wakaba <wakaba@suika.fam.cx>
1346    
1347 wakaba 1.176 * HTML.pm.src: Ignore white space characters between <html>
1348     and <head> (HTML5 revision 1332).
1349    
1350     2008-03-09 Wakaba <wakaba@suika.fam.cx>
1351    
1352 wakaba 1.175 * HTML.pm.src: Treat <input type=hidden> as if it were a
1353     white space (HTML5 revision 1331).
1354    
1355 wakaba 1.164 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1356    
1357 wakaba 1.174 * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
1358     element (HTML5 revision 1330).
1359    
1360     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1361    
1362 wakaba 1.173 * HTML.pm.src: <title> is always appended to the current
1363     element (HTML5 revision 1328).
1364    
1365     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1366    
1367 wakaba 1.172 * HTML.pm.src: White space in tainted tables are moved
1368     into foster parents (HTML5 revision 1326).
1369    
1370     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1371    
1372 wakaba 1.171 * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
1373     revision 1321).
1374    
1375     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1376    
1377 wakaba 1.170 * HTML.pm.src: |</h/n/>| case code rearranged to align with
1378     the spec (HTML5 revision 1320). Note that we finally complete
1379     all of HTML5 revision 1320 changes.
1380    
1381     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1382    
1383 wakaba 1.169 * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
1384     tags (HTML5 revision 1320).
1385    
1386     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1387    
1388 wakaba 1.168 * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
1389    
1390     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1391    
1392 wakaba 1.167 * HTML.pm.src: A "generate implied end tags" code (t409.1)
1393     could not be reached so that it is now removed (HTML5 revision
1394     1320).
1395    
1396     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1397    
1398 wakaba 1.166 * HTML.pm.src: Code for the case of |</div>| and so on are revised
1399     to align with new spec text (HTML5 revision 1320).
1400    
1401     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1402    
1403 wakaba 1.165 * HTML.pm.src: Remove strange |if| condition; however, it should
1404     have had no harm in theory.
1405    
1406     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1407    
1408 wakaba 1.164 * HTML.pm.src (_tree_construction_main): '</p>' in body
1409     case is split from other end tags for the preparation
1410     of implementing HTML revision 1320.
1411    
1412 wakaba 1.162 2008-03-07 Wakaba <wakaba@suika.fam.cx>
1413    
1414 wakaba 1.163 * HTML.pm.src: Simplified "generate implied end tag" (HTML5
1415     revision 1320).
1416    
1417     2008-03-07 Wakaba <wakaba@suika.fam.cx>
1418    
1419 wakaba 1.162 * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
1420     and "div" (HTML5 revision 1318). Add comments to where
1421     |form| pointer association codes should be inserted (HTML5 revision
1422     1319).
1423    
1424 wakaba 1.161 2008-03-06 Wakaba <wakaba@suika.fam.cx>
1425    
1426     * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
1427    
1428 wakaba 1.159 2008-03-05 Wakaba <wakaba@suika.fam.cx>
1429    
1430 wakaba 1.160 * HTML.pm.src: Since the case t268 should never be reached (no
1431     other token type, there are), it is replaced by a |die| statement.
1432    
1433     2008-03-05 Wakaba <wakaba@suika.fam.cx>
1434    
1435 wakaba 1.159 * HTML.pm.src: Typo fixed.
1436    
1437 wakaba 1.158 2008-03-04 Wakaba <wakaba@suika.fam.cx>
1438    
1439     * HTML.pm.src (_tree_construction_initial): Some limited quirks
1440     doctypes were not uppercased for comparison.
1441    
1442 wakaba 1.152 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1443    
1444 wakaba 1.157 * HTML.pm.src (tree construction and set_inner_html): Checkpoints
1445     are added.
1446    
1447     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1448    
1449 wakaba 1.156 * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
1450     are set. Cases that are unlikely reached are noted as so.
1451    
1452     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1453    
1454 wakaba 1.155 * HTML.pm.src: Checkpoints for debugging are added.
1455    
1456     * mkhtmlparser.pl: Support for |!!!cp| syntax.
1457    
1458     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1459    
1460 wakaba 1.154 * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
1461     for simplicity.
1462    
1463     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1464    
1465 wakaba 1.153 * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
1466     revision 1307).
1467    
1468     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1469    
1470 wakaba 1.152 * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
1471     is set are changed (HTML5 revision 1305).
1472    
1473 wakaba 1.149 2008-03-02 Wakaba <wakaba@suika.fam.cx>
1474    
1475 wakaba 1.151 * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
1476     1292). Entities are not parsed in comment-like part in RCDATA
1477     elements (HTML5 revision 1294). Allow bare & at the end
1478     of attribute value literals (HTML5 revision 1296). More
1479     quirks mode doctypes (HTML5 revision 1302). Requires spaces
1480     between attributes and ban attribute names or unquoted
1481     attribute values containing single or double quotes (HTML5
1482     revision 1303).
1483    
1484     2008-03-02 Wakaba <wakaba@suika.fam.cx>
1485    
1486 wakaba 1.150 * ContentChecker.pm: Typo fixed. Don't raise "character encoding"
1487     and related errors unless it is an HTML document (though the spec
1488     is unclear on whether it is applied to XHTML document).
1489    
1490     * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
1491     are added.
1492    
1493     2008-03-02 Wakaba <wakaba@suika.fam.cx>
1494    
1495 wakaba 1.149 * HTML.pm.src: s/local_name/manakai_local_name/g.
1496    
1497 wakaba 1.148 2008-03-01 Wakaba <wakaba@suika.fam.cx>
1498    
1499     * _NamedEntityList.pm: Updated (HTML5 revision 1286).
1500    
1501     * HTML.pm.src: |charset| in |content| attribute is
1502     case-insensitive (HTML5 revision 1270).
1503    
1504 wakaba 1.147 2008-02-26 Wakaba <wakaba@suika.fam.cx>
1505    
1506     * ContentChecker.pm: New status constants are added.
1507     ($ElementDefault): |status| added.
1508     (check_element): Err for non-standard or deprecated elements.
1509     (_attr_status_info): For non-standard or deprecated attributes.
1510    
1511 wakaba 1.142 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1512    
1513 wakaba 1.146 * ContentChecker.pm (_attr_status_info): New internal method.
1514    
1515     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1516    
1517 wakaba 1.145 * ContentChecker.pm (check_element): Element standardized
1518     status information is now dispatched.
1519    
1520     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1521    
1522 wakaba 1.144 * ContentChecker.pm (check_element): Fix |del|-and-significant
1523     problem by adding some more arguments.
1524    
1525     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1526    
1527 wakaba 1.143 * ContentChecker.pm (check_element): Use context of
1528     container-for-the-purpose-of-content-model element (not transparent
1529     element) for |check_child_element| calling and significant
1530     text flag marking. This reintroduces |<del>|-and-significant
1531     problem again.
1532    
1533     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1534    
1535 wakaba 1.142 * ContentChecker.pm (check_element): Make semi-transparent
1536     elements ignored for the purpose of phase changes in
1537     content model checking.
1538    
1539 wakaba 1.138 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1540    
1541 wakaba 1.141 * ContentChecker.pm (check_element): In-element state
1542     was not properly managed for transparent cases.
1543    
1544     2008-02-23 Wakaba <wakaba@suika.fam.cx>
1545    
1546 wakaba 1.140 * ContentChecker.pm (check_element): Support for |video|
1547     and |audio| as semi-transparent elements.
1548    
1549     2008-02-23 Wakaba <wakaba@suika.fam.cx>
1550    
1551 wakaba 1.139 * ContentChecker.pm ($HTMLSemiTransparentElements): New.
1552     (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
1553     Support for |html:object| as a semi-transparent element.
1554    
1555     2008-02-23 Wakaba <wakaba@suika.fam.cx>
1556    
1557 wakaba 1.138 * ContentChecker.pm (check_element): The way to traverse
1558     the tree is entirely revised to make it easier to track
1559     the state of ancestors/descendants. As a result of this
1560     revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1561     support for content model checking for HTML elements |figure|,
1562     |object|, |video|, and |audio| and checking for XML elements (and
1563     some XMLNS checkings) are dropped for now. They will be
1564     reimplemented in due cource.
1565    
1566 wakaba 1.134 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1567    
1568 wakaba 1.137 * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1569     token (HTML5 revision 1225).
1570    
1571     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1572    
1573 wakaba 1.136 * ContentChecker.pm ({unsupported_level}): New value.
1574    
1575     * HTML.pm.src: Save whether |meta| |content| attribute
1576     contains character references or not.
1577    
1578     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1579    
1580 wakaba 1.135 * ContentChecker.pm (_get_children): (Incompleted) attempt to
1581     imlement significant content checking for contents
1582     with |del| elements.
1583    
1584     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1585    
1586 wakaba 1.134 * ContenteChecker.pm ($HTMLTransparentElements): More
1587     elements are added.
1588     (_get_children): HTML |object| elements are now semi-transparent.
1589    
1590     * NanoDOM.pm (manakai_html, manakai_head): New methods.
1591    
1592 wakaba 1.133 2008-02-16 Wakaba <wakaba@suika.fam.cx>
1593    
1594     * CacheManifest.pm: HTML5 revision 1211 implemented.
1595    
1596     * CacheManifest.pod: Updated.
1597    
1598 wakaba 1.132 2008-02-10 Wakaba <wakaba@suika.fam.cx>
1599    
1600     * ContentChecker.pm (check_document, check_element): Support
1601     for second argument ($onsubdoc).
1602     (_get_css_parser): Removed (now it is part of WDCC).
1603    
1604 wakaba 1.131 2008-02-09 Wakaba <wakaba@suika.fam.cx>
1605    
1606     * ContentChecker.pm (_get_css_parser): New.
1607    
1608 wakaba 1.129 2007-11-25 Wakaba <wakaba@suika.fam.cx>
1609    
1610 wakaba 1.130 * ContentChecker.pm ($AnyChecker): Old way to add child elements
1611     for checking had been used.
1612    
1613     2007-11-25 Wakaba <wakaba@suika.fam.cx>
1614    
1615 wakaba 1.129 * ContentChecker.pm (check_element): New todo item type |descendant|.
1616    
1617 wakaba 1.125 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1618    
1619 wakaba 1.128 * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1620     specced in RFC 2046 and RFC 4288.
1621     (application/atom+xml): Definition added.
1622    
1623     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1624    
1625 wakaba 1.127 * URIChecker.pm: Make RFC 3986 should-level errors
1626     warnings (rather than SHOULD-level errors).
1627    
1628     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1629    
1630 wakaba 1.126 * NanoDOM.pm (get_user_data, set_user_data): New methods.
1631    
1632     * HTML.pm.src: A flag for character references in attribute
1633     values are added. Set |manakai_has_reference| user data
1634     to |charset| attribute.
1635    
1636     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1637    
1638 wakaba 1.125 * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1639     attributes.
1640    
1641     * ContentChecker.pm (check_document): Warn if charset requirements
1642     cannot be tested.
1643    
1644 wakaba 1.124 2007-11-19 Wakaba <wakaba@suika.fam.cx>
1645    
1646     * HTML.pm.src (parse_byte_string): Detect charset
1647     by universalchardet if charset parameter is not specified.
1648    
1649     * Makefile (Charset-all, Charset-clean): New rules.
1650    
1651 wakaba 1.120 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1652    
1653 wakaba 1.123 * ContentChecker.pm (check_document): Check the existence
1654     of character encoding declaration and selection of encoding
1655     for HTML document.
1656    
1657     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1658    
1659 wakaba 1.122 * ContentType.pm (get_sniffed_type): Return also the official
1660     type in list context.
1661    
1662     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1663    
1664 wakaba 1.121 * ContentType.pm: Sniffing with leading white space
1665     ignoring (HTML5 revisions 1013 and 1016).
1666    
1667     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1668    
1669 wakaba 1.120 * ContentType.pm: HTML5 revision 1013 changes, except for leading
1670     white spaces, are implemented.
1671    
1672 wakaba 1.117 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1673    
1674 wakaba 1.119 * HTML.pm.src (parse_char_string): Set |inner_encoding|
1675     attribute if possible.
1676    
1677     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1678    
1679 wakaba 1.118 * HTML.pm.src (parse_byte_string): New method.
1680     (parse_char_string): New alias for |parse_string|.
1681     (main phase): Invoking "change the encoding" algorithm if desired.
1682    
1683     * HTML.pod: Updated.
1684    
1685     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1686    
1687 wakaba 1.117 * HTML.pod (get_inner_html): Removed.
1688    
1689     * Makefile (HTML-all, HTML-clean): New.
1690    
1691     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1692    
1693     * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1694    
1695 wakaba 1.116 2007-11-08 Wakaba <wakaba@suika.fam.cx>
1696    
1697     * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1698    
1699 wakaba 1.112 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1700    
1701 wakaba 1.115 * Makefile: |CacheManifest.html| is added.
1702    
1703     * CacheManifest.pod: New file.
1704    
1705     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1706    
1707 wakaba 1.114 * CacheManifest.pm: New module.
1708    
1709     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1710    
1711 wakaba 1.113 * HTML.pm.src: Support for application cache selection algorithm
1712     callback.
1713    
1714     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1715    
1716 wakaba 1.112 * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1717    
1718 wakaba 1.110 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1719    
1720 wakaba 1.111 * Makefile (clean): New rule.
1721    
1722     * NanoDOM.pm (public_id, system_id): New attributes.
1723    
1724     2007-10-17 Wakaba <wakaba@suika.fam.cx>
1725    
1726 wakaba 1.110 * Makefile (CSS-all, CSS-clean, clean): New rules.
1727    
1728 wakaba 1.109 2007-10-14 Wakaba <wakaba@suika.fam.cx>
1729    
1730     * ContentChecker.pm (check_document): Support for
1731     new |is_xml_root| flag.
1732     (check_element): Support for new |pluses| state.
1733     (_add_pluses): New method.
1734     (_remove_minuses): Support for new |minus| item.
1735    
1736 wakaba 1.106 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1737    
1738 wakaba 1.107 * ContentChecker.pm: Raise specific error for invalid
1739     root element.
1740    
1741     2007-09-24 Wakaba <wakaba@suika.fam.cx>
1742    
1743 wakaba 1.106 * ContentChecker.pm: Set level values for later uses.
1744    
1745 wakaba 1.103 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1746    
1747 wakaba 1.105 * ContentChecker.pm: Support for language tag validation.
1748    
1749     2007-09-09 Wakaba <wakaba@suika.fam.cx>
1750    
1751 wakaba 1.104 * LangTag.pm (check_rfc3066_language_tag): New method.
1752    
1753     2007-09-09 Wakaba <wakaba@suika.fam.cx>
1754    
1755 wakaba 1.103 * LangTag.pm: New module.
1756    
1757 wakaba 1.102 2007-09-04 Wakaba <wakaba@suika.fam.cx>
1758    
1759     * HTML.pm.src: Some error types were wrong.
1760    
1761 wakaba 1.100 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1762    
1763 wakaba 1.101 * CSS/: New directory.
1764    
1765     2007-08-17 Wakaba <wakaba@suika.fam.cx>
1766    
1767 wakaba 1.100 * ContentChecker.pm (_check_get_children): Support
1768     for |noscript| in |head|.
1769    
1770 wakaba 1.99 2007-08-12 Wakaba <wakaba@suika.fam.cx>
1771    
1772     * URI/: New directory.
1773    
1774 wakaba 1.94 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1775    
1776 wakaba 1.98 * HTML.pm.src: Tokenizer's states are now represented in
1777     number.
1778    
1779     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1780    
1781 wakaba 1.97 * HTML.pm.src: |or|s for insertion modes are replaced
1782     by |&|s.
1783    
1784     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1785    
1786 wakaba 1.96 * HTML.pm.src: Token types are now represented in number.
1787    
1788     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1789    
1790 wakaba 1.95 * ContentType.pm (SEE ALSO): Updated.
1791    
1792     * HTML.pm.src: Insertion modes are now represented in number.
1793    
1794     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1795    
1796 wakaba 1.94 * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1797     999) is implemented.
1798    
1799 wakaba 1.93 2007-08-08 Wakaba <wakaba@suika.fam.cx>
1800    
1801     * ContentType.pm: Sniffing for |<script| (HTML5 revision
1802     983) is implemented.
1803    
1804 wakaba 1.92 2007-08-06 Wakaba <wakaba@suika.fam.cx>
1805    
1806     * ContentChecker.pod: New documentation.
1807    
1808     * Makefile: A rule for |ContentChecker.html| is added.
1809    
1810     * ContentChecker.pm: A pod "LICENSE" section is added.
1811    
1812     * NanoDOM.pm ($VERSION): New variable.
1813    
1814 wakaba 1.90 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1815    
1816 wakaba 1.91 * H2H.pm: |b|, |i|, and |sub| are added to the
1817     list of allowed HTML elements.
1818    
1819     2007-08-05 Wakaba <wakaba@suika.fam.cx>
1820    
1821 wakaba 1.90 * H2H.pm: |samp| is added to the list of allowed
1822     HTML elements.
1823    
1824     * URIChecker.pm (check_iri): New.
1825     (check_iri_reference): Error type for IRI reference
1826     syntax error is changed.
1827    
1828 wakaba 1.87 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1829    
1830 wakaba 1.89 * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1831     is added.
1832     (check_document): Load appropriate module before validation.
1833    
1834     2007-08-04 Wakaba <wakaba@suika.fam.cx>
1835    
1836 wakaba 1.88 * ContentChecker/: New directory.
1837    
1838     2007-08-04 Wakaba <wakaba@suika.fam.cx>
1839    
1840 wakaba 1.87 * ContentChecker.pm: HTML |time| element is implemented.
1841    
1842     * HTMLTable.pm: Comments are updated as HTML5 is revised.
1843    
1844 wakaba 1.71 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1845    
1846 wakaba 1.86 * ContentChecker.pm (check_document): Return value
1847     even if no document element is found.
1848    
1849     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1850    
1851 wakaba 1.85 * HTML.pm.src: |$in_body| is no longer a function.
1852    
1853     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1854    
1855 wakaba 1.84 * HTML.pm.src: The |$in_body| code has been moved down.
1856    
1857     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1858    
1859 wakaba 1.83 * HTML.pm.src: The "trailing end" insertion mode
1860     is split into "after html body" and "after html frameset"
1861     insertion modes. Their codes are merged with "after body"
1862     and "after frameset" codes. |$previous_insertion_mode|
1863     has been removed. "after frameset" code is
1864     merged with "in frameset" code.
1865    
1866     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1867    
1868 wakaba 1.82 * HTML.pm.src: The "before head" insertion mode is
1869     merged with the "in head" insertion mode.
1870    
1871     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1872    
1873 wakaba 1.81 * HTML.pm.src: Cases in "in head" insertion mode are
1874     reorganized.
1875    
1876     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1877    
1878 wakaba 1.80 * HTML.pm.src: Some cases in "in table" insertion mode
1879     are merged.
1880    
1881     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1882    
1883 wakaba 1.79 * HTML.pm.src: The "in row" insertion mode is merged
1884     with "in table" insertion mode.
1885    
1886     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1887    
1888 wakaba 1.78 * HTML.pm.src: The "in table" and "in table body" insertion
1889     modes are merged.
1890    
1891     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1892    
1893 wakaba 1.77 * HTML.pm.src: There is no "in table head" or "in table foot"
1894     insertion mode!
1895    
1896     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1897    
1898 wakaba 1.76 * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
1899     now directly invoke the handler.
1900    
1901     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1902    
1903 wakaba 1.75 * HTML.pm.src: Codes for "in cell" insertion mode
1904     is merged to the "in body" insertion mode code.
1905    
1906     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1907    
1908 wakaba 1.74 * HTML.pm.src: Codes for "in body" and "in caption"
1909     insertion modes are merged.
1910    
1911     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1912    
1913 wakaba 1.73 * HTML.pm.src: Two |!!!next-token|s were missing.
1914    
1915     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1916    
1917 wakaba 1.72 * HTML.pm.src: Use numeric constant for |{content_mode}|
1918     instead of string constant for |{content_model_flag}|.
1919    
1920     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1921    
1922 wakaba 1.71 * HTML.pm.src: Add the name of the attribute
1923     to the "duplicate attribute" error.
1924    
1925 wakaba 1.69 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1926    
1927 wakaba 1.70 * ContentChecker.pm: Return the |class| node list.
1928    
1929     2007-07-17 Wakaba <wakaba@suika.fam.cx>
1930    
1931 wakaba 1.69 * ContentChecker.pm: Return the |id| node list.
1932    
1933     * HTML.pm.src: A typo is fixed.
1934    
1935 wakaba 1.63 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1936    
1937 wakaba 1.68 * ContentChecker.pm: Drop wrong |level => 'error'| specification
1938     from "in HTML:xml:lang" error. Character position
1939     is now the last part of the error type in the URI error
1940     description. Report "unsupported" status for language
1941     tags, media queries, script codes, and style sheets.
1942    
1943     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1944    
1945 wakaba 1.67 * ContentChecker.pm: Report error if |xml:lang|
1946     in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
1947     in XML.
1948    
1949     * NanoDOM.pm (Attr.owner_document): New attribute.
1950    
1951     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1952    
1953 wakaba 1.66 * HTML.pm.src: The character immediately following
1954     a bare |hcro| was discarded. Fix handling of
1955     entity references in attribute values.
1956    
1957     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1958    
1959 wakaba 1.65 * HTML.pm.src (main and trailin end phases): Token
1960     types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
1961     factored out. Error types |in html:#DOCTYPE|
1962     and |after html:#DOCTYPE| are merged
1963     into |DOCTYPE in the middle|. |</frameset>|
1964     in fragment parsing mode changed the
1965     insertion mode.
1966    
1967     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1968    
1969 wakaba 1.64 * HTML.pm.src: |$phase| has been removed; The |trailing end|
1970     phase is now an insertion mode. Treatments for white
1971     space character tokens were incorrect for some
1972     insertion modes. An old |meta| case was not removed.
1973    
1974     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1975    
1976 wakaba 1.63 * HTML.pm.src: |meta| charset declaration extraction
1977     implemented (but changing the encoding is not yet:-).
1978    
1979 wakaba 1.60 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1980    
1981 wakaba 1.62 * Charset/: New directory.
1982    
1983     2007-07-15 Wakaba <wakaba@suika.fam.cx>
1984    
1985 wakaba 1.61 * H2H.pm: New Perl module (created from
1986     manakai's H2H.dis).
1987    
1988     2007-07-15 Wakaba <wakaba@suika.fam.cx>
1989    
1990 wakaba 1.60 * XMLSerializer.pm: New Perl module (created from
1991     manakai's SimpleLS.dis).
1992    
1993 wakaba 1.59 2007-07-07 Wakaba <wakaba@suika.fam.cx>
1994    
1995     * HTML.pm.src: |<!---x-->| was not processed correctly.
1996    
1997 wakaba 1.57 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1998    
1999 wakaba 1.58 * HTML.pm.src: Report correct error message
2000     for |<body></div></body>|.
2001    
2002     2007-07-01 Wakaba <wakaba@suika.fam.cx>
2003    
2004 wakaba 1.57 * HTMLTable.pm: An error description was incorrect.
2005    
2006     2007-06-30 Wakaba <wakaba@suika.fam.cx>
2007    
2008     * ContentChecker.pm: Return |{term}| list.
2009    
2010 wakaba 1.55 2007-06-30 Wakaba <wakaba@suika.fam.cx>
2011    
2012 wakaba 1.56 * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
2013     nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
2014     and <title> outside of head).
2015    
2016     2007-06-30 Wakaba <wakaba@suika.fam.cx>
2017    
2018 wakaba 1.55 * IMTChecker.pm: Report warning for unregistered
2019     and private types/subtypes.
2020    
2021     * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
2022     URIChecker.pm, HTMLTable.pm: Error messages are now
2023     consistent; they are all listed in
2024     <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
2025    
2026 wakaba 1.52 2007-06-25 Wakaba <wakaba@suika.fam.cx>
2027    
2028 wakaba 1.54 * ContentChecker.pm: |<img ismap>| not in |<a></a>|
2029     is now erred. |<datalist>| is implemented.
2030     Attribute checker for |<command>| and |<menu>| are
2031     added. Support for |contextmenu| global attribute
2032     is added.
2033    
2034     2007-06-25 Wakaba <wakaba@suika.fam.cx>
2035    
2036 wakaba 1.53 * HTML.pm.src (_reset_insertion_mode): Interpretation
2037     of Step 3 has been changed.
2038    
2039     2007-06-25 Wakaba <wakaba@suika.fam.cx>
2040    
2041 wakaba 1.52 * HTML.pm.src: Late |<html>| parse error is implemented.
2042    
2043 wakaba 1.47 2007-06-24 Wakaba <wakaba@suika.fam.cx>
2044    
2045 wakaba 1.51 * URIChecker.pm (check_iri_reference): A |decode| method name was
2046     incorrect.
2047    
2048     * ContentChecker.pm: Support for the |footer| element.
2049     Check URI syntax for space-separated URI attributes.
2050     Support for the |tabindex| attribute. Support
2051     for |datetime| attribute.
2052    
2053     2007-06-24 Wakaba <wakaba@suika.fam.cx>
2054    
2055 wakaba 1.50 * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
2056     character references). HTML5 revision 1.146 (white space
2057     characters before root start tag). HTML5 revision
2058     1.148 (named character references in attribute values).
2059     HTML5 revision 1.152 (<plaintext>.innerHTML get).
2060    
2061     2007-06-24 Wakaba <wakaba@suika.fam.cx>
2062    
2063     * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
2064 wakaba 1.49 in <head>).
2065    
2066     2007-06-24 Wakaba <wakaba@suika.fam.cx>
2067    
2068 wakaba 1.48 * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>
2069     in body).
2070    
2071     * ContentChecker.pm: HTML5 revision 938 (scoped="").
2072    
2073     2007-06-24 Wakaba <wakaba@suika.fam.cx>
2074    
2075 wakaba 1.47 * HTML.pm.src: HTML5 revision 923 (matching end tag in CDATA
2076     or RCDATA in fragment parsing mode). HTML5 revision
2077     924 (<!--> and <!--->). HTML5 revision 926 (hn in hn).
2078    
2079     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2080    
2081     * HTML.pm.src (get_inner_html): HTML5 revision 922 (inner_html
2082     for <pre> and <textarea>).
2083    
2084 wakaba 1.33 2007-06-23 Wakaba <wakaba@suika.fam.cx>
2085    
2086 wakaba 1.46 * HTML.pm.src: HTML5 revision 920 (<isindex>).
2087    
2088     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2089    
2090 wakaba 1.45 * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).
2091    
2092     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2093    
2094 wakaba 1.44 * HTML.pm.src: HTML5 revision 916 (</body>).
2095     HTML5 revision 917 (conforming bare &).
2096    
2097     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2098    
2099 wakaba 1.43 * NanoDOM.pm (manakai_is_html): Setting to false did not work.
2100    
2101     * HTML.pm.src: HTML5 revision 914 (</ in CDATA, RCDATA).
2102     HTML5 revision 915 (<nobr>).
2103    
2104     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2105    
2106 wakaba 1.42 * HTML.pm.src: HTML5 revisions 908, 909, 912, and 913 (quirks mode).
2107    
2108     * NanoDOM.pm (manakai_is_html, manakai_compat_mode, compat_mode):
2109     New attributes.
2110    
2111     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2112    
2113 wakaba 1.41 * HTML.pm.src: HTML5 revisions 900, 901, 902, and 911 (<
2114     in tags).
2115    
2116     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2117    
2118 wakaba 1.40 * .cvsignore: |Entities.html| is added.
2119    
2120     * HTML.pm.src: |$entity_char| is removed and
2121     requires |Whatpm::_NamedEntityList| instead.
2122     HTML5 revision 898 (refc), except that lack of refc
2123     is parse error.
2124    
2125     * mkentitylist.pl: New script.
2126    
2127     * Makefile (all): |_NamedEntityList.pm| is added.
2128     (_NamedEntityList.pm, Entities.html): New rules.
2129    
2130     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2131    
2132 wakaba 1.39 * HTML.pm.src: Parse errors immediately after U+000D
2133     were ignored and U+000D immediately following another
2134     U+000D was not converted to U+000A.
2135    
2136     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2137    
2138 wakaba 1.38 * HTML.pm.src (set_inner_html): HTML5 revision 892 (adopt
2139     nodes before appended). Parser was not ready for NULL
2140     parse error and escape flag.
2141    
2142     * NanoDOM.pm (adopt_node): New.
2143    
2144     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2145    
2146 wakaba 1.37 * HTML.pm.src: HTML5 revision 886 (insane comment in
2147     CDATA and RCDATA). Note that current implementation
2148     is simply repeating what the spec says and it is maybe not
2149     a best way to do it.
2150    
2151     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2152    
2153 wakaba 1.36 * HTML.pm.src: HTML5 revision 884 (</form> don't close
2154     the form element if a descendant element without implied end tag has
2155     still been open).
2156    
2157     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2158    
2159 wakaba 1.35 * ContentChecker.pm: HTML5 revision 881 (Make |id| attribute with
2160     space characters non-conforming).
2161    
2162     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2163    
2164 wakaba 1.34 * HTML.pm.src: An error message was incorrect.
2165     HTML5 revision 869 (C1 character references).
2166    
2167     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2168    
2169 wakaba 1.33 * HTML.pm.src: HTML5 revision 867 (a LF at the beginning of
2170     a |textarea| is removed).
2171    
2172 wakaba 1.32 2007-06-05 Wakaba <wakaba@suika.fam.cx>
2173    
2174     * NanoDOM.pm (get_attribute_node_ns): New method.
2175    
2176     * ContentChecker.pm: |script| |async| and |defer|
2177     no longer require |src|. |async| MUST NOT be
2178     specified if |defer|. (HTML5 revision 858).
2179    
2180 wakaba 1.31 2007-05-30 Wakaba <wakaba@suika.fam.cx>
2181    
2182     * HTML.pm.src: |<form><form>| went to inifinite loop.
2183    
2184 wakaba 1.27 2007-05-27 Wakaba <wakaba@suika.fam.cx>
2185    
2186 wakaba 1.30 * ContentChecker.pm (html): Set |is_root| (allowed
2187     as a document element) flag on.
2188     (new): Removed.
2189     (check_document): New method.
2190    
2191     2007-05-27 Wakaba <wakaba@suika.fam.cx>
2192    
2193 wakaba 1.29 * ContentChecker.pm (thead, tfoot): Checker specifications
2194     were incorrect.
2195    
2196     2007-05-27 Wakaba <wakaba@suika.fam.cx>
2197    
2198 wakaba 1.28 * ContentChecker.pm ($HTMLURIAttrChecker): Include
2199     error position in the |type| option of the error.
2200    
2201     * HTMLTable.pm (form_table): The |$onerror| parameter
2202     is now optional. Some bugs are fixed.
2203    
2204     2007-05-27 Wakaba <wakaba@suika.fam.cx>
2205    
2206 wakaba 1.27 * HTMLTable.pm: New module.
2207    
2208     * ContentChecker.pm (table): Invoke table model error checker.
2209    
2210     * NanoDOM.pm (first_child, get_attribute_ns): New.
2211    
2212 wakaba 1.25 2007-05-26 Wakaba <wakaba@suika.fam.cx>
2213    
2214 wakaba 1.26 * ContentChecker.pm ($HTMLLinkTypesAttrChecker): New checker.
2215     (link/@rel, a/@rel, area/@rel): Use new checker.
2216    
2217     * Makefile (_LinkTypeList.pm, RelExtensions.html): New rules.
2218    
2219     * _LinkTypeList.pm: New file.
2220    
2221     * mklinktypelist.pl: New file.
2222    
2223     * .cvsignore: |RelExtensions.html| added.
2224    
2225     * NanoDOM.pm (child_nodes): Returns an empty array
2226     for non-child-containing node types.
2227     (text_content): New attribute.
2228    
2229     2007-05-26 Wakaba <wakaba@suika.fam.cx>
2230    
2231 wakaba 1.25 * IMTChecker.pm: New module.
2232    
2233     * ContentChecker.pm ($HTMLIMTAttrChecker): Call IMTChecker
2234     to test parameter value validity.
2235    
2236     * HTML.pm.src ($style_start_tag): Attributes were
2237     discarded.
2238    
2239 wakaba 1.22 2007-05-25 Wakaba <wakaba@suika.fam.cx>
2240    
2241 wakaba 1.24 * ContentChecker.pm ($HTMLURIAttrChecker): Implemented.
2242    
2243     2007-05-25 Wakaba <wakaba@suika.fam.cx>
2244    
2245 wakaba 1.23 * URIChecker.pm: All recommendations from RFC 3986
2246     and RFC 3987 are listed (not all testable items are checked yet).
2247    
2248     2007-05-25 Wakaba <wakaba@suika.fam.cx>
2249    
2250 wakaba 1.22 * URIChecker.pm: New module.
2251    
2252 wakaba 1.18 2007-05-20 Wakaba <wakaba@suika.fam.cx>
2253    
2254 wakaba 1.21 * ContentChecker.pm: Now most attributes are implemented
2255     or associated to some placeholder.
2256     ($ElementDefault): Warn unknown attributes
2257     for unknown elements as "attribute not supported".
2258     ($HTMLLanguageTagAttrChecker, $HTMLMQAttrChecker): New placeholders.
2259     ($HTMLUsemapAttrChecker, $HTMLTargetAttrChecker): New checkers.
2260     (|a| attribute checker): Reimplemented.
2261    
2262     2007-05-20 Wakaba <wakaba@suika.fam.cx>
2263    
2264 wakaba 1.20 * ContentChecker.pm ($HTMLEventHandlerAttrChecker): New placeholder.
2265     ($HTMLAttrChecker): Event handler content attributes
2266     are added.
2267     (link, embed): Required attribute is now checked.
2268     (embed): Unknown local attributes are no longer warned.
2269    
2270     2007-05-20 Wakaba <wakaba@suika.fam.cx>
2271    
2272 wakaba 1.19 * ContentChecker.pm ($HTMLSpaceURIsAttrChecker): New placeholder.
2273     ($HTMLIMTAttrChecker): New checker.
2274     (link@rel, link@href, link@type, style@type,
2275     a@href, a@ping, a@ping, a@type, embed@src, embed@type,
2276     object@data, object@type, source@src, source@type, area@alt,
2277     area@shape, area@coords, area@href,
2278     area@ping, area@rel, area@type, script@src,
2279     script@defer, script@async, script@type): Checkers added.
2280    
2281     2007-05-20 Wakaba <wakaba@suika.fam.cx>
2282    
2283 wakaba 1.18 * ContentChecker.pm: Descendant checking was incorrect.
2284    
2285 wakaba 1.12 2007-05-19 Wakaba <wakaba@suika.fam.cx>
2286    
2287 wakaba 1.17 * ContentChecker.pm: Support |xml:*| and |xmlns:*|
2288     attributes. Report an error if |Element.prefix|
2289     is |xmlns|.
2290    
2291     * NanoDOM.pm (prefix): New attribute.
2292    
2293     2007-05-19 Wakaba <wakaba@suika.fam.cx>
2294    
2295 wakaba 1.16 * HTML.pm.src: In |main| phase, |in body| insertion
2296     mode, action for |<iframe>| was missing.
2297    
2298     2007-05-19 Wakaba <wakaba@suika.fam.cx>
2299    
2300 wakaba 1.15 * ContentChecker.pm: Support for many of HTML5 elements.
2301     ($GetHTMLNonNegativeIntegerAttrChecker): New.
2302    
2303     2007-05-19 Wakaba <wakaba@suika.fam.cx>
2304    
2305 wakaba 1.14 * ContentChecker.pm: Support for most elements up to |progress|.
2306     ($HTMLURIAttrChecker): Placeholder.
2307     ($HTMLIntegerAttrChecker, $GetHTMLFloatingPointNumberAttrChecker):
2308     New.
2309    
2310     2007-05-19 Wakaba <wakaba@suika.fam.cx>
2311    
2312 wakaba 1.13 * ContentChecker.pm: Attribute checkers
2313     for global attributes, |html|, |base|, |style|, and |meta|.
2314    
2315     * NanoDOM.pm (insert_before): Weaken reference
2316     to the parent node.
2317     (Attr::new): Set |owner_element| attribute.
2318     (namespace_uri, manakai_local_name): New attribute implementations.
2319     (owner_element): New attribute.
2320    
2321     2007-05-19 Wakaba <wakaba@suika.fam.cx>
2322    
2323 wakaba 1.12 * ContentChecker.pm ($AttrChecker, $HTMLAttrChecker,
2324     $AnyChecker->{attr_checker}, $HTMLAttrsChecker,
2325     $Element->{$HTML_NS}->{''}): New.
2326     (check_element): Invoke attrs_checker for each element.
2327    
2328 wakaba 1.6 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2329    
2330 wakaba 1.11 * ContentChecker.pm: Don't use |manakai_element_type_match|.
2331    
2332     2007-05-13 Wakaba <wakaba@suika.fam.cx>
2333    
2334 wakaba 1.10 * ContentChecker.pm: Use hashs rather than lists for
2335     element type testings.
2336    
2337     2007-05-13 Wakaba <wakaba@suika.fam.cx>
2338    
2339 wakaba 1.9 * ContentChecker.pm: Don't generate duplicate
2340     error when an element type is put in the "minus" list
2341     and the element type is not allowed explicitly in the particular
2342     element content model.
2343     (html:a checker): New checker.
2344     (html:details, html:datagrid): New checkers.
2345     (html:legend): New checker.
2346    
2347     2007-05-13 Wakaba <wakaba@suika.fam.cx>
2348    
2349 wakaba 1.8 * ContentChecker.pm (html:li checker): Implemented.
2350    
2351     2007-05-13 Wakaba <wakaba@suika.fam.cx>
2352    
2353 wakaba 1.7 * ContentChecker.pm ($HTMLInlineOrStriclyInlineChecker): New
2354     checker.
2355     (html:dd checker): New checker.
2356     (html:q, html:em, html:strong, html:small,
2357     html:m, html:dfn, html:code, html:samp, html:span): New checkers.
2358    
2359     2007-05-13 Wakaba <wakaba@suika.fam.cx>
2360    
2361 wakaba 1.6 * ContentChecker.pm ($AnyChecker): Renamed
2362     from |$ElementDefault->{checker}|.
2363     ($ElementDefault->{checker}): Throw an error that
2364     the element type is not supported by the checker.
2365     ($HTMLMetadataElement): |html:base| was missing.
2366     ($HTMLEmptyChecker): Don't throw an error
2367     for inter-element whitespace nodes.
2368     (html:html checker): Errors were not
2369     thrown even if |html:head| and/or |html:body|
2370     children were missing.
2371     (html:head checker): An error was not
2372     thrown if <meta charset> appered after other
2373     elements.
2374    
2375 wakaba 1.5 2007-05-05 Wakaba <wakaba@suika.fam.cx>
2376    
2377     * ContentChecker.pm: |footer|, |video|, |audio|, |script|,
2378     and |noscript| elements are implemented.
2379     (new): New method.
2380    
2381 wakaba 1.3 2007-05-04 Wakaba <wakaba@suika.fam.cx>
2382    
2383 wakaba 1.4 * ContentChecker.pm: New module.
2384    
2385     2007-05-04 Wakaba <wakaba@suika.fam.cx>
2386    
2387 wakaba 1.3 * NanoDOM.pm (manakai_parent_element,
2388     document_element, manakai_local_name,
2389     manakai_element_type_match): New method.
2390    
2391     2007-05-03 Wakaba <wakaba@suika.fam.cx>
2392    
2393     * HTML.pm.src: Replace decimal and hexadecimal numeric
2394     entities in C1 range using Windows-1252 mapping. Bare LF
2395     did not count as new line for error reporting.
2396    
2397 wakaba 1.2 2007-05-02 Wakaba <wakaba@suika.fam.cx>
2398    
2399     * NanoDOM.pm (DOMImplementation): New class.
2400     (append_child): Weaken the |parent_node| reference.
2401     (create_element_ns, Element new): Set the |owner_document|
2402     reference.
2403     (implementation): New attribute.
2404     (owner_document, local_name, namespace_uri): New attributes.
2405    
2406     * HTML.pm.src (parse_string): Line and column numbers
2407     are now provided to error handler.
2408     (!!!parse-error): Short descriptions are added.
2409     (_construct_tree): Split into three methods; support
2410     for innerHTML mode.
2411     (set_inner_html): New method.
2412    
2413 wakaba 1.1 2007-05-01 Wakaba <wakaba@suika.fam.cx>
2414    
2415     * NanoDOM.html: Documentation is added.
2416    
2417     * HTML.pod, ContentType.html: Documentation is revised.
2418    
2419     * .cvsignore: Pod2html temporary files are added.
2420    
2421     * Makefile: Make |NanoDOM.html|.
2422    
2423     2007-05-01 Wakaba <wakaba@suika.fam.cx>
2424    
2425     * HTML.pm.src (parse_string): New method.
2426     (get_inner_html): Renamed from |inner_html|.
2427    
2428     * Makefile: A rule for |HTML.html| is added.
2429    
2430     * HTML.pod: New documentation.
2431    
2432     2007-05-01 Wakaba <wakaba@suika.fam.cx>
2433    
2434     * NanoDOM.pm (last_child, previous_sibling): New attributes.
2435     (clone_node): Attribute nodes were not completely copied.
2436    
2437     * HTML.pm.src: Many bugs are fixed.
2438    
2439     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2440    
2441     * HTML.pm.src: Some typos are fixed.
2442    
2443     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2444    
2445     * mkhtmlparser.pl, Makefile: References to the |HTML-consume-entity.src|
2446     are removed.
2447    
2448     * HTML.pm.src: Tokenizer's handling on named entities are rewritten.
2449    
2450     * HTML-consume-entity.src: Removed.
2451    
2452     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2453    
2454     * HTML.pm.src: Tokenizer's handling on hexadecimal
2455     numeric entities are rewritten.
2456    
2457     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2458    
2459     * HTML.pm.src: Some tokenizer bugs are fixed.
2460    
2461     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2462    
2463     * HTML.pm.src: The tree construction stage is implemented.
2464    
2465     * mkhtmlparser.pl: New macros are added.
2466    
2467     2007-04-28 Wakaba <wakaba@suika.fam.cx>
2468    
2469     * ContentType.pm: A note on bug in the specification
2470     is removed since it's been now fixed.
2471    
2472     * .cvsignore: New file.
2473    
2474     2007-04-28 Wakaba <wakaba@suika.fam.cx>
2475    
2476     * HTML.pm.src, HTML-consume-entity.src: New files.
2477    
2478     * Makefile (HTML.pm): New rule.
2479    
2480     * mkhtmlparser.pl: New script.
2481    
2482     2007-04-25 Wakaba <wakaba@suika.fam.cx>
2483    
2484     * Makefile: New file.
2485    
2486     2007-04-24 Wakaba <wakaba@suika.fam.cx>
2487    
2488     * ContentType.pm: An error in pod is fixed.
2489    
2490     2007-04-24 Wakaba <wakaba@suika.fam.cx>
2491    
2492     * ContentType.pm: Documentation is added.
2493    
2494     2007-04-24 Wakaba <wakaba@suika.fam.cx>
2495    
2496     * ContentType.pm: New Perl module.
2497    
2498     * ChangeLog: New file.
2499    
2500    

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24