/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Contents of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.394 - (hide annotations) (download)
Sun Sep 6 02:20:52 2009 UTC (15 years, 10 months ago) by wakaba
Branch: MAIN
Changes since 1.393: +4 -0 lines
++ whatpm/Whatpm/ChangeLog	6 Sep 2009 02:20:24 -0000
	* HTML.pm.src: Simplified foster parenting (HTML5 revision 2732).

2009-09-06  Wakaba  <wakaba@suika.fam.cx>

1 wakaba 1.391 2009-09-06 Wakaba <wakaba@suika.fam.cx>
2    
3 wakaba 1.394 * HTML.pm.src: Simplified foster parenting (HTML5 revision 2732).
4    
5     2009-09-06 Wakaba <wakaba@suika.fam.cx>
6    
7 wakaba 1.393 * HTML.pm.src: No longer does foster parenting |style|, |script|,
8     and |input type=hidden| elements (HTML5 revision 2988).
9    
10     2009-09-06 Wakaba <wakaba@suika.fam.cx>
11    
12 wakaba 1.392 * HTML.pm.src: Dropped the |eventsource| element (HTML5 revision
13     2870).
14    
15     2009-09-06 Wakaba <wakaba@suika.fam.cx>
16    
17 wakaba 1.391 * HTML.pm.src (_tree_construction_main): Treate |keygen| as a void
18     element (HTML5 revision 2960).
19    
20 wakaba 1.389 2009-09-05 Wakaba <wakaba@suika.fam.cx>
21    
22 wakaba 1.390 * HTML.pm.src: Moved the "in table text" insertion mode code up
23     such that it does also work as specced in HTML5 in the remaining
24     cases (HTML5 revision 3382).
25    
26     2009-09-05 Wakaba <wakaba@suika.fam.cx>
27    
28 wakaba 1.389 * HTML.pm.src: Implementing the "in table text" insertion mode,
29     first half (HTML5 revision 3382).
30    
31 wakaba 1.388 2009-08-30 Wakaba <wakaba@suika.fam.cx>
32    
33     * ContentChecker.pm: Allow |xml:lang| in HTML document if the
34     owner element is not an HTML element (HTML5 revision 3697).
35    
36 wakaba 1.387 2009-08-23 Wakaba <wakaba@suika.fam.cx>
37    
38     * ContentChecker.pm: Allow <input type=hidden> in context where
39     interactive content is disallowed (HTML5 revision 2392).
40    
41 wakaba 1.380 2009-08-16 Wakaba <wakaba@suika.fam.cx>
42    
43 wakaba 1.386 * HTML.pm.src: Dropped support for "XSLT-compat" and added
44     "about:legacy-compat" (HTML5 revision 2725).
45    
46     2009-08-16 Wakaba <wakaba@suika.fam.cx>
47    
48 wakaba 1.385 * HTML.pm.src: "<!DOCTYPE>" is now explicitly supported in the
49     spec (HTML5 revision 2700).
50    
51     2009-08-16 Wakaba <wakaba@suika.fam.cx>
52    
53 wakaba 1.384 * HTML.pm.src: Added support for <script> element in the "in
54     select" and "in select in table" insertion modes (HTML5 revision
55     2500).
56    
57     2009-08-16 Wakaba <wakaba@suika.fam.cx>
58    
59 wakaba 1.383 * HTML.pm.src: An <hn> start tag in an <hn> element should close
60     the earlier element (HTML5 revision 2501).
61    
62     2009-08-16 Wakaba <wakaba@suika.fam.cx>
63    
64 wakaba 1.382 * HTML.pm.src: The </br> end tags before the <body> start tag
65     should equally survive as the first br element child of the body
66     element (HTML5 revision 2586).
67    
68     2009-08-16 Wakaba <wakaba@suika.fam.cx>
69    
70 wakaba 1.381 * HTML.pm.src: <font> in SVG land is handled as an SVG element as
71     long as it did not have HTML attributes (HTML5 revision 2590).
72    
73     2009-08-16 Wakaba <wakaba@suika.fam.cx>
74    
75 wakaba 1.380 * HTML.pm.src (_tree_construction_main): Improved error reporting
76     on "select" start tag in "in select" and "in select in table"
77     parsing modes (c.f. HTML5 revision 3604).
78    
79 wakaba 1.378 2009-07-26 Wakaba <wakaba@suika.fam.cx>
80    
81 wakaba 1.379 * mkhtmlclass.pl: Made xmlns:xlink with wrong namespace URL
82     invalid (HTML5 revision 2701).
83    
84     2009-07-26 Wakaba <wakaba@suika.fam.cx>
85    
86 wakaba 1.378 * HTML.pm.src: </rt>, </rp>, </option>, </optgroup> become
87     optional before </body> (HTML5 revision 2806).
88    
89 wakaba 1.376 2009-07-25 Wakaba <wakaba@suika.fam.cx>
90    
91 wakaba 1.377 * HTML.pm.src: Marked where SVG |script| execution should be
92     implemented (HTML5 revision 2904).
93    
94     2009-07-25 Wakaba <wakaba@suika.fam.cx>
95    
96 wakaba 1.376 * HTML.pm.src (set_inner_html): The fragment parsing algorithm
97     inherits the document quirksness flag of the main document (HTML5
98     revision 2935).
99    
100 wakaba 1.375 2009-07-05 Wakaba <wakaba@suika.fam.cx>
101    
102     * HTML.pm.src (_tree_construction_main): Put |table| in |p| in
103     quirks mode (The Hixie Quirk, HTML5 revision 3132).
104    
105 wakaba 1.372 2009-07-03 Wakaba <wakaba@suika.fam.cx>
106    
107 wakaba 1.374 * HTML.pm.src: <kegten> breaks out of <select> (HTML5 revision
108     3163).
109    
110     2009-07-03 Wakaba <wakaba@suika.fam.cx>
111    
112 wakaba 1.373 * HTML.pm.src: Don't change encoding to UTF-8 if the input
113     encoding is UTF-16 (HTML5 revision 3204).
114    
115     2009-07-03 Wakaba <wakaba@suika.fam.cx>
116    
117 wakaba 1.372 * HTML.pm.src (parse_byte_stream): Don't change to another
118     encoding if the input is UTF-16 (HTML5 revision 3205).
119    
120 wakaba 1.371 2009-06-28 Wakaba <wakaba@suika.fam.cx>
121    
122     * HTML.pm.src: |isindex| should no longer generate |p| element
123     node (HTML5 revision 3236).
124    
125 wakaba 1.368 2008-12-12 Wakaba <wakaba@suika.fam.cx>
126    
127 wakaba 1.370 * ContentChecker.pm: Introduced new flag |has_autofocus|. Added
128     support for new |id_type| value of |any|, which means any kind of
129     element is allowed as a referenced element.
130    
131     2008-12-12 Wakaba <wakaba@suika.fam.cx>
132    
133 wakaba 1.369 * ContentChecker.pm: Introduced |check_attrs2| method for the ease
134     of defining codes for checking required attribute and as such.
135    
136     2008-12-12 Wakaba <wakaba@suika.fam.cx>
137    
138 wakaba 1.368 * IMTChecker.pm: Added more definitions for subtypes.
139    
140 wakaba 1.367 2008-12-11 Wakaba <wakaba@suika.fam.cx>
141    
142     * URIChecker.pm: Some of |pos_end| values were wrong.
143    
144 wakaba 1.364 2008-12-06 Wakaba <wakaba@suika.fam.cx>
145    
146 wakaba 1.366 * ContentChecker.pm (check_element): Added support for "no
147     referenced datalist" error.
148    
149     2008-12-06 Wakaba <wakaba@suika.fam.cx>
150    
151 wakaba 1.365 * URIChecker.pm: Bug fix: It did not work unless
152     Message::DOM::DOMImplementation has been |require|d.
153    
154     2008-12-06 Wakaba <wakaba@suika.fam.cx>
155    
156 wakaba 1.364 * NanoDOM.pm (document_uri): New attribute.
157    
158     * ContentChecker.pm: Don't use methods not implemented by NanoDOM.
159    
160 wakaba 1.363 2008-11-07 Wakaba <wakaba@suika.fam.cx>
161    
162     * NanoDOM.pm (text_content): Don't create a Text node if the new
163     value is empty.
164    
165 wakaba 1.362 2008-11-06 Wakaba <wakaba@suika.fam.cx>
166    
167     * SWML/: New directory.
168    
169 wakaba 1.361 2008-10-20 Wakaba <wakaba@suika.fam.cx>
170    
171     * NanoDOM.pm (specified, all_declarations_processed,
172     manakai_attribute_type): New attributes.
173    
174 wakaba 1.359 2008-10-19 Wakaba <wakaba@suika.fam.cx>
175    
176 wakaba 1.360 * NanoDOM.pm (Entity->new): Initialize ->child_nodes as an empty
177     array.
178    
179     2008-10-19 Wakaba <wakaba@suika.fam.cx>
180    
181 wakaba 1.359 * NanoDOM.pm (notation_name): New attribute.
182    
183 wakaba 1.357 2008-10-18 Wakaba <wakaba@suika.fam.cx>
184    
185 wakaba 1.359 * NanoDOM.pm (public_id, system_id): New attributes.a
186 wakaba 1.358
187     2008-10-18 Wakaba <wakaba@suika.fam.cx>
188    
189 wakaba 1.357 * NanoDOM.pm (text_content): Moved to Node from Element. Setter
190     implemented.
191     (allowed_tokens, default_type, declared_type): Implemented.
192    
193 wakaba 1.356 2008-10-17 Wakaba <wakaba@suika.fam.cx>
194    
195     * NanoDOM.pm (node_name): New attribute.
196     (ELEMENT_TYPE_DEFINITION_NODE, ATTRIBUTE_DEFINITION_NODE): New
197     constants.
198     (create_element_type_definition_node, create_attribute_definition,
199     create_notation, create_general_entity,
200     get_element_type_definition_node,
201     set_element_type_definition_node, get_general_entity_node,
202     set_general_entity_node, get_notation_node, set_notation_node,
203     get_attribute_definition_node, set_attribute_definition_node): New
204     methods.
205     (element_types, entities, notations, attribute_definitions): New
206     attributes.
207     (DocumentType): Support for child nodes, entities, notations, and
208     element types.
209     (Entity, Notation, ElementTypeDefinition, AttributeDefinition):
210     New classes.
211    
212     * Dumper.pm: Support for general entities, notations, element type
213     definitions, and attribute definitions.
214    
215 wakaba 1.355 2008-10-15 Wakaba <wakaba@suika.fam.cx>
216    
217     * NanoDOM.pm (create_processing_instruction): New method.
218     (xml_version, xml_encoding, xml_standalone): New attributes.
219     (ProcessingInstruction): New class.
220    
221 wakaba 1.352 2008-10-14 Wakaba <wakaba@suika.fam.cx>
222    
223 wakaba 1.354 * HTML.pm.src: Handling of end tags in the foreign content
224     insertion mode was partially wrong, because of wrong bit
225     operations.
226    
227     2008-10-14 Wakaba <wakaba@suika.fam.cx>
228    
229 wakaba 1.353 * NanoDOM.pm (dom_config): New attribute (do nothing), for
230     Whatpm::XML::Parser support.
231    
232     2008-10-14 Wakaba <wakaba@suika.fam.cx>
233    
234 wakaba 1.352 * Makefile: New rule to make HTML/Tokenizer.pm is added.
235    
236     * HTML.pm.src: Tokenizer part moved to another file.
237    
238 wakaba 1.349 2008-10-13 Wakaba <wakaba@suika.fam.cx>
239    
240 wakaba 1.351 * HTML.pm.src: Merge |DT_EL| and |DD_EL| as |DTDD_EL|.
241    
242     2008-10-13 Wakaba <wakaba@suika.fam.cx>
243    
244 wakaba 1.350 * HTML.pm.src: Element category constants redefined.
245    
246     2008-10-13 Wakaba <wakaba@suika.fam.cx>
247    
248 wakaba 1.349 * HTML.pm.src: Steps for CDATA/RCDATA elements in tree
249     construction stage synced with the spec (HTML5 revisions 2139 and
250     2302).
251    
252 wakaba 1.348 2008-10-07 Wakaba <wakaba@suika.fam.cx>
253    
254     * ContentChecker.pm: New error level "html5_fact" added, which
255     should be tentatively used until all of requirements are properly
256     specced as RFC 2119 "MUST" in HTML5.
257    
258 wakaba 1.346 2008-10-05 Wakaba <wakaba@suika.fam.cx>
259    
260 wakaba 1.347 * ContentChecker.pod: Note on internal flags is added.
261    
262     2008-10-05 Wakaba <wakaba@suika.fam.cx>
263    
264 wakaba 1.346 * HTML.pm.src: An AAA bug fixed.
265    
266 wakaba 1.336 2008-10-04 Wakaba <wakaba@suika.fam.cx>
267    
268 wakaba 1.345 * HTML.pm.src: If another node is inserted by the parser, don't
269     reuse existing Text node to append a character (HTML5 revision
270     2124).
271    
272     2008-10-04 Wakaba <wakaba@suika.fam.cx>
273    
274 wakaba 1.344 * HTML.pm.src: Support for <option> and <optgroup> in body (HTML5
275     revisions 1731 and 2128).
276    
277     2008-10-04 Wakaba <wakaba@suika.fam.cx>
278    
279 wakaba 1.343 * HTML.pm.src: Make <foreignObject> scoping (HTML5 revision 1837).
280     Support for end tags of camelCase SVG elements were broken. A
281     wrong error type text fixed.
282    
283     2008-10-04 Wakaba <wakaba@suika.fam.cx>
284    
285 wakaba 1.342 * HTML.pm.src: Drop redundant code (HTML5 revision 1731).
286    
287     2008-10-04 Wakaba <wakaba@suika.fam.cx>
288    
289 wakaba 1.341 * HTML.pm.src: Support for new definition of |param| and |source|
290     start tag parsing (HTML5 revision 1731).
291    
292     2008-10-04 Wakaba <wakaba@suika.fam.cx>
293    
294 wakaba 1.340 * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
295    
296     2008-10-04 Wakaba <wakaba@suika.fam.cx>
297    
298 wakaba 1.339 * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
299     revisions 1731 and 1831).
300    
301     2008-10-04 Wakaba <wakaba@suika.fam.cx>
302    
303 wakaba 1.338 * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
304     revisions 1731 and 1778). Support for the </sarcasm> end tag (HTML5
305     revision 1731).
306    
307     2008-10-04 Wakaba <wakaba@suika.fam.cx>
308    
309 wakaba 1.337 * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
310     revision 1731). End tags of |option| and |optgroup| elements are
311     now optional (HTML5 revision 1731).
312    
313     2008-10-04 Wakaba <wakaba@suika.fam.cx>
314    
315 wakaba 1.336 * HTML.pm.src: New "special" elements added to the list (HTML5
316     revision 1778). "strile" -> "strike".
317    
318 wakaba 1.335 2008-10-02 Wakaba <wakaba@suika.fam.cx>
319    
320     * ContentType.pm (get_sniffed_type): Support for the "better"
321     content sniffing (HTML5 revision 1927). In a case the official
322     type was not returned when the method is invoked in the list
323     context.
324    
325 wakaba 1.334 2008-09-22 Wakaba <wakaba@suika.fam.cx>
326    
327     * HTML.pm.src: Character references for non-space C0 characters,
328     including U+000B VT, DEL character, noncharacter code points, are
329     now converted to the U+FFFD character (cf. HTML5 revision 2138).
330    
331 wakaba 1.330 2008-09-21 Wakaba <wakaba@suika.fam.cx>
332    
333 wakaba 1.333 * ContentChecker.pm: |form=""| check support added.
334    
335     2008-09-21 Wakaba <wakaba@suika.fam.cx>
336    
337 wakaba 1.332 * ContentChecker.pm: |contextmenu| validness is now checked using
338     |id| and |id_type| properties, and |menu| property is removed.
339    
340     2008-09-21 Wakaba <wakaba@suika.fam.cx>
341    
342 wakaba 1.331 * ContentChecker.pm: Prepare for |form| |name| attribute's
343     duplication checking.
344    
345     2008-09-21 Wakaba <wakaba@suika.fam.cx>
346    
347 wakaba 1.330 * HTML.pm.src (parse_byte_stream): Support (or non-support) for
348     unsupported charset="" parameter value (HTML5 revision 2131).
349    
350 wakaba 1.325 2008-09-20 Wakaba <wakaba@suika.fam.cx>
351    
352 wakaba 1.329 * HTML.pm.src: Reminding places where U+000B is allowed as a space
353     character is fixed (cf. HTML5 revision 1738).
354    
355     * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
356     space characters (HTML5 revision 1738).
357    
358     2008-09-20 Wakaba <wakaba@suika.fam.cx>
359    
360 wakaba 1.328 * HTML.pm.src: The "anything else" case for the "after after body"
361     insertion mode was not updated to swtich to the "in body"
362     insertion mode. U+000B is no longer a space character for the
363     purpose of tree construction phase (HTML5 revision 1738).
364    
365     2008-09-20 Wakaba <wakaba@suika.fam.cx>
366    
367 wakaba 1.327 * HTML.pm.src: U+000B is no longer a space character (HTML5
368     revision 1738).
369    
370     2008-09-20 Wakaba <wakaba@suika.fam.cx>
371    
372 wakaba 1.326 * ContentType.pm: 0x0B is no longer a space character (HTML5
373     revision 1738).
374    
375     * HTML.pm.src: U+000B is no longer a space character for the
376     algorithm for extracting an encoding from a Content-Type (HTML5
377     revision 1738).
378    
379     2008-09-20 Wakaba <wakaba@suika.fam.cx>
380    
381 wakaba 1.325 * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
382    
383 wakaba 1.320 2008-09-18 Wakaba <wakaba@suika.fam.cx>
384    
385 wakaba 1.324 * LangTag.pm: Add checks for remaining requirements from RFC 4646.
386    
387     * mklangreg.pl: Sort 'Prefix' values by their length, to ease
388     matching.
389    
390     2008-09-18 Wakaba <wakaba@suika.fam.cx>
391    
392 wakaba 1.323 * LangTag.pm: Warn for private use language subtags. Error level
393     typos fixed. Support for Suppress-Script field.
394    
395     * mklangreg.pl: Support for dumping of nested structure.
396    
397     2008-09-18 Wakaba <wakaba@suika.fam.cx>
398    
399 wakaba 1.322 * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
400     recommended case as per RFC 4646.
401    
402     2008-09-18 Wakaba <wakaba@suika.fam.cx>
403    
404 wakaba 1.321 * LangTag.pm (check_rfc4646_langtag): New method.
405    
406     2008-09-18 Wakaba <wakaba@suika.fam.cx>
407    
408 wakaba 1.320 * mklangreg.pl: New script.
409    
410     * Makefile: Updated for creation of the module for language subtag
411     registry.
412    
413 wakaba 1.315 2008-09-16 Wakaba <wakaba@suika.fam.cx>
414    
415 wakaba 1.319 * Makefile: WebIDL.html added.
416    
417     * WebIDL.pod: New documentation.
418    
419     2008-09-16 Wakaba <wakaba@suika.fam.cx>
420    
421 wakaba 1.318 * WebIDL.pm: Checker's error types are redefined.
422    
423     2008-09-16 Wakaba <wakaba@suika.fam.cx>
424    
425 wakaba 1.317 * WebIDL.pm: Parser's error types are redefined. Some forward
426     compatible parsing bugs are fixed. Some unreachable codes are
427     commented out.
428    
429     2008-09-16 Wakaba <wakaba@suika.fam.cx>
430    
431 wakaba 1.316 * WebIDL.pm: Support for the reminding extended attributes are
432     added. It does not satisfy the definition that a forward
433     interface declaration has an extended attribute. It seems that
434     unless explicitly allowed multiple extended attributes with the
435     same name is not allowed, though it is not explicitly mentioned in
436     the spec.
437    
438     2008-09-16 Wakaba <wakaba@suika.fam.cx>
439    
440 wakaba 1.315 * WebIDL.pm: Unescapes extended attribute names and extended
441     attribute identifiers. Preserve whether an extended attribute has
442     an argument list of not. Support for extended attributes:
443     Constructor, ExceptionConsts, IndexGetter, IndexSetter,
444     NameGetter, NameSetter, and Null.
445     (has_argument_list): New attribute.
446     (idl_text): Stringifies argument lists, if any, even if it is
447     empty.
448    
449 wakaba 1.310 2008-09-15 Wakaba <wakaba@suika.fam.cx>
450    
451 wakaba 1.314 * HTML.pm.src: New state |PCDATA_STATE|. Use an empty string for
452     |{s_kwd}| in DATA_STATE as default.
453    
454     2008-09-15 Wakaba <wakaba@suika.fam.cx>
455    
456 wakaba 1.313 * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
457     by |{s_kwd}| in DATA_STATE.
458    
459     2008-09-15 Wakaba <wakaba@suika.fam.cx>
460    
461 wakaba 1.312 * HTML.pm.src: Shorten keys.
462    
463     2008-09-15 Wakaba <wakaba@suika.fam.cx>
464    
465 wakaba 1.311 * HTML.pm.src: Remove checking for control character, surrogate
466     pair, or noncharacter code points and non-Unicode code
467     points (they should be handled by Whatpm::Charset::UnicodeChecker).
468     (parse_char_stream): Support for the |$get_wrapper| argument and
469     character stream error handlers.
470    
471     2008-09-15 Wakaba <wakaba@suika.fam.cx>
472    
473 wakaba 1.310 * ContentChecker.pm: Don't call |loda_ns_module|
474     for null-namespace elements/attributes.
475    
476     * HTML.pm.src: Fact out $disallowed_control_chars
477     as a hash.
478    
479 wakaba 1.300 2008-09-14 Wakaba <wakaba@suika.fam.cx>
480    
481 wakaba 1.309 * HTML.pm.src: Regexp typo fixed. |{prev_char}|
482     and |{next_char}| initializations are moved to initialization
483     method. |{read_until}| now supports buffering. Sync |set_inner_html|
484     with |parse_char_stream|.
485    
486     2008-09-14 Wakaba <wakaba@suika.fam.cx>
487    
488 wakaba 1.308 * HTML.pm.src (parse_char_stream): Make |set_next_char|
489     invoke |manakai_read_until|, not only |read|, where
490     possible, to decrease the number of |read| method calls.
491    
492     * mkhtmlparser.pl: Related changes to the aforementioned
493     modification.
494    
495     2008-09-14 Wakaba <wakaba@suika.fam.cx>
496    
497 wakaba 1.307 * HTML.pm.src: Use |read| instead of |getc|. |set_inner_html|
498     would report character error from now.
499    
500     2008-09-14 Wakaba <wakaba@suika.fam.cx>
501    
502 wakaba 1.306 * HTML.pm.src: White-space-leaded non-white-space character
503     tokens in "before head insertion mode" was not
504     correctly handled.
505     (set_inner_html): Reimplemented using CharString decodehandle
506     class. Support for $get_wrapper argument. Support
507     for |{read_until}| feature.
508    
509     2008-09-14 Wakaba <wakaba@suika.fam.cx>
510    
511 wakaba 1.305 * HTML.pm.src: Make a "bare ero" error for unknown
512     entities point the "&" character.
513    
514     2008-09-14 Wakaba <wakaba@suika.fam.cx>
515    
516 wakaba 1.304 * HTML.pm.src: It turns out that U+FFFD don't have to
517     be added to the list of excluded characters.
518    
519     2008-09-14 Wakaba <wakaba@suika.fam.cx>
520    
521 wakaba 1.303 * HTML.pm.src ($char_onerror): Have character decoder's |line|
522     and |column| a higher priority than the one set by the
523     tokenizer's input handler.
524     ($self->{read_until}): Exclude U+FFFD (but this might
525     not be necessary, since now we do line/column fixup in
526     the character decode handle).
527    
528     2008-09-14 Wakaba <wakaba@suika.fam.cx>
529    
530 wakaba 1.302 * HTML.pm.src: Use |{read_until}| where possible.
531    
532     2008-09-14 Wakaba <wakaba@suika.fam.cx>
533    
534 wakaba 1.301 * HTML.pm.src: Change |{getc_until}| to |{read_until}|
535     and |manakai_getc_until| to |manakai_read_until| to
536     reduce the number of string copies.
537    
538     2008-09-14 Wakaba <wakaba@suika.fam.cx>
539    
540 wakaba 1.300 * HTML.pm.src (parse_char_string): Use newly created
541     |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
542     standard feature to |open| a string as a filehandle,
543     since Perl's string filehandle seems not supporting |ungetc|
544     method correctly.
545     (parse_char_stream): Define |{getc_until}| method.
546     (DATA_STATE): Experimental support for |getc_until| feature.
547    
548 wakaba 1.292 2008-09-13 Wakaba <wakaba@suika.fam.cx>
549    
550 wakaba 1.299 * HTML.pm.src: Check points added to newly added branches.
551    
552     2008-09-13 Wakaba <wakaba@suika.fam.cx>
553    
554 wakaba 1.298 * HTML.pm.src: Remove |{char}|, which is no longer used.
555     Remove |{entity_in_attr}| and |{last_attribute_value_state}|
556     and replaced by |{prev_state}|.
557    
558     * mkhtmlparser.pl: Remove |{char}| feature.
559     Remove |!!!back-next-input-character;| macro.
560    
561     2008-09-13 Wakaba <wakaba@suika.fam.cx>
562    
563 wakaba 1.297 * HTML.pm.src: Finally we get rid of all the inner loops. Remove
564     entity related tokenizer states in favor of new states
565     implementing the consume character reference algorithm.
566    
567     2008-09-13 Wakaba <wakaba@suika.fam.cx>
568    
569     * HTML.pm.src: "Consume a character reference" algorithm is
570 wakaba 1.296 now implemented as a tokenizer's state, rather than
571     a method, with minimum changes (more changes will
572     be made, in due course). "Bogus comment state"'s inner
573     loop gets removed.
574    
575     2008-09-13 Wakaba <wakaba@suika.fam.cx>
576    
577 wakaba 1.297 * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
578 wakaba 1.295 into their own tokenizer states.
579    
580     2008-09-13 Wakaba <wakaba@suika.fam.cx>
581    
582 wakaba 1.294 * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
583     is split into three states.
584    
585     2008-09-13 Wakaba <wakaba@suika.fam.cx>
586    
587 wakaba 1.293 * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
588     itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
589     no longer does the tokenizer have to push back next input
590     characters in those states.
591    
592     2008-09-13 Wakaba <wakaba@suika.fam.cx>
593    
594 wakaba 1.292 * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
595     into four states so that no longer does the tokenizer have to push
596     back next input characters in that state.
597    
598 wakaba 1.291 2008-09-11 Wakaba <wakaba@suika.fam.cx>
599    
600     * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
601     which can be used to insert some wrapper between the character
602     stream handle and the tokenizer. (It is currently not supported
603     for |set_inner_html| for |Element|s).
604    
605 wakaba 1.289 2008-09-10 Wakaba <wakaba@suika.fam.cx>
606    
607 wakaba 1.290 * HTML.pm.src: Ignore punctuations in charset names.
608    
609     2008-09-10 Wakaba <wakaba@suika.fam.cx>
610    
611 wakaba 1.289 * ContentChecker.pm: Support for charset-layer error levels.
612    
613     * HTML.pm.src: Don't specify |text| argument for the
614     |chardecode:fallback| error, since it is not the encoding
615     being used alternatively.
616    
617 wakaba 1.288 2008-09-06 Wakaba <wakaba@suika.fam.cx>
618    
619     * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
620    
621 wakaba 1.286 2008-08-31 Wakaba <wakaba@suika.fam.cx>
622    
623 wakaba 1.287 * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
624    
625     2008-08-31 Wakaba <wakaba@suika.fam.cx>
626    
627 wakaba 1.286 * HTML.pm.src: Bug fix and sync with the spec with regard
628     to after after frameset insertion mode processing (HTML5
629     revision 1909). Note that the implementation was wrong
630     per the old spec before the r1909 changes.
631    
632 wakaba 1.278 2008-08-30 Wakaba <wakaba@suika.fam.cx>
633    
634 wakaba 1.285 * HTMLTable.pm: scope=auto algorithm fix synced with the
635     spec (HTML5 revision 2093).
636     ($process_row): Algorithm step numbers synced with the
637     spec (HTML5 revision 2092).
638    
639     2008-08-30 Wakaba <wakaba@suika.fam.cx>
640    
641 wakaba 1.284 * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
642     revision 2094).
643    
644     2008-08-30 Wakaba <wakaba@suika.fam.cx>
645    
646     * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
647    
648     2008-08-30 Wakaba <wakaba@suika.fam.cx>
649    
650 wakaba 1.283 * HTML.pm.src: '"' and "'" at the end of attribute
651     name (after another attribute) now raise parse error (HTML5
652     revision 2123). Empty unquoted attribute values are no
653     longer allowed (HTML5 revision 2122).
654    
655     2008-08-30 Wakaba <wakaba@suika.fam.cx>
656    
657 wakaba 1.282 * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
658     revision 2130).
659    
660     2008-08-30 Wakaba <wakaba@suika.fam.cx>
661    
662 wakaba 1.281 * ContentChecker.pm: |xml:lang| attribute value must be same
663     as |lang| attribute value for HTML elements (HTML5 revision 2062
664     and so on).
665    
666     2008-08-30 Wakaba <wakaba@suika.fam.cx>
667    
668 wakaba 1.280 * ContentChecker.pm: Error level definition for |xml_id_error|
669     was missing.
670    
671     * URIChecker.pm: The end of the URL should be marked as the
672     error location for an empty path error. The position
673     between the userinfo and the port components should be
674     marked as the error location for an empty host error.
675    
676     2008-08-30 Wakaba <wakaba@suika.fam.cx>
677    
678 wakaba 1.279 * URIChecker.pm: Set parameters representing where in the
679     value the error occurs for errors. Report unknown
680     address format error in warning level, since address
681     formats are rarely added. Path segments starting with "/.."
682     were misinterpreted as a dot-segment.
683    
684     2008-08-30 Wakaba <wakaba@suika.fam.cx>
685    
686 wakaba 1.278 * URIChecker.pm (check_iri_reference): Requires
687     |Message::DOM::DOMImplementation|.
688    
689 wakaba 1.277 2008-08-29 Wakaba <wakaba@suika.fam.cx>
690    
691     * IMTChecker.pm: Updated for the new error reporting architecture.
692    
693     * ContentChecker.pm: Error levels for IMTs are added.
694    
695 wakaba 1.276 2008-08-17 Wakaba <wakaba@suika.fam.cx>
696    
697     * H2H.pm (_shift_token): Support for unquoted HTML attribute
698     values.
699    
700 wakaba 1.274 2008-08-16 Wakaba <wakaba@suika.fam.cx>
701    
702 wakaba 1.275 * CacheManifest.pm: Support for new style of error
703     reports.
704    
705     * HTML.pm.src: Set line=1, column=1 to the document node.
706    
707     2008-08-16 Wakaba <wakaba@suika.fam.cx>
708    
709 wakaba 1.274 * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
710     and URL checkers. Support for more error levels for bogus
711     langauge tag and URL "standards".
712    
713     * LangTag.pm, URIChecker.pm: Support for new style error
714     level reporting.
715    
716 wakaba 1.271 2008-08-15 Wakaba <wakaba@suika.fam.cx>
717    
718 wakaba 1.273 * ContentChecker.pm: Support for RDF/XML error levels.
719    
720     * HTMLTable.pm, RDFXML.pm: Support for new style of error level
721     specifying. Error types are revised.
722    
723     2008-08-15 Wakaba <wakaba@suika.fam.cx>
724    
725 wakaba 1.272 * ContentChecker.pm: All error reporting method calls are
726     renewed.
727    
728     2008-08-15 Wakaba <wakaba@suika.fam.cx>
729    
730 wakaba 1.271 * HTML.pm.src: All error type names and "text" parameters
731     are revised. Use new style for "level" specification.
732    
733     * mkhtmlparser.pl: Use new style for "level" specification.
734    
735 wakaba 1.268 2008-08-03 Wakaba <wakaba@suika.fam.cx>
736    
737 wakaba 1.270 * WebIDL.pm (parse_char_string): Simplified error
738     reporting process for broken ignored valuetype definition.
739     (Valuetype idl_text): Support for special "DOMString" name.
740    
741     2008-08-03 Wakaba <wakaba@suika.fam.cx>
742    
743 wakaba 1.269 * WebIDL.pm ($get_scoped_name): Append "::::" if the last
744     terminal of the ScopedName is "DOMString", such that whether
745     the last part of the scoped name is "DOMString" or "_DOMString"
746     later. It is necessary to determine whether a |typedef|
747     definition should be ignored or not.
748     (parse_char_string): Unescape the identifier of
749     exception members.
750     ($resolve): Return undef for builtin types and sequence<T>
751     types (we might not have to do this, however...).
752     (check): Support checking for Exceptions, Valuetypes,
753     and Typedefs.
754     ($serialize_type): Support for "DOMString::::" syntax.
755     (Typedef idl_text): Output Type as "DOMString" if it
756     is really "DOMString" (i.e. its internal representation
757     is "::DOMString::").
758    
759     2008-08-03 Wakaba <wakaba@suika.fam.cx>
760    
761 wakaba 1.268 * WebIDL.pm ($resolve): New code, based on resolve code
762     for constant types in the |check| method.
763     (check): Support for checking of attributes, operations, and
764     arguments.
765     (Attribute/Operation idl_text): Exception names in getraises,
766     setraises, and raises clauses is serizlied by |$serialize_type|
767     code.
768    
769 wakaba 1.264 2008-08-02 Wakaba <wakaba@suika.fam.cx>
770    
771 wakaba 1.267 * WebIDL.pm ($integer): Order of selections are changed to match
772     hexadecimal numbers (the original pattern, taken from the spec,
773     was not work for hexadecimal numbers, because the "0" prefix
774     matches to the [0-7]* part (as an empty string) and therefore
775     it does not match with remaining "x..." part of a "0x..." integer
776     literal.
777     ($get_type): It now returns a string, not an array reference,
778     for regular types and |sequence| types (i.e. it in any case
779     returns a string).
780     ($get_next_token): The second item in the array that represents
781     a integer or float token is now a Perl number value, not the
782     original string representation of the number.
783     (check): Support for const value consistency checking.
784     No extended attribute is defined for constants.
785     (Node subclasses): Use simple strings rather than array references
786     for default data type values.
787     ($serialize_type): Type values are now simple strings.
788     (value): If the new attribute value is a false value, then
789     a FALSE value is set to the attribute.
790    
791     2008-08-02 Wakaba <wakaba@suika.fam.cx>
792    
793 wakaba 1.266 * WebIDL.pm ($get_scoped_name): Now scoped names are stored
794     in its stringified format ("scoped name" as defined in the
795     spec). Note that future version of this module should not use
796     array references for type values and the |type_text| attribute
797     should be made obsolete.
798     (parse_char_string): Unescape attribute names.
799     (check): Support for checking of whether inherited interfaces
800     are actually defined or not. Support for checking of whether
801     interface member identifiers are duplicated or not.
802     ($serialize_type): Scoped names are returned as is. A future
803     version of this code should escape identifiers other than "DOMString",
804     otherwise the idl_text would be non-conforming.
805    
806     2008-08-02 Wakaba <wakaba@suika.fam.cx>
807    
808 wakaba 1.264 * WebIDL.pm (parse_char_string): Set line/column numbers
809 wakaba 1.265 to generated nodes. Unescape identifiers. Extended attributes
810     for Definition's were ignored.
811     (append_child): Set |parent_node| attribute.
812     (parent_node): New attribute.
813     (check): Support interface/exception members. Support
814     extended attributes. Support definition identifier uniqueness
815     constraint.
816     (qualified_name): New attribute.
817     (Interface/Exception idl_text): Extended attributes were
818     not prepended to the returned text.
819    
820     2008-08-02 Wakaba <wakaba@suika.fam.cx>
821    
822     * WebIDL.pm (parse_char_string): Set line/column numbers
823 wakaba 1.264 to interface object experimentally. s/shift/pop/g, shift
824     would make things wrong. Support for interface forward
825     declarations was missing. Broken interface declarations
826     with no block were not ignored entirely.
827     (Whatpm::WebIDL::Node): New abstract class. This class
828     makes things easier.
829     (child_nodes): New attribute. Unlike DOM's attribute with
830     same name, this attribute returns a dead list of nodes for
831     simplicity.
832     (get_user_data, set_user_data): New methods.
833     (Module idl_text): A SPACE character should be inserted
834     before the |{| character.
835     (Interface idl_text): Support for interface forward declarations.
836     (is_forward_declaration): New attribute.
837    
838 wakaba 1.259 2008-07-19 Wakaba <wakaba@suika.fam.cx>
839    
840 wakaba 1.263 * WebIDL.pm (type_text): Better serializer.
841    
842     2008-07-19 Wakaba <wakaba@suika.fam.cx>
843    
844 wakaba 1.262 * WebIDL.pm: Revise forward-compatible parsing so that
845     it now can handle broken extended attributes and as such.
846    
847     2008-07-19 Wakaba <wakaba@suika.fam.cx>
848    
849 wakaba 1.261 * WebIDL.pm: Real support for extended attributes.
850     Support for extended attributes with arguments.
851    
852     2008-07-19 Wakaba <wakaba@suika.fam.cx>
853    
854 wakaba 1.260 * WebIDL.pm: Support for |exception| syntax.
855     (Interface->idl_text): Tentative support for inheritances.
856    
857     2008-07-19 Wakaba <wakaba@suika.fam.cx>
858    
859 wakaba 1.259 * WebIDL.pm: Hierarchical scoped name support was broken.
860     Support for raises, setraises, and getraises syntaxes.
861    
862 wakaba 1.258 2008-07-18 Wakaba <wakaba@suika.fam.cx>
863    
864     * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
865     proper support for types, extended attributes, and exceptions yet).
866     WebIDL parser, version 1 (no support for exceptions yet,
867     no proper support for extended attributes yet).
868    
869 wakaba 1.257 2008-07-09 Wakaba <wakaba@suika.fam.cx>
870    
871     * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
872    
873 wakaba 1.255 2008-06-29 Wakaba <wakaba@suika.fam.cx>
874    
875 wakaba 1.256 * WebIDL.pm: Support for valuetype and const.
876    
877     2008-06-29 Wakaba <wakaba@suika.fam.cx>
878    
879 wakaba 1.255 * WebIDL.pm: New module.
880    
881 wakaba 1.254 2008-06-15 Wakaba <wakaba@suika.fam.cx>
882    
883     * Makefile (Entities.html): URI changed.
884    
885 wakaba 1.253 2008-06-08 Wakaba <wakaba@suika.fam.cx>
886    
887     * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
888    
889 wakaba 1.251 2008-06-01 Wakaba <wakaba@suika.fam.cx>
890    
891 wakaba 1.252 * HTML.pm.src (_get_next_token): A parse error was missing.
892    
893     2008-06-01 Wakaba <wakaba@suika.fam.cx>
894    
895 wakaba 1.251 * mklinktypelist.pl: rel=contact is no longer part of the HTML5
896     spec (commented out). (HTML5 revision 1711).
897    
898 wakaba 1.248 2008-05-25 Wakaba <wakaba@suika.fam.cx>
899    
900 wakaba 1.250 * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
901    
902     * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
903     as UTF-16 (HTML5 revision 1701).
904    
905     2008-05-25 Wakaba <wakaba@suika.fam.cx>
906    
907 wakaba 1.249 * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
908     1692).
909    
910     2008-05-25 Wakaba <wakaba@suika.fam.cx>
911    
912     * HTML.pm.src: The secondary insertion mode used when switching
913     to foreign content is the "in body" insertion mode (HTML5 revision
914     1696).
915    
916     2008-05-25 Wakaba <wakaba@suika.fam.cx>
917    
918 wakaba 1.248 * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
919     revision 1697).
920    
921 wakaba 1.239 2008-05-24 Wakaba <wakaba@suika.fam.cx>
922    
923 wakaba 1.247 * HTML.pm.src: Support for end-of-file token in foreign content
924     insertion mode (HTML5 revision 1693). Update SVG camelCase
925     attribute list (HTML5 revision 1700). <textarea> closes
926     </select> (HTML5 revision 1699). More start tags close in
927     foreign content insertion mode (HTML5 revision 1698).
928    
929     2008-05-24 Wakaba <wakaba@suika.fam.cx>
930    
931 wakaba 1.246 * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
932    
933     2008-05-24 Wakaba <wakaba@suika.fam.cx>
934    
935 wakaba 1.245 * HTML.pm.src: More robust charset parameter detection (HTML5
936     revision 1674).
937    
938     2008-05-24 Wakaba <wakaba@suika.fam.cx>
939    
940 wakaba 1.244 * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
941     revision 1676).
942    
943     2008-05-24 Wakaba <wakaba@suika.fam.cx>
944    
945 wakaba 1.243 * HTML.pm.src: Ignore language part of public identifiers for
946     quriks mode detection (HTML5 revision 1679).
947    
948     2008-05-24 Wakaba <wakaba@suika.fam.cx>
949    
950 wakaba 1.242 * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
951     revision 1685).
952    
953     2008-05-24 Wakaba <wakaba@suika.fam.cx>
954    
955 wakaba 1.241 * HTML.pm.src: Support for EOF in new states for tags (HTML5
956     revision 1684).
957    
958     2008-05-24 Wakaba <wakaba@suika.fam.cx>
959    
960 wakaba 1.240 * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
961     work (HTML5 revision 1690).
962    
963     2008-05-24 Wakaba <wakaba@suika.fam.cx>
964    
965 wakaba 1.239 * HTML.pm.src (_tree_construction_main): Change handling of
966     end tags in head insertion modes (HTML5 revision 1686).
967     (parse_char_string): Bug fix for non-utf8 character string handlings.
968     (parse_char_stream): |ungetc| does not work well for this context.
969    
970 wakaba 1.237 2008-05-18 Wakaba <wakaba@suika.fam.cx>
971    
972 wakaba 1.238 * HTML.pm.src (parse_byte_string): Redefined to invoke
973     |parse_byte_stream|.
974     (parse_byte_stream): New method.
975    
976     2008-05-18 Wakaba <wakaba@suika.fam.cx>
977    
978 wakaba 1.237 * HTML.pm.src (parse_byte_string): Fix the column number reported
979     by encoding layer error reporter.
980    
981 wakaba 1.233 2008-05-17 Wakaba <wakaba@suika.fam.cx>
982    
983 wakaba 1.236 * HTML.pm.src (parse_byte_string): Use streaming decoder
984     rather than converting the whole byte string and then parsing.
985     Propagate errors in character encoding layer.
986     (get_next_token): Precise error reporting for |bare stago| error.
987    
988     2008-05-17 Wakaba <wakaba@suika.fam.cx>
989    
990 wakaba 1.235 * HTML.pm.src (parse_char_stream): New method.
991     (parse_char_string): This method is now defined as an invocation
992     of the |parse_char_stream| method.
993    
994     2008-05-17 Wakaba <wakaba@suika.fam.cx>
995    
996 wakaba 1.234 * HTML.pm.src (parse_byte_string): Report various status
997     of the sniffing as info-level errors. Support for new
998     decoding framework in parser resestting.
999     (new): Various default error levels were not set.
1000    
1001     2008-05-17 Wakaba <wakaba@suika.fam.cx>
1002    
1003 wakaba 1.233 * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
1004     algorithm, except for the actual sniffing, is implemented
1005     with new framework with Message::Charset::Info.
1006    
1007 wakaba 1.232 2008-05-16 Wakaba <wakaba@suika.fam.cx>
1008    
1009     * CacheManifest.pm (_parse): Drop fragment identifiers from
1010     URIs in fallback section (HTML5 revision 1596).
1011    
1012 wakaba 1.230 2008-05-10 Wakaba <wakaba@suika.fam.cx>
1013    
1014 wakaba 1.231 * Makefile (Entities.html): URI has changed.
1015    
1016     2008-05-10 Wakaba <wakaba@suika.fam.cx>
1017    
1018 wakaba 1.230 * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
1019     1553).
1020    
1021 wakaba 1.228 2008-05-06 Wakaba <wakaba@suika.fam.cx>
1022    
1023 wakaba 1.229 * ContentChecker.pm: Noted that those returned in |table| are
1024     no longer table elements, but table objects returned
1025     by Whatpm::HTMLTable.
1026    
1027     * HTMLTable.pm (form_table): Return table element node
1028     as |$table->{element}|.
1029     (assign_header): Support for the |headers=""| attribute.
1030    
1031     2008-05-06 Wakaba <wakaba@suika.fam.cx>
1032    
1033 wakaba 1.228 * HTMLTable.pm (assign_header): New function; first version
1034     with no support for headers="".
1035     (form_table): Include table width and height to the returned
1036     table object for covenience. Indexing in column assignement was wrong.
1037     Set whether a data cell is empty or not for convenience.
1038    
1039 wakaba 1.222 2008-05-05 Wakaba <wakaba@suika.fam.cx>
1040    
1041 wakaba 1.227 * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
1042    
1043     2008-05-05 Wakaba <wakaba@suika.fam.cx>
1044    
1045 wakaba 1.226 * HTMLTable.pm: How table model errors are detected is
1046     changed (HTML5 revision 1387).
1047    
1048     2008-05-05 Wakaba <wakaba@suika.fam.cx>
1049    
1050 wakaba 1.225 * HTMLTable.pm: The algorithm now moves |tfoot| elements
1051     to the end of the table (HTML5 revision 1380).
1052    
1053     2008-05-05 Wakaba <wakaba@suika.fam.cx>
1054    
1055 wakaba 1.224 * HTMLTable.pm: The algorithm is now 0-based indexing, instead
1056     of 1-based (HTML5 revision 1376).
1057    
1058     2008-05-05 Wakaba <wakaba@suika.fam.cx>
1059    
1060 wakaba 1.223 * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
1061     and "Content-Encoding" no longer prevent sniffing (HTML5
1062     revision 1288).
1063    
1064     2008-05-05 Wakaba <wakaba@suika.fam.cx>
1065    
1066 wakaba 1.222 * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
1067     1282).
1068    
1069 wakaba 1.220 2008-05-03 Wakaba <wakaba@suika.fam.cx>
1070    
1071 wakaba 1.221 * ContentChecker.pm: Support for global attributes.
1072     Status of XML specs are added.
1073    
1074     2008-05-03 Wakaba <wakaba@suika.fam.cx>
1075    
1076 wakaba 1.220 * ContentChecker.pm (check_element): Support for |template=""|
1077     and |ref=""| attribute (referent element type checking).
1078    
1079 wakaba 1.218 2008-04-29 Wakaba <wakaba@suika.fam.cx>
1080    
1081 wakaba 1.219 * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
1082     1500) is implemented (except for IDNA part and URI-scheme-specific
1083 wakaba 1.218 knowledge). Line number counting was wrong for LF-only
1084     documents.
1085    
1086 wakaba 1.211 2008-04-13 Wakaba <wakaba@suika.fam.cx>
1087    
1088 wakaba 1.217 * HTML.pm.src: Raise an parse error for any disallowed
1089     character (HTML5 revision 1263).
1090    
1091     2008-04-13 Wakaba <wakaba@suika.fam.cx>
1092    
1093 wakaba 1.216 * mkentitylist.pl: Support for new HTML5 entity table format (the
1094     definition for |AElig;| was missing).
1095    
1096     2008-04-13 Wakaba <wakaba@suika.fam.cx>
1097    
1098 wakaba 1.215 * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
1099     name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
1100    
1101     2008-04-13 Wakaba <wakaba@suika.fam.cx>
1102    
1103 wakaba 1.214 * HTML.pm.src: List of element names that close foreign content
1104     insertion mode is added (HTML5 revisions 1412 and 1418).
1105    
1106     2008-04-13 Wakaba <wakaba@suika.fam.cx>
1107    
1108 wakaba 1.213 * HTML.pm.src: Support for |mglyph| and |malignmark|
1109     elements (HTML5 revision 1410).
1110    
1111     2008-04-13 Wakaba <wakaba@suika.fam.cx>
1112    
1113 wakaba 1.212 * HTML.pm.src: Support for new long MathML entities (HTML5
1114     revision 1406).
1115    
1116     2008-04-13 Wakaba <wakaba@suika.fam.cx>
1117    
1118 wakaba 1.211 * HTML.pm.src: CDATA section support for MathML and SVG
1119     elements (HTML5 revision 1404 and 1420).
1120    
1121 wakaba 1.209 2008-04-12 Wakaba <wakaba@suika.fam.cx>
1122    
1123 wakaba 1.210 * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
1124     elements (HTML5 revision 1404). Unused !!!macro definitions
1125     are removed.
1126    
1127     2008-04-12 Wakaba <wakaba@suika.fam.cx>
1128    
1129 wakaba 1.209 * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
1130     are raised is changed (HTML5 revision 1404).
1131    
1132 wakaba 1.206 2008-04-06 Wakaba <wakaba@suika.fam.cx>
1133    
1134 wakaba 1.208 * HTML.pm.src: Category sets that are no longer used are removed.
1135    
1136     2008-04-06 Wakaba <wakaba@suika.fam.cx>
1137    
1138 wakaba 1.207 * HTML.pm.src: The ->[1] property of stack entries are now
1139     replaced by constants representing element category.
1140    
1141     2008-04-06 Wakaba <wakaba@suika.fam.cx>
1142    
1143 wakaba 1.206 * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
1144     for error reporting. (This is a preparation for using constant
1145     value for ->[1].)
1146    
1147 wakaba 1.204 2008-03-22 Wakaba <wakaba@suika.fam.cx>
1148    
1149 wakaba 1.205 * RDFXML.pm: Typo fixed.
1150    
1151     2008-03-22 Wakaba <wakaba@suika.fam.cx>
1152    
1153 wakaba 1.204 * ContentChecker.pm: |fact_level| is now treated
1154     as same as |must_level|, i.e. level = |m|.
1155     (check_element): Make list of URIs in the DOM.
1156    
1157 wakaba 1.197 2008-03-21 Wakaba <wakaba@suika.fam.cx>
1158    
1159 wakaba 1.203 * RDFXML.pm: Language accessor implemented.
1160     Local (null-namespace) attribute support.
1161    
1162     2008-03-21 Wakaba <wakaba@suika.fam.cx>
1163    
1164 wakaba 1.202 * RDFXML.pm: Factored out ID checking code.
1165    
1166     2008-03-21 Wakaba <wakaba@suika.fam.cx>
1167    
1168 wakaba 1.201 * RDFXML.pm: TODO items noted. Validation of ID
1169     and URI attributes is implemented. Warn if unknown
1170     value is used in rdf:parseType="" attribute.
1171    
1172     * URIChecker.pm (check_rdf_uri_reference): New function.
1173    
1174     2008-03-21 Wakaba <wakaba@suika.fam.cx>
1175    
1176 wakaba 1.200 * RDFXML.pm: bnodeid implemented. Relative references
1177     are now resolved.
1178    
1179     2008-03-21 Wakaba <wakaba@suika.fam.cx>
1180    
1181 wakaba 1.199 * ContentChecker.pm: RDF reification implemented.
1182    
1183     * RDFXML.pm: undef vs false bug fixed.
1184     Reification implemented.
1185    
1186     2008-03-21 Wakaba <wakaba@suika.fam.cx>
1187    
1188 wakaba 1.198 * RDFXML.pm: s/id/ID/ for attribute name.
1189     The |node| arguments are added for |ontriple| calls.
1190     Too many "attribute not allowed" errors were raised.
1191    
1192     * ContentChecker.pm: Initial experimental support for rdf:RDF
1193     element.
1194    
1195     2008-03-21 Wakaba <wakaba@suika.fam.cx>
1196    
1197 wakaba 1.197 * RDFXML.pm: New module.
1198    
1199 wakaba 1.192 2008-03-20 Wakaba <wakaba@suika.fam.cx>
1200    
1201 wakaba 1.196 * HTML.pm.src (set_inner_html): Line/column number
1202     code was old one yet.
1203    
1204     2008-03-20 Wakaba <wakaba@suika.fam.cx>
1205    
1206 wakaba 1.195 * HTML.pm.src: Better line/column reporting for "duplicate attribute"
1207     errors. Line/column markings for DOCTYPE, comment, and
1208     character tokens are reintroduced; otherwise,
1209     error location for "not HTML5" error and errors
1210     for implied elements are not attached.
1211    
1212     2008-03-20 Wakaba <wakaba@suika.fam.cx>
1213    
1214 wakaba 1.194 * HTML.pm.src: Set line/column numbers to attributes.
1215    
1216     * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
1217     (value): Setter implemented.
1218    
1219     * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
1220    
1221     2008-03-20 Wakaba <wakaba@suika.fam.cx>
1222    
1223 wakaba 1.193 * HTML.pm.src: Unused line/column markings are removed.
1224    
1225     2008-03-20 Wakaba <wakaba@suika.fam.cx>
1226    
1227 wakaba 1.192 * HTML.pm.src (_get_next_token): Remove |first_start_tag|
1228     flag, which is no longer used.
1229    
1230 wakaba 1.190 2008-03-17 Wakaba <wakaba@suika.fam.cx>
1231    
1232 wakaba 1.191 * HTML.pm.src: Set line/column information to element nodes.
1233    
1234     * mkhtmlparser.pl (!!!create-element, !!!insert-element,
1235     and !!!insert-element-t): Set line/column information to
1236     element nodes.
1237    
1238     2008-03-17 Wakaba <wakaba@suika.fam.cx>
1239    
1240 wakaba 1.190 * HTML.pm.src (_get_next_token): The first "<" character
1241     in "<?", "<>", or "</>" should be the error point.
1242    
1243 wakaba 1.187 2008-03-16 Wakaba <wakaba@suika.fam.cx>
1244    
1245 wakaba 1.189 * HTML.pm.src: Some more fixes on error position reporting.
1246    
1247     2008-03-16 Wakaba <wakaba@suika.fam.cx>
1248    
1249 wakaba 1.188 * HTML.pm.src: Token-level precious error reporting.
1250    
1251     2008-03-16 Wakaba <wakaba@suika.fam.cx>
1252    
1253 wakaba 1.187 * HTML.pm.src: Preparation for more precious error point
1254     reporting.
1255    
1256 wakaba 1.184 2008-03-11 Wakaba <wakaba@suika.fam.cx>
1257    
1258 wakaba 1.186 * HTML.pm.src: Error type revised.
1259    
1260     2008-03-11 Wakaba <wakaba@suika.fam.cx>
1261    
1262 wakaba 1.185 * HTML.pm.src: Similar codes are merged together, again.
1263    
1264     2008-03-11 Wakaba <wakaba@suika.fam.cx>
1265    
1266 wakaba 1.184 * HTML.pm.src: Similar codes are merged together.
1267    
1268 wakaba 1.183 2008-03-10 Wakaba <wakaba@suika.fam.cx>
1269    
1270     * mkhtmlparser.pl: Set "level" parameter to parse errors.
1271    
1272     * HTML.pm.src: Code refined.
1273    
1274 wakaba 1.175 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1275    
1276 wakaba 1.182 * HTML.pm.src: |</body>| treatement has been changed (HTML5
1277     revision 1348). Note that I really don't know this makes
1278     any difference in the black-box behavior of the parser.
1279    
1280     2008-03-09 Wakaba <wakaba@suika.fam.cx>
1281    
1282 wakaba 1.181 * HTML.pm.src: New end-of-file token implementation (HTML5
1283     revision 1348).
1284    
1285     2008-03-09 Wakaba <wakaba@suika.fam.cx>
1286    
1287 wakaba 1.180 * HTML.pm.src: |applet| support (HTML5 revision 1347).
1288    
1289     2008-03-09 Wakaba <wakaba@suika.fam.cx>
1290    
1291 wakaba 1.179 * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
1292    
1293     2008-03-09 Wakaba <wakaba@suika.fam.cx>
1294    
1295 wakaba 1.178 * HTML.pm.src: Support for |<input>| in the "in select" insertion
1296     mode and support for the "in select in table" insertion mode (HTML5
1297     revision 1342).
1298    
1299     2008-03-09 Wakaba <wakaba@suika.fam.cx>
1300    
1301 wakaba 1.177 * HTML.pm.src: No foster parenting for <script> and <script>
1302     in non-tainted <table>s (HTML5 revision 1336).
1303    
1304     2008-03-09 Wakaba <wakaba@suika.fam.cx>
1305    
1306 wakaba 1.176 * HTML.pm.src: Ignore white space characters between <html>
1307     and <head> (HTML5 revision 1332).
1308    
1309     2008-03-09 Wakaba <wakaba@suika.fam.cx>
1310    
1311 wakaba 1.175 * HTML.pm.src: Treat <input type=hidden> as if it were a
1312     white space (HTML5 revision 1331).
1313    
1314 wakaba 1.164 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1315    
1316 wakaba 1.174 * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
1317     element (HTML5 revision 1330).
1318    
1319     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1320    
1321 wakaba 1.173 * HTML.pm.src: <title> is always appended to the current
1322     element (HTML5 revision 1328).
1323    
1324     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1325    
1326 wakaba 1.172 * HTML.pm.src: White space in tainted tables are moved
1327     into foster parents (HTML5 revision 1326).
1328    
1329     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1330    
1331 wakaba 1.171 * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
1332     revision 1321).
1333    
1334     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1335    
1336 wakaba 1.170 * HTML.pm.src: |</h/n/>| case code rearranged to align with
1337     the spec (HTML5 revision 1320). Note that we finally complete
1338     all of HTML5 revision 1320 changes.
1339    
1340     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1341    
1342 wakaba 1.169 * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
1343     tags (HTML5 revision 1320).
1344    
1345     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1346    
1347 wakaba 1.168 * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
1348    
1349     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1350    
1351 wakaba 1.167 * HTML.pm.src: A "generate implied end tags" code (t409.1)
1352     could not be reached so that it is now removed (HTML5 revision
1353     1320).
1354    
1355     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1356    
1357 wakaba 1.166 * HTML.pm.src: Code for the case of |</div>| and so on are revised
1358     to align with new spec text (HTML5 revision 1320).
1359    
1360     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1361    
1362 wakaba 1.165 * HTML.pm.src: Remove strange |if| condition; however, it should
1363     have had no harm in theory.
1364    
1365     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1366    
1367 wakaba 1.164 * HTML.pm.src (_tree_construction_main): '</p>' in body
1368     case is split from other end tags for the preparation
1369     of implementing HTML revision 1320.
1370    
1371 wakaba 1.162 2008-03-07 Wakaba <wakaba@suika.fam.cx>
1372    
1373 wakaba 1.163 * HTML.pm.src: Simplified "generate implied end tag" (HTML5
1374     revision 1320).
1375    
1376     2008-03-07 Wakaba <wakaba@suika.fam.cx>
1377    
1378 wakaba 1.162 * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
1379     and "div" (HTML5 revision 1318). Add comments to where
1380     |form| pointer association codes should be inserted (HTML5 revision
1381     1319).
1382    
1383 wakaba 1.161 2008-03-06 Wakaba <wakaba@suika.fam.cx>
1384    
1385     * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
1386    
1387 wakaba 1.159 2008-03-05 Wakaba <wakaba@suika.fam.cx>
1388    
1389 wakaba 1.160 * HTML.pm.src: Since the case t268 should never be reached (no
1390     other token type, there are), it is replaced by a |die| statement.
1391    
1392     2008-03-05 Wakaba <wakaba@suika.fam.cx>
1393    
1394 wakaba 1.159 * HTML.pm.src: Typo fixed.
1395    
1396 wakaba 1.158 2008-03-04 Wakaba <wakaba@suika.fam.cx>
1397    
1398     * HTML.pm.src (_tree_construction_initial): Some limited quirks
1399     doctypes were not uppercased for comparison.
1400    
1401 wakaba 1.152 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1402    
1403 wakaba 1.157 * HTML.pm.src (tree construction and set_inner_html): Checkpoints
1404     are added.
1405    
1406     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1407    
1408 wakaba 1.156 * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
1409     are set. Cases that are unlikely reached are noted as so.
1410    
1411     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1412    
1413 wakaba 1.155 * HTML.pm.src: Checkpoints for debugging are added.
1414    
1415     * mkhtmlparser.pl: Support for |!!!cp| syntax.
1416    
1417     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1418    
1419 wakaba 1.154 * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
1420     for simplicity.
1421    
1422     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1423    
1424 wakaba 1.153 * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
1425     revision 1307).
1426    
1427     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1428    
1429 wakaba 1.152 * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
1430     is set are changed (HTML5 revision 1305).
1431    
1432 wakaba 1.149 2008-03-02 Wakaba <wakaba@suika.fam.cx>
1433    
1434 wakaba 1.151 * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
1435     1292). Entities are not parsed in comment-like part in RCDATA
1436     elements (HTML5 revision 1294). Allow bare & at the end
1437     of attribute value literals (HTML5 revision 1296). More
1438     quirks mode doctypes (HTML5 revision 1302). Requires spaces
1439     between attributes and ban attribute names or unquoted
1440     attribute values containing single or double quotes (HTML5
1441     revision 1303).
1442    
1443     2008-03-02 Wakaba <wakaba@suika.fam.cx>
1444    
1445 wakaba 1.150 * ContentChecker.pm: Typo fixed. Don't raise "character encoding"
1446     and related errors unless it is an HTML document (though the spec
1447     is unclear on whether it is applied to XHTML document).
1448    
1449     * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
1450     are added.
1451    
1452     2008-03-02 Wakaba <wakaba@suika.fam.cx>
1453    
1454 wakaba 1.149 * HTML.pm.src: s/local_name/manakai_local_name/g.
1455    
1456 wakaba 1.148 2008-03-01 Wakaba <wakaba@suika.fam.cx>
1457    
1458     * _NamedEntityList.pm: Updated (HTML5 revision 1286).
1459    
1460     * HTML.pm.src: |charset| in |content| attribute is
1461     case-insensitive (HTML5 revision 1270).
1462    
1463 wakaba 1.147 2008-02-26 Wakaba <wakaba@suika.fam.cx>
1464    
1465     * ContentChecker.pm: New status constants are added.
1466     ($ElementDefault): |status| added.
1467     (check_element): Err for non-standard or deprecated elements.
1468     (_attr_status_info): For non-standard or deprecated attributes.
1469    
1470 wakaba 1.142 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1471    
1472 wakaba 1.146 * ContentChecker.pm (_attr_status_info): New internal method.
1473    
1474     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1475    
1476 wakaba 1.145 * ContentChecker.pm (check_element): Element standardized
1477     status information is now dispatched.
1478    
1479     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1480    
1481 wakaba 1.144 * ContentChecker.pm (check_element): Fix |del|-and-significant
1482     problem by adding some more arguments.
1483    
1484     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1485    
1486 wakaba 1.143 * ContentChecker.pm (check_element): Use context of
1487     container-for-the-purpose-of-content-model element (not transparent
1488     element) for |check_child_element| calling and significant
1489     text flag marking. This reintroduces |<del>|-and-significant
1490     problem again.
1491    
1492     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1493    
1494 wakaba 1.142 * ContentChecker.pm (check_element): Make semi-transparent
1495     elements ignored for the purpose of phase changes in
1496     content model checking.
1497    
1498 wakaba 1.138 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1499    
1500 wakaba 1.141 * ContentChecker.pm (check_element): In-element state
1501     was not properly managed for transparent cases.
1502    
1503     2008-02-23 Wakaba <wakaba@suika.fam.cx>
1504    
1505 wakaba 1.140 * ContentChecker.pm (check_element): Support for |video|
1506     and |audio| as semi-transparent elements.
1507    
1508     2008-02-23 Wakaba <wakaba@suika.fam.cx>
1509    
1510 wakaba 1.139 * ContentChecker.pm ($HTMLSemiTransparentElements): New.
1511     (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
1512     Support for |html:object| as a semi-transparent element.
1513    
1514     2008-02-23 Wakaba <wakaba@suika.fam.cx>
1515    
1516 wakaba 1.138 * ContentChecker.pm (check_element): The way to traverse
1517     the tree is entirely revised to make it easier to track
1518     the state of ancestors/descendants. As a result of this
1519     revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1520     support for content model checking for HTML elements |figure|,
1521     |object|, |video|, and |audio| and checking for XML elements (and
1522     some XMLNS checkings) are dropped for now. They will be
1523     reimplemented in due cource.
1524    
1525 wakaba 1.134 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1526    
1527 wakaba 1.137 * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1528     token (HTML5 revision 1225).
1529    
1530     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1531    
1532 wakaba 1.136 * ContentChecker.pm ({unsupported_level}): New value.
1533    
1534     * HTML.pm.src: Save whether |meta| |content| attribute
1535     contains character references or not.
1536    
1537     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1538    
1539 wakaba 1.135 * ContentChecker.pm (_get_children): (Incompleted) attempt to
1540     imlement significant content checking for contents
1541     with |del| elements.
1542    
1543     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1544    
1545 wakaba 1.134 * ContenteChecker.pm ($HTMLTransparentElements): More
1546     elements are added.
1547     (_get_children): HTML |object| elements are now semi-transparent.
1548    
1549     * NanoDOM.pm (manakai_html, manakai_head): New methods.
1550    
1551 wakaba 1.133 2008-02-16 Wakaba <wakaba@suika.fam.cx>
1552    
1553     * CacheManifest.pm: HTML5 revision 1211 implemented.
1554    
1555     * CacheManifest.pod: Updated.
1556    
1557 wakaba 1.132 2008-02-10 Wakaba <wakaba@suika.fam.cx>
1558    
1559     * ContentChecker.pm (check_document, check_element): Support
1560     for second argument ($onsubdoc).
1561     (_get_css_parser): Removed (now it is part of WDCC).
1562    
1563 wakaba 1.131 2008-02-09 Wakaba <wakaba@suika.fam.cx>
1564    
1565     * ContentChecker.pm (_get_css_parser): New.
1566    
1567 wakaba 1.129 2007-11-25 Wakaba <wakaba@suika.fam.cx>
1568    
1569 wakaba 1.130 * ContentChecker.pm ($AnyChecker): Old way to add child elements
1570     for checking had been used.
1571    
1572     2007-11-25 Wakaba <wakaba@suika.fam.cx>
1573    
1574 wakaba 1.129 * ContentChecker.pm (check_element): New todo item type |descendant|.
1575    
1576 wakaba 1.125 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1577    
1578 wakaba 1.128 * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1579     specced in RFC 2046 and RFC 4288.
1580     (application/atom+xml): Definition added.
1581    
1582     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1583    
1584 wakaba 1.127 * URIChecker.pm: Make RFC 3986 should-level errors
1585     warnings (rather than SHOULD-level errors).
1586    
1587     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1588    
1589 wakaba 1.126 * NanoDOM.pm (get_user_data, set_user_data): New methods.
1590    
1591     * HTML.pm.src: A flag for character references in attribute
1592     values are added. Set |manakai_has_reference| user data
1593     to |charset| attribute.
1594    
1595     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1596    
1597 wakaba 1.125 * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1598     attributes.
1599    
1600     * ContentChecker.pm (check_document): Warn if charset requirements
1601     cannot be tested.
1602    
1603 wakaba 1.124 2007-11-19 Wakaba <wakaba@suika.fam.cx>
1604    
1605     * HTML.pm.src (parse_byte_string): Detect charset
1606     by universalchardet if charset parameter is not specified.
1607    
1608     * Makefile (Charset-all, Charset-clean): New rules.
1609    
1610 wakaba 1.120 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1611    
1612 wakaba 1.123 * ContentChecker.pm (check_document): Check the existence
1613     of character encoding declaration and selection of encoding
1614     for HTML document.
1615    
1616     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1617    
1618 wakaba 1.122 * ContentType.pm (get_sniffed_type): Return also the official
1619     type in list context.
1620    
1621     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1622    
1623 wakaba 1.121 * ContentType.pm: Sniffing with leading white space
1624     ignoring (HTML5 revisions 1013 and 1016).
1625    
1626     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1627    
1628 wakaba 1.120 * ContentType.pm: HTML5 revision 1013 changes, except for leading
1629     white spaces, are implemented.
1630    
1631 wakaba 1.117 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1632    
1633 wakaba 1.119 * HTML.pm.src (parse_char_string): Set |inner_encoding|
1634     attribute if possible.
1635    
1636     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1637    
1638 wakaba 1.118 * HTML.pm.src (parse_byte_string): New method.
1639     (parse_char_string): New alias for |parse_string|.
1640     (main phase): Invoking "change the encoding" algorithm if desired.
1641    
1642     * HTML.pod: Updated.
1643    
1644     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1645    
1646 wakaba 1.117 * HTML.pod (get_inner_html): Removed.
1647    
1648     * Makefile (HTML-all, HTML-clean): New.
1649    
1650     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1651    
1652     * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1653    
1654 wakaba 1.116 2007-11-08 Wakaba <wakaba@suika.fam.cx>
1655    
1656     * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1657    
1658 wakaba 1.112 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1659    
1660 wakaba 1.115 * Makefile: |CacheManifest.html| is added.
1661    
1662     * CacheManifest.pod: New file.
1663    
1664     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1665    
1666 wakaba 1.114 * CacheManifest.pm: New module.
1667    
1668     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1669    
1670 wakaba 1.113 * HTML.pm.src: Support for application cache selection algorithm
1671     callback.
1672    
1673     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1674    
1675 wakaba 1.112 * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1676    
1677 wakaba 1.110 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1678    
1679 wakaba 1.111 * Makefile (clean): New rule.
1680    
1681     * NanoDOM.pm (public_id, system_id): New attributes.
1682    
1683     2007-10-17 Wakaba <wakaba@suika.fam.cx>
1684    
1685 wakaba 1.110 * Makefile (CSS-all, CSS-clean, clean): New rules.
1686    
1687 wakaba 1.109 2007-10-14 Wakaba <wakaba@suika.fam.cx>
1688    
1689     * ContentChecker.pm (check_document): Support for
1690     new |is_xml_root| flag.
1691     (check_element): Support for new |pluses| state.
1692     (_add_pluses): New method.
1693     (_remove_minuses): Support for new |minus| item.
1694    
1695 wakaba 1.106 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1696    
1697 wakaba 1.107 * ContentChecker.pm: Raise specific error for invalid
1698     root element.
1699    
1700     2007-09-24 Wakaba <wakaba@suika.fam.cx>
1701    
1702 wakaba 1.106 * ContentChecker.pm: Set level values for later uses.
1703    
1704 wakaba 1.103 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1705    
1706 wakaba 1.105 * ContentChecker.pm: Support for language tag validation.
1707    
1708     2007-09-09 Wakaba <wakaba@suika.fam.cx>
1709    
1710 wakaba 1.104 * LangTag.pm (check_rfc3066_language_tag): New method.
1711    
1712     2007-09-09 Wakaba <wakaba@suika.fam.cx>
1713    
1714 wakaba 1.103 * LangTag.pm: New module.
1715    
1716 wakaba 1.102 2007-09-04 Wakaba <wakaba@suika.fam.cx>
1717    
1718     * HTML.pm.src: Some error types were wrong.
1719    
1720 wakaba 1.100 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1721    
1722 wakaba 1.101 * CSS/: New directory.
1723    
1724     2007-08-17 Wakaba <wakaba@suika.fam.cx>
1725    
1726 wakaba 1.100 * ContentChecker.pm (_check_get_children): Support
1727     for |noscript| in |head|.
1728    
1729 wakaba 1.99 2007-08-12 Wakaba <wakaba@suika.fam.cx>
1730    
1731     * URI/: New directory.
1732    
1733 wakaba 1.94 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1734    
1735 wakaba 1.98 * HTML.pm.src: Tokenizer's states are now represented in
1736     number.
1737    
1738     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1739    
1740 wakaba 1.97 * HTML.pm.src: |or|s for insertion modes are replaced
1741     by |&|s.
1742    
1743     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1744    
1745 wakaba 1.96 * HTML.pm.src: Token types are now represented in number.
1746    
1747     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1748    
1749 wakaba 1.95 * ContentType.pm (SEE ALSO): Updated.
1750    
1751     * HTML.pm.src: Insertion modes are now represented in number.
1752    
1753     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1754    
1755 wakaba 1.94 * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1756     999) is implemented.
1757    
1758 wakaba 1.93 2007-08-08 Wakaba <wakaba@suika.fam.cx>
1759    
1760     * ContentType.pm: Sniffing for |<script| (HTML5 revision
1761     983) is implemented.
1762    
1763 wakaba 1.92 2007-08-06 Wakaba <wakaba@suika.fam.cx>
1764    
1765     * ContentChecker.pod: New documentation.
1766    
1767     * Makefile: A rule for |ContentChecker.html| is added.
1768    
1769     * ContentChecker.pm: A pod "LICENSE" section is added.
1770    
1771     * NanoDOM.pm ($VERSION): New variable.
1772    
1773 wakaba 1.90 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1774    
1775 wakaba 1.91 * H2H.pm: |b|, |i|, and |sub| are added to the
1776     list of allowed HTML elements.
1777    
1778     2007-08-05 Wakaba <wakaba@suika.fam.cx>
1779    
1780 wakaba 1.90 * H2H.pm: |samp| is added to the list of allowed
1781     HTML elements.
1782    
1783     * URIChecker.pm (check_iri): New.
1784     (check_iri_reference): Error type for IRI reference
1785     syntax error is changed.
1786    
1787 wakaba 1.87 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1788    
1789 wakaba 1.89 * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1790     is added.
1791     (check_document): Load appropriate module before validation.
1792    
1793     2007-08-04 Wakaba <wakaba@suika.fam.cx>
1794    
1795 wakaba 1.88 * ContentChecker/: New directory.
1796    
1797     2007-08-04 Wakaba <wakaba@suika.fam.cx>
1798    
1799 wakaba 1.87 * ContentChecker.pm: HTML |time| element is implemented.
1800    
1801     * HTMLTable.pm: Comments are updated as HTML5 is revised.
1802    
1803 wakaba 1.71 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1804    
1805 wakaba 1.86 * ContentChecker.pm (check_document): Return value
1806     even if no document element is found.
1807    
1808     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1809    
1810 wakaba 1.85 * HTML.pm.src: |$in_body| is no longer a function.
1811    
1812     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1813    
1814 wakaba 1.84 * HTML.pm.src: The |$in_body| code has been moved down.
1815    
1816     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1817    
1818 wakaba 1.83 * HTML.pm.src: The "trailing end" insertion mode
1819     is split into "after html body" and "after html frameset"
1820     insertion modes. Their codes are merged with "after body"
1821     and "after frameset" codes. |$previous_insertion_mode|
1822     has been removed. "after frameset" code is
1823     merged with "in frameset" code.
1824    
1825     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1826    
1827 wakaba 1.82 * HTML.pm.src: The "before head" insertion mode is
1828     merged with the "in head" insertion mode.
1829    
1830     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1831    
1832 wakaba 1.81 * HTML.pm.src: Cases in "in head" insertion mode are
1833     reorganized.
1834    
1835     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1836    
1837 wakaba 1.80 * HTML.pm.src: Some cases in "in table" insertion mode
1838     are merged.
1839    
1840     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1841    
1842 wakaba 1.79 * HTML.pm.src: The "in row" insertion mode is merged
1843     with "in table" insertion mode.
1844    
1845     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1846    
1847 wakaba 1.78 * HTML.pm.src: The "in table" and "in table body" insertion
1848     modes are merged.
1849    
1850     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1851    
1852 wakaba 1.77 * HTML.pm.src: There is no "in table head" or "in table foot"
1853     insertion mode!
1854    
1855     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1856    
1857 wakaba 1.76 * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
1858     now directly invoke the handler.
1859    
1860     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1861    
1862 wakaba 1.75 * HTML.pm.src: Codes for "in cell" insertion mode
1863     is merged to the "in body" insertion mode code.
1864    
1865     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1866    
1867 wakaba 1.74 * HTML.pm.src: Codes for "in body" and "in caption"
1868     insertion modes are merged.
1869    
1870     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1871    
1872 wakaba 1.73 * HTML.pm.src: Two |!!!next-token|s were missing.
1873    
1874     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1875    
1876 wakaba 1.72 * HTML.pm.src: Use numeric constant for |{content_mode}|
1877     instead of string constant for |{content_model_flag}|.
1878    
1879     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1880    
1881 wakaba 1.71 * HTML.pm.src: Add the name of the attribute
1882     to the "duplicate attribute" error.
1883    
1884 wakaba 1.69 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1885    
1886 wakaba 1.70 * ContentChecker.pm: Return the |class| node list.
1887    
1888     2007-07-17 Wakaba <wakaba@suika.fam.cx>
1889    
1890 wakaba 1.69 * ContentChecker.pm: Return the |id| node list.
1891    
1892     * HTML.pm.src: A typo is fixed.
1893    
1894 wakaba 1.63 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1895    
1896 wakaba 1.68 * ContentChecker.pm: Drop wrong |level => 'error'| specification
1897     from "in HTML:xml:lang" error. Character position
1898     is now the last part of the error type in the URI error
1899     description. Report "unsupported" status for language
1900     tags, media queries, script codes, and style sheets.
1901    
1902     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1903    
1904 wakaba 1.67 * ContentChecker.pm: Report error if |xml:lang|
1905     in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
1906     in XML.
1907    
1908     * NanoDOM.pm (Attr.owner_document): New attribute.
1909    
1910     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1911    
1912 wakaba 1.66 * HTML.pm.src: The character immediately following
1913     a bare |hcro| was discarded. Fix handling of
1914     entity references in attribute values.
1915    
1916     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1917    
1918 wakaba 1.65 * HTML.pm.src (main and trailin end phases): Token
1919     types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
1920     factored out. Error types |in html:#DOCTYPE|
1921     and |after html:#DOCTYPE| are merged
1922     into |DOCTYPE in the middle|. |</frameset>|
1923     in fragment parsing mode changed the
1924     insertion mode.
1925    
1926     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1927    
1928 wakaba 1.64 * HTML.pm.src: |$phase| has been removed; The |trailing end|
1929     phase is now an insertion mode. Treatments for white
1930     space character tokens were incorrect for some
1931     insertion modes. An old |meta| case was not removed.
1932    
1933     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1934    
1935 wakaba 1.63 * HTML.pm.src: |meta| charset declaration extraction
1936     implemented (but changing the encoding is not yet:-).
1937    
1938 wakaba 1.60 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1939    
1940 wakaba 1.62 * Charset/: New directory.
1941    
1942     2007-07-15 Wakaba <wakaba@suika.fam.cx>
1943    
1944 wakaba 1.61 * H2H.pm: New Perl module (created from
1945     manakai's H2H.dis).
1946    
1947     2007-07-15 Wakaba <wakaba@suika.fam.cx>
1948    
1949 wakaba 1.60 * XMLSerializer.pm: New Perl module (created from
1950     manakai's SimpleLS.dis).
1951    
1952 wakaba 1.59 2007-07-07 Wakaba <wakaba@suika.fam.cx>
1953    
1954     * HTML.pm.src: |<!---x-->| was not processed correctly.
1955    
1956 wakaba 1.57 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1957    
1958 wakaba 1.58 * HTML.pm.src: Report correct error message
1959     for |<body></div></body>|.
1960    
1961     2007-07-01 Wakaba <wakaba@suika.fam.cx>
1962    
1963 wakaba 1.57 * HTMLTable.pm: An error description was incorrect.
1964    
1965     2007-06-30 Wakaba <wakaba@suika.fam.cx>
1966    
1967     * ContentChecker.pm: Return |{term}| list.
1968    
1969 wakaba 1.55 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1970    
1971 wakaba 1.56 * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
1972     nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
1973     and <title> outside of head).
1974    
1975     2007-06-30 Wakaba <wakaba@suika.fam.cx>
1976    
1977 wakaba 1.55 * IMTChecker.pm: Report warning for unregistered
1978     and private types/subtypes.
1979    
1980     * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
1981     URIChecker.pm, HTMLTable.pm: Error messages are now
1982     consistent; they are all listed in
1983     <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
1984    
1985 wakaba 1.52 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1986    
1987 wakaba 1.54 * ContentChecker.pm: |<img ismap>| not in |<a></a>|
1988     is now erred. |<datalist>| is implemented.
1989     Attribute checker for |<command>| and |<menu>| are
1990     added. Support for |contextmenu| global attribute
1991     is added.
1992    
1993     2007-06-25 Wakaba <wakaba@suika.fam.cx>
1994    
1995 wakaba 1.53 * HTML.pm.src (_reset_insertion_mode): Interpretation
1996     of Step 3 has been changed.
1997    
1998     2007-06-25 Wakaba <wakaba@suika.fam.cx>
1999    
2000 wakaba 1.52 * HTML.pm.src: Late |<html>| parse error is implemented.
2001    
2002 wakaba 1.47 2007-06-24 Wakaba <wakaba@suika.fam.cx>
2003    
2004 wakaba 1.51 * URIChecker.pm (check_iri_reference): A |decode| method name was
2005     incorrect.
2006    
2007     * ContentChecker.pm: Support for the |footer| element.
2008     Check URI syntax for space-separated URI attributes.
2009     Support for the |tabindex| attribute. Support
2010     for |datetime| attribute.
2011    
2012     2007-06-24 Wakaba <wakaba@suika.fam.cx>
2013    
2014 wakaba 1.50 * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
2015     character references). HTML5 revision 1.146 (white space
2016     characters before root start tag). HTML5 revision
2017     1.148 (named character references in attribute values).
2018     HTML5 revision 1.152 (<plaintext>.innerHTML get).
2019    
2020     2007-06-24 Wakaba <wakaba@suika.fam.cx>
2021    
2022     * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
2023 wakaba 1.49 in <head>).
2024    
2025     2007-06-24 Wakaba <wakaba@suika.fam.cx>
2026    
2027 wakaba 1.48 * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>
2028     in body).
2029    
2030     * ContentChecker.pm: HTML5 revision 938 (scoped="").
2031    
2032     2007-06-24 Wakaba <wakaba@suika.fam.cx>
2033    
2034 wakaba 1.47 * HTML.pm.src: HTML5 revision 923 (matching end tag in CDATA
2035     or RCDATA in fragment parsing mode). HTML5 revision
2036     924 (<!--> and <!--->). HTML5 revision 926 (hn in hn).
2037    
2038     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2039    
2040     * HTML.pm.src (get_inner_html): HTML5 revision 922 (inner_html
2041     for <pre> and <textarea>).
2042    
2043 wakaba 1.33 2007-06-23 Wakaba <wakaba@suika.fam.cx>
2044    
2045 wakaba 1.46 * HTML.pm.src: HTML5 revision 920 (<isindex>).
2046    
2047     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2048    
2049 wakaba 1.45 * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).
2050    
2051     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2052    
2053 wakaba 1.44 * HTML.pm.src: HTML5 revision 916 (</body>).
2054     HTML5 revision 917 (conforming bare &).
2055    
2056     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2057    
2058 wakaba 1.43 * NanoDOM.pm (manakai_is_html): Setting to false did not work.
2059    
2060     * HTML.pm.src: HTML5 revision 914 (</ in CDATA, RCDATA).
2061     HTML5 revision 915 (<nobr>).
2062    
2063     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2064    
2065 wakaba 1.42 * HTML.pm.src: HTML5 revisions 908, 909, 912, and 913 (quirks mode).
2066    
2067     * NanoDOM.pm (manakai_is_html, manakai_compat_mode, compat_mode):
2068     New attributes.
2069    
2070     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2071    
2072 wakaba 1.41 * HTML.pm.src: HTML5 revisions 900, 901, 902, and 911 (<
2073     in tags).
2074    
2075     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2076    
2077 wakaba 1.40 * .cvsignore: |Entities.html| is added.
2078    
2079     * HTML.pm.src: |$entity_char| is removed and
2080     requires |Whatpm::_NamedEntityList| instead.
2081     HTML5 revision 898 (refc), except that lack of refc
2082     is parse error.
2083    
2084     * mkentitylist.pl: New script.
2085    
2086     * Makefile (all): |_NamedEntityList.pm| is added.
2087     (_NamedEntityList.pm, Entities.html): New rules.
2088    
2089     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2090    
2091 wakaba 1.39 * HTML.pm.src: Parse errors immediately after U+000D
2092     were ignored and U+000D immediately following another
2093     U+000D was not converted to U+000A.
2094    
2095     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2096    
2097 wakaba 1.38 * HTML.pm.src (set_inner_html): HTML5 revision 892 (adopt
2098     nodes before appended). Parser was not ready for NULL
2099     parse error and escape flag.
2100    
2101     * NanoDOM.pm (adopt_node): New.
2102    
2103     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2104    
2105 wakaba 1.37 * HTML.pm.src: HTML5 revision 886 (insane comment in
2106     CDATA and RCDATA). Note that current implementation
2107     is simply repeating what the spec says and it is maybe not
2108     a best way to do it.
2109    
2110     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2111    
2112 wakaba 1.36 * HTML.pm.src: HTML5 revision 884 (</form> don't close
2113     the form element if a descendant element without implied end tag has
2114     still been open).
2115    
2116     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2117    
2118 wakaba 1.35 * ContentChecker.pm: HTML5 revision 881 (Make |id| attribute with
2119     space characters non-conforming).
2120    
2121     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2122    
2123 wakaba 1.34 * HTML.pm.src: An error message was incorrect.
2124     HTML5 revision 869 (C1 character references).
2125    
2126     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2127    
2128 wakaba 1.33 * HTML.pm.src: HTML5 revision 867 (a LF at the beginning of
2129     a |textarea| is removed).
2130    
2131 wakaba 1.32 2007-06-05 Wakaba <wakaba@suika.fam.cx>
2132    
2133     * NanoDOM.pm (get_attribute_node_ns): New method.
2134    
2135     * ContentChecker.pm: |script| |async| and |defer|
2136     no longer require |src|. |async| MUST NOT be
2137     specified if |defer|. (HTML5 revision 858).
2138    
2139 wakaba 1.31 2007-05-30 Wakaba <wakaba@suika.fam.cx>
2140    
2141     * HTML.pm.src: |<form><form>| went to inifinite loop.
2142    
2143 wakaba 1.27 2007-05-27 Wakaba <wakaba@suika.fam.cx>
2144    
2145 wakaba 1.30 * ContentChecker.pm (html): Set |is_root| (allowed
2146     as a document element) flag on.
2147     (new): Removed.
2148     (check_document): New method.
2149    
2150     2007-05-27 Wakaba <wakaba@suika.fam.cx>
2151    
2152 wakaba 1.29 * ContentChecker.pm (thead, tfoot): Checker specifications
2153     were incorrect.
2154    
2155     2007-05-27 Wakaba <wakaba@suika.fam.cx>
2156    
2157 wakaba 1.28 * ContentChecker.pm ($HTMLURIAttrChecker): Include
2158     error position in the |type| option of the error.
2159    
2160     * HTMLTable.pm (form_table): The |$onerror| parameter
2161     is now optional. Some bugs are fixed.
2162    
2163     2007-05-27 Wakaba <wakaba@suika.fam.cx>
2164    
2165 wakaba 1.27 * HTMLTable.pm: New module.
2166    
2167     * ContentChecker.pm (table): Invoke table model error checker.
2168    
2169     * NanoDOM.pm (first_child, get_attribute_ns): New.
2170    
2171 wakaba 1.25 2007-05-26 Wakaba <wakaba@suika.fam.cx>
2172    
2173 wakaba 1.26 * ContentChecker.pm ($HTMLLinkTypesAttrChecker): New checker.
2174     (link/@rel, a/@rel, area/@rel): Use new checker.
2175    
2176     * Makefile (_LinkTypeList.pm, RelExtensions.html): New rules.
2177    
2178     * _LinkTypeList.pm: New file.
2179    
2180     * mklinktypelist.pl: New file.
2181    
2182     * .cvsignore: |RelExtensions.html| added.
2183    
2184     * NanoDOM.pm (child_nodes): Returns an empty array
2185     for non-child-containing node types.
2186     (text_content): New attribute.
2187    
2188     2007-05-26 Wakaba <wakaba@suika.fam.cx>
2189    
2190 wakaba 1.25 * IMTChecker.pm: New module.
2191    
2192     * ContentChecker.pm ($HTMLIMTAttrChecker): Call IMTChecker
2193     to test parameter value validity.
2194    
2195     * HTML.pm.src ($style_start_tag): Attributes were
2196     discarded.
2197    
2198 wakaba 1.22 2007-05-25 Wakaba <wakaba@suika.fam.cx>
2199    
2200 wakaba 1.24 * ContentChecker.pm ($HTMLURIAttrChecker): Implemented.
2201    
2202     2007-05-25 Wakaba <wakaba@suika.fam.cx>
2203    
2204 wakaba 1.23 * URIChecker.pm: All recommendations from RFC 3986
2205     and RFC 3987 are listed (not all testable items are checked yet).
2206    
2207     2007-05-25 Wakaba <wakaba@suika.fam.cx>
2208    
2209 wakaba 1.22 * URIChecker.pm: New module.
2210    
2211 wakaba 1.18 2007-05-20 Wakaba <wakaba@suika.fam.cx>
2212    
2213 wakaba 1.21 * ContentChecker.pm: Now most attributes are implemented
2214     or associated to some placeholder.
2215     ($ElementDefault): Warn unknown attributes
2216     for unknown elements as "attribute not supported".
2217     ($HTMLLanguageTagAttrChecker, $HTMLMQAttrChecker): New placeholders.
2218     ($HTMLUsemapAttrChecker, $HTMLTargetAttrChecker): New checkers.
2219     (|a| attribute checker): Reimplemented.
2220    
2221     2007-05-20 Wakaba <wakaba@suika.fam.cx>
2222    
2223 wakaba 1.20 * ContentChecker.pm ($HTMLEventHandlerAttrChecker): New placeholder.
2224     ($HTMLAttrChecker): Event handler content attributes
2225     are added.
2226     (link, embed): Required attribute is now checked.
2227     (embed): Unknown local attributes are no longer warned.
2228    
2229     2007-05-20 Wakaba <wakaba@suika.fam.cx>
2230    
2231 wakaba 1.19 * ContentChecker.pm ($HTMLSpaceURIsAttrChecker): New placeholder.
2232     ($HTMLIMTAttrChecker): New checker.
2233     (link@rel, link@href, link@type, style@type,
2234     a@href, a@ping, a@ping, a@type, embed@src, embed@type,
2235     object@data, object@type, source@src, source@type, area@alt,
2236     area@shape, area@coords, area@href,
2237     area@ping, area@rel, area@type, script@src,
2238     script@defer, script@async, script@type): Checkers added.
2239    
2240     2007-05-20 Wakaba <wakaba@suika.fam.cx>
2241    
2242 wakaba 1.18 * ContentChecker.pm: Descendant checking was incorrect.
2243    
2244 wakaba 1.12 2007-05-19 Wakaba <wakaba@suika.fam.cx>
2245    
2246 wakaba 1.17 * ContentChecker.pm: Support |xml:*| and |xmlns:*|
2247     attributes. Report an error if |Element.prefix|
2248     is |xmlns|.
2249    
2250     * NanoDOM.pm (prefix): New attribute.
2251    
2252     2007-05-19 Wakaba <wakaba@suika.fam.cx>
2253    
2254 wakaba 1.16 * HTML.pm.src: In |main| phase, |in body| insertion
2255     mode, action for |<iframe>| was missing.
2256    
2257     2007-05-19 Wakaba <wakaba@suika.fam.cx>
2258    
2259 wakaba 1.15 * ContentChecker.pm: Support for many of HTML5 elements.
2260     ($GetHTMLNonNegativeIntegerAttrChecker): New.
2261    
2262     2007-05-19 Wakaba <wakaba@suika.fam.cx>
2263    
2264 wakaba 1.14 * ContentChecker.pm: Support for most elements up to |progress|.
2265     ($HTMLURIAttrChecker): Placeholder.
2266     ($HTMLIntegerAttrChecker, $GetHTMLFloatingPointNumberAttrChecker):
2267     New.
2268    
2269     2007-05-19 Wakaba <wakaba@suika.fam.cx>
2270    
2271 wakaba 1.13 * ContentChecker.pm: Attribute checkers
2272     for global attributes, |html|, |base|, |style|, and |meta|.
2273    
2274     * NanoDOM.pm (insert_before): Weaken reference
2275     to the parent node.
2276     (Attr::new): Set |owner_element| attribute.
2277     (namespace_uri, manakai_local_name): New attribute implementations.
2278     (owner_element): New attribute.
2279    
2280     2007-05-19 Wakaba <wakaba@suika.fam.cx>
2281    
2282 wakaba 1.12 * ContentChecker.pm ($AttrChecker, $HTMLAttrChecker,
2283     $AnyChecker->{attr_checker}, $HTMLAttrsChecker,
2284     $Element->{$HTML_NS}->{''}): New.
2285     (check_element): Invoke attrs_checker for each element.
2286    
2287 wakaba 1.6 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2288    
2289 wakaba 1.11 * ContentChecker.pm: Don't use |manakai_element_type_match|.
2290    
2291     2007-05-13 Wakaba <wakaba@suika.fam.cx>
2292    
2293 wakaba 1.10 * ContentChecker.pm: Use hashs rather than lists for
2294     element type testings.
2295    
2296     2007-05-13 Wakaba <wakaba@suika.fam.cx>
2297    
2298 wakaba 1.9 * ContentChecker.pm: Don't generate duplicate
2299     error when an element type is put in the "minus" list
2300     and the element type is not allowed explicitly in the particular
2301     element content model.
2302     (html:a checker): New checker.
2303     (html:details, html:datagrid): New checkers.
2304     (html:legend): New checker.
2305    
2306     2007-05-13 Wakaba <wakaba@suika.fam.cx>
2307    
2308 wakaba 1.8 * ContentChecker.pm (html:li checker): Implemented.
2309    
2310     2007-05-13 Wakaba <wakaba@suika.fam.cx>
2311    
2312 wakaba 1.7 * ContentChecker.pm ($HTMLInlineOrStriclyInlineChecker): New
2313     checker.
2314     (html:dd checker): New checker.
2315     (html:q, html:em, html:strong, html:small,
2316     html:m, html:dfn, html:code, html:samp, html:span): New checkers.
2317    
2318     2007-05-13 Wakaba <wakaba@suika.fam.cx>
2319    
2320 wakaba 1.6 * ContentChecker.pm ($AnyChecker): Renamed
2321     from |$ElementDefault->{checker}|.
2322     ($ElementDefault->{checker}): Throw an error that
2323     the element type is not supported by the checker.
2324     ($HTMLMetadataElement): |html:base| was missing.
2325     ($HTMLEmptyChecker): Don't throw an error
2326     for inter-element whitespace nodes.
2327     (html:html checker): Errors were not
2328     thrown even if |html:head| and/or |html:body|
2329     children were missing.
2330     (html:head checker): An error was not
2331     thrown if <meta charset> appered after other
2332     elements.
2333    
2334 wakaba 1.5 2007-05-05 Wakaba <wakaba@suika.fam.cx>
2335    
2336     * ContentChecker.pm: |footer|, |video|, |audio|, |script|,
2337     and |noscript| elements are implemented.
2338     (new): New method.
2339    
2340 wakaba 1.3 2007-05-04 Wakaba <wakaba@suika.fam.cx>
2341    
2342 wakaba 1.4 * ContentChecker.pm: New module.
2343    
2344     2007-05-04 Wakaba <wakaba@suika.fam.cx>
2345    
2346 wakaba 1.3 * NanoDOM.pm (manakai_parent_element,
2347     document_element, manakai_local_name,
2348     manakai_element_type_match): New method.
2349    
2350     2007-05-03 Wakaba <wakaba@suika.fam.cx>
2351    
2352     * HTML.pm.src: Replace decimal and hexadecimal numeric
2353     entities in C1 range using Windows-1252 mapping. Bare LF
2354     did not count as new line for error reporting.
2355    
2356 wakaba 1.2 2007-05-02 Wakaba <wakaba@suika.fam.cx>
2357    
2358     * NanoDOM.pm (DOMImplementation): New class.
2359     (append_child): Weaken the |parent_node| reference.
2360     (create_element_ns, Element new): Set the |owner_document|
2361     reference.
2362     (implementation): New attribute.
2363     (owner_document, local_name, namespace_uri): New attributes.
2364    
2365     * HTML.pm.src (parse_string): Line and column numbers
2366     are now provided to error handler.
2367     (!!!parse-error): Short descriptions are added.
2368     (_construct_tree): Split into three methods; support
2369     for innerHTML mode.
2370     (set_inner_html): New method.
2371    
2372 wakaba 1.1 2007-05-01 Wakaba <wakaba@suika.fam.cx>
2373    
2374     * NanoDOM.html: Documentation is added.
2375    
2376     * HTML.pod, ContentType.html: Documentation is revised.
2377    
2378     * .cvsignore: Pod2html temporary files are added.
2379    
2380     * Makefile: Make |NanoDOM.html|.
2381    
2382     2007-05-01 Wakaba <wakaba@suika.fam.cx>
2383    
2384     * HTML.pm.src (parse_string): New method.
2385     (get_inner_html): Renamed from |inner_html|.
2386    
2387     * Makefile: A rule for |HTML.html| is added.
2388    
2389     * HTML.pod: New documentation.
2390    
2391     2007-05-01 Wakaba <wakaba@suika.fam.cx>
2392    
2393     * NanoDOM.pm (last_child, previous_sibling): New attributes.
2394     (clone_node): Attribute nodes were not completely copied.
2395    
2396     * HTML.pm.src: Many bugs are fixed.
2397    
2398     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2399    
2400     * HTML.pm.src: Some typos are fixed.
2401    
2402     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2403    
2404     * mkhtmlparser.pl, Makefile: References to the |HTML-consume-entity.src|
2405     are removed.
2406    
2407     * HTML.pm.src: Tokenizer's handling on named entities are rewritten.
2408    
2409     * HTML-consume-entity.src: Removed.
2410    
2411     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2412    
2413     * HTML.pm.src: Tokenizer's handling on hexadecimal
2414     numeric entities are rewritten.
2415    
2416     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2417    
2418     * HTML.pm.src: Some tokenizer bugs are fixed.
2419    
2420     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2421    
2422     * HTML.pm.src: The tree construction stage is implemented.
2423    
2424     * mkhtmlparser.pl: New macros are added.
2425    
2426     2007-04-28 Wakaba <wakaba@suika.fam.cx>
2427    
2428     * ContentType.pm: A note on bug in the specification
2429     is removed since it's been now fixed.
2430    
2431     * .cvsignore: New file.
2432    
2433     2007-04-28 Wakaba <wakaba@suika.fam.cx>
2434    
2435     * HTML.pm.src, HTML-consume-entity.src: New files.
2436    
2437     * Makefile (HTML.pm): New rule.
2438    
2439     * mkhtmlparser.pl: New script.
2440    
2441     2007-04-25 Wakaba <wakaba@suika.fam.cx>
2442    
2443     * Makefile: New file.
2444    
2445     2007-04-24 Wakaba <wakaba@suika.fam.cx>
2446    
2447     * ContentType.pm: An error in pod is fixed.
2448    
2449     2007-04-24 Wakaba <wakaba@suika.fam.cx>
2450    
2451     * ContentType.pm: Documentation is added.
2452    
2453     2007-04-24 Wakaba <wakaba@suika.fam.cx>
2454    
2455     * ContentType.pm: New Perl module.
2456    
2457     * ChangeLog: New file.
2458    
2459    

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24