/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Contents of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.396 - (hide annotations) (download)
Sun Sep 6 08:15:37 2009 UTC (15 years, 2 months ago) by wakaba
Branch: MAIN
Changes since 1.395: +7 -0 lines
++ whatpm/Whatpm/ChangeLog	6 Sep 2009 08:15:17 -0000
	* HTML.pm.src: Added |xmp| to the list of Special elements (HTML5
	revision 3689).  It should make no difference since the |xmp|
	element has the PCDATA content and cannot be the non-bottommost
	element in the stack of open elements.

2009-09-06  Wakaba  <wakaba@suika.fam.cx>

1 wakaba 1.391 2009-09-06 Wakaba <wakaba@suika.fam.cx>
2    
3 wakaba 1.396 * HTML.pm.src: Added |xmp| to the list of Special elements (HTML5
4     revision 3689). It should make no difference since the |xmp|
5     element has the PCDATA content and cannot be the non-bottommost
6     element in the stack of open elements.
7    
8     2009-09-06 Wakaba <wakaba@suika.fam.cx>
9    
10 wakaba 1.395 * HTML.pm.src: Set the |confidence| flag to |irrlevant| when the
11     fragment parsing algorithm is invoked (HTML5 revision 2699).
12    
13     2009-09-06 Wakaba <wakaba@suika.fam.cx>
14    
15 wakaba 1.394 * HTML.pm.src: Simplified foster parenting (HTML5 revision 2732).
16    
17     2009-09-06 Wakaba <wakaba@suika.fam.cx>
18    
19 wakaba 1.393 * HTML.pm.src: No longer does foster parenting |style|, |script|,
20     and |input type=hidden| elements (HTML5 revision 2988).
21    
22     2009-09-06 Wakaba <wakaba@suika.fam.cx>
23    
24 wakaba 1.392 * HTML.pm.src: Dropped the |eventsource| element (HTML5 revision
25     2870).
26    
27     2009-09-06 Wakaba <wakaba@suika.fam.cx>
28    
29 wakaba 1.391 * HTML.pm.src (_tree_construction_main): Treate |keygen| as a void
30     element (HTML5 revision 2960).
31    
32 wakaba 1.389 2009-09-05 Wakaba <wakaba@suika.fam.cx>
33    
34 wakaba 1.390 * HTML.pm.src: Moved the "in table text" insertion mode code up
35     such that it does also work as specced in HTML5 in the remaining
36     cases (HTML5 revision 3382).
37    
38     2009-09-05 Wakaba <wakaba@suika.fam.cx>
39    
40 wakaba 1.389 * HTML.pm.src: Implementing the "in table text" insertion mode,
41     first half (HTML5 revision 3382).
42    
43 wakaba 1.388 2009-08-30 Wakaba <wakaba@suika.fam.cx>
44    
45     * ContentChecker.pm: Allow |xml:lang| in HTML document if the
46     owner element is not an HTML element (HTML5 revision 3697).
47    
48 wakaba 1.387 2009-08-23 Wakaba <wakaba@suika.fam.cx>
49    
50     * ContentChecker.pm: Allow <input type=hidden> in context where
51     interactive content is disallowed (HTML5 revision 2392).
52    
53 wakaba 1.380 2009-08-16 Wakaba <wakaba@suika.fam.cx>
54    
55 wakaba 1.386 * HTML.pm.src: Dropped support for "XSLT-compat" and added
56     "about:legacy-compat" (HTML5 revision 2725).
57    
58     2009-08-16 Wakaba <wakaba@suika.fam.cx>
59    
60 wakaba 1.385 * HTML.pm.src: "<!DOCTYPE>" is now explicitly supported in the
61     spec (HTML5 revision 2700).
62    
63     2009-08-16 Wakaba <wakaba@suika.fam.cx>
64    
65 wakaba 1.384 * HTML.pm.src: Added support for <script> element in the "in
66     select" and "in select in table" insertion modes (HTML5 revision
67     2500).
68    
69     2009-08-16 Wakaba <wakaba@suika.fam.cx>
70    
71 wakaba 1.383 * HTML.pm.src: An <hn> start tag in an <hn> element should close
72     the earlier element (HTML5 revision 2501).
73    
74     2009-08-16 Wakaba <wakaba@suika.fam.cx>
75    
76 wakaba 1.382 * HTML.pm.src: The </br> end tags before the <body> start tag
77     should equally survive as the first br element child of the body
78     element (HTML5 revision 2586).
79    
80     2009-08-16 Wakaba <wakaba@suika.fam.cx>
81    
82 wakaba 1.381 * HTML.pm.src: <font> in SVG land is handled as an SVG element as
83     long as it did not have HTML attributes (HTML5 revision 2590).
84    
85     2009-08-16 Wakaba <wakaba@suika.fam.cx>
86    
87 wakaba 1.380 * HTML.pm.src (_tree_construction_main): Improved error reporting
88     on "select" start tag in "in select" and "in select in table"
89     parsing modes (c.f. HTML5 revision 3604).
90    
91 wakaba 1.378 2009-07-26 Wakaba <wakaba@suika.fam.cx>
92    
93 wakaba 1.379 * mkhtmlclass.pl: Made xmlns:xlink with wrong namespace URL
94     invalid (HTML5 revision 2701).
95    
96     2009-07-26 Wakaba <wakaba@suika.fam.cx>
97    
98 wakaba 1.378 * HTML.pm.src: </rt>, </rp>, </option>, </optgroup> become
99     optional before </body> (HTML5 revision 2806).
100    
101 wakaba 1.376 2009-07-25 Wakaba <wakaba@suika.fam.cx>
102    
103 wakaba 1.377 * HTML.pm.src: Marked where SVG |script| execution should be
104     implemented (HTML5 revision 2904).
105    
106     2009-07-25 Wakaba <wakaba@suika.fam.cx>
107    
108 wakaba 1.376 * HTML.pm.src (set_inner_html): The fragment parsing algorithm
109     inherits the document quirksness flag of the main document (HTML5
110     revision 2935).
111    
112 wakaba 1.375 2009-07-05 Wakaba <wakaba@suika.fam.cx>
113    
114     * HTML.pm.src (_tree_construction_main): Put |table| in |p| in
115     quirks mode (The Hixie Quirk, HTML5 revision 3132).
116    
117 wakaba 1.372 2009-07-03 Wakaba <wakaba@suika.fam.cx>
118    
119 wakaba 1.374 * HTML.pm.src: <kegten> breaks out of <select> (HTML5 revision
120     3163).
121    
122     2009-07-03 Wakaba <wakaba@suika.fam.cx>
123    
124 wakaba 1.373 * HTML.pm.src: Don't change encoding to UTF-8 if the input
125     encoding is UTF-16 (HTML5 revision 3204).
126    
127     2009-07-03 Wakaba <wakaba@suika.fam.cx>
128    
129 wakaba 1.372 * HTML.pm.src (parse_byte_stream): Don't change to another
130     encoding if the input is UTF-16 (HTML5 revision 3205).
131    
132 wakaba 1.371 2009-06-28 Wakaba <wakaba@suika.fam.cx>
133    
134     * HTML.pm.src: |isindex| should no longer generate |p| element
135     node (HTML5 revision 3236).
136    
137 wakaba 1.368 2008-12-12 Wakaba <wakaba@suika.fam.cx>
138    
139 wakaba 1.370 * ContentChecker.pm: Introduced new flag |has_autofocus|. Added
140     support for new |id_type| value of |any|, which means any kind of
141     element is allowed as a referenced element.
142    
143     2008-12-12 Wakaba <wakaba@suika.fam.cx>
144    
145 wakaba 1.369 * ContentChecker.pm: Introduced |check_attrs2| method for the ease
146     of defining codes for checking required attribute and as such.
147    
148     2008-12-12 Wakaba <wakaba@suika.fam.cx>
149    
150 wakaba 1.368 * IMTChecker.pm: Added more definitions for subtypes.
151    
152 wakaba 1.367 2008-12-11 Wakaba <wakaba@suika.fam.cx>
153    
154     * URIChecker.pm: Some of |pos_end| values were wrong.
155    
156 wakaba 1.364 2008-12-06 Wakaba <wakaba@suika.fam.cx>
157    
158 wakaba 1.366 * ContentChecker.pm (check_element): Added support for "no
159     referenced datalist" error.
160    
161     2008-12-06 Wakaba <wakaba@suika.fam.cx>
162    
163 wakaba 1.365 * URIChecker.pm: Bug fix: It did not work unless
164     Message::DOM::DOMImplementation has been |require|d.
165    
166     2008-12-06 Wakaba <wakaba@suika.fam.cx>
167    
168 wakaba 1.364 * NanoDOM.pm (document_uri): New attribute.
169    
170     * ContentChecker.pm: Don't use methods not implemented by NanoDOM.
171    
172 wakaba 1.363 2008-11-07 Wakaba <wakaba@suika.fam.cx>
173    
174     * NanoDOM.pm (text_content): Don't create a Text node if the new
175     value is empty.
176    
177 wakaba 1.362 2008-11-06 Wakaba <wakaba@suika.fam.cx>
178    
179     * SWML/: New directory.
180    
181 wakaba 1.361 2008-10-20 Wakaba <wakaba@suika.fam.cx>
182    
183     * NanoDOM.pm (specified, all_declarations_processed,
184     manakai_attribute_type): New attributes.
185    
186 wakaba 1.359 2008-10-19 Wakaba <wakaba@suika.fam.cx>
187    
188 wakaba 1.360 * NanoDOM.pm (Entity->new): Initialize ->child_nodes as an empty
189     array.
190    
191     2008-10-19 Wakaba <wakaba@suika.fam.cx>
192    
193 wakaba 1.359 * NanoDOM.pm (notation_name): New attribute.
194    
195 wakaba 1.357 2008-10-18 Wakaba <wakaba@suika.fam.cx>
196    
197 wakaba 1.359 * NanoDOM.pm (public_id, system_id): New attributes.a
198 wakaba 1.358
199     2008-10-18 Wakaba <wakaba@suika.fam.cx>
200    
201 wakaba 1.357 * NanoDOM.pm (text_content): Moved to Node from Element. Setter
202     implemented.
203     (allowed_tokens, default_type, declared_type): Implemented.
204    
205 wakaba 1.356 2008-10-17 Wakaba <wakaba@suika.fam.cx>
206    
207     * NanoDOM.pm (node_name): New attribute.
208     (ELEMENT_TYPE_DEFINITION_NODE, ATTRIBUTE_DEFINITION_NODE): New
209     constants.
210     (create_element_type_definition_node, create_attribute_definition,
211     create_notation, create_general_entity,
212     get_element_type_definition_node,
213     set_element_type_definition_node, get_general_entity_node,
214     set_general_entity_node, get_notation_node, set_notation_node,
215     get_attribute_definition_node, set_attribute_definition_node): New
216     methods.
217     (element_types, entities, notations, attribute_definitions): New
218     attributes.
219     (DocumentType): Support for child nodes, entities, notations, and
220     element types.
221     (Entity, Notation, ElementTypeDefinition, AttributeDefinition):
222     New classes.
223    
224     * Dumper.pm: Support for general entities, notations, element type
225     definitions, and attribute definitions.
226    
227 wakaba 1.355 2008-10-15 Wakaba <wakaba@suika.fam.cx>
228    
229     * NanoDOM.pm (create_processing_instruction): New method.
230     (xml_version, xml_encoding, xml_standalone): New attributes.
231     (ProcessingInstruction): New class.
232    
233 wakaba 1.352 2008-10-14 Wakaba <wakaba@suika.fam.cx>
234    
235 wakaba 1.354 * HTML.pm.src: Handling of end tags in the foreign content
236     insertion mode was partially wrong, because of wrong bit
237     operations.
238    
239     2008-10-14 Wakaba <wakaba@suika.fam.cx>
240    
241 wakaba 1.353 * NanoDOM.pm (dom_config): New attribute (do nothing), for
242     Whatpm::XML::Parser support.
243    
244     2008-10-14 Wakaba <wakaba@suika.fam.cx>
245    
246 wakaba 1.352 * Makefile: New rule to make HTML/Tokenizer.pm is added.
247    
248     * HTML.pm.src: Tokenizer part moved to another file.
249    
250 wakaba 1.349 2008-10-13 Wakaba <wakaba@suika.fam.cx>
251    
252 wakaba 1.351 * HTML.pm.src: Merge |DT_EL| and |DD_EL| as |DTDD_EL|.
253    
254     2008-10-13 Wakaba <wakaba@suika.fam.cx>
255    
256 wakaba 1.350 * HTML.pm.src: Element category constants redefined.
257    
258     2008-10-13 Wakaba <wakaba@suika.fam.cx>
259    
260 wakaba 1.349 * HTML.pm.src: Steps for CDATA/RCDATA elements in tree
261     construction stage synced with the spec (HTML5 revisions 2139 and
262     2302).
263    
264 wakaba 1.348 2008-10-07 Wakaba <wakaba@suika.fam.cx>
265    
266     * ContentChecker.pm: New error level "html5_fact" added, which
267     should be tentatively used until all of requirements are properly
268     specced as RFC 2119 "MUST" in HTML5.
269    
270 wakaba 1.346 2008-10-05 Wakaba <wakaba@suika.fam.cx>
271    
272 wakaba 1.347 * ContentChecker.pod: Note on internal flags is added.
273    
274     2008-10-05 Wakaba <wakaba@suika.fam.cx>
275    
276 wakaba 1.346 * HTML.pm.src: An AAA bug fixed.
277    
278 wakaba 1.336 2008-10-04 Wakaba <wakaba@suika.fam.cx>
279    
280 wakaba 1.345 * HTML.pm.src: If another node is inserted by the parser, don't
281     reuse existing Text node to append a character (HTML5 revision
282     2124).
283    
284     2008-10-04 Wakaba <wakaba@suika.fam.cx>
285    
286 wakaba 1.344 * HTML.pm.src: Support for <option> and <optgroup> in body (HTML5
287     revisions 1731 and 2128).
288    
289     2008-10-04 Wakaba <wakaba@suika.fam.cx>
290    
291 wakaba 1.343 * HTML.pm.src: Make <foreignObject> scoping (HTML5 revision 1837).
292     Support for end tags of camelCase SVG elements were broken. A
293     wrong error type text fixed.
294    
295     2008-10-04 Wakaba <wakaba@suika.fam.cx>
296    
297 wakaba 1.342 * HTML.pm.src: Drop redundant code (HTML5 revision 1731).
298    
299     2008-10-04 Wakaba <wakaba@suika.fam.cx>
300    
301 wakaba 1.341 * HTML.pm.src: Support for new definition of |param| and |source|
302     start tag parsing (HTML5 revision 1731).
303    
304     2008-10-04 Wakaba <wakaba@suika.fam.cx>
305    
306 wakaba 1.340 * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
307    
308     2008-10-04 Wakaba <wakaba@suika.fam.cx>
309    
310 wakaba 1.339 * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
311     revisions 1731 and 1831).
312    
313     2008-10-04 Wakaba <wakaba@suika.fam.cx>
314    
315 wakaba 1.338 * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
316     revisions 1731 and 1778). Support for the </sarcasm> end tag (HTML5
317     revision 1731).
318    
319     2008-10-04 Wakaba <wakaba@suika.fam.cx>
320    
321 wakaba 1.337 * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
322     revision 1731). End tags of |option| and |optgroup| elements are
323     now optional (HTML5 revision 1731).
324    
325     2008-10-04 Wakaba <wakaba@suika.fam.cx>
326    
327 wakaba 1.336 * HTML.pm.src: New "special" elements added to the list (HTML5
328     revision 1778). "strile" -> "strike".
329    
330 wakaba 1.335 2008-10-02 Wakaba <wakaba@suika.fam.cx>
331    
332     * ContentType.pm (get_sniffed_type): Support for the "better"
333     content sniffing (HTML5 revision 1927). In a case the official
334     type was not returned when the method is invoked in the list
335     context.
336    
337 wakaba 1.334 2008-09-22 Wakaba <wakaba@suika.fam.cx>
338    
339     * HTML.pm.src: Character references for non-space C0 characters,
340     including U+000B VT, DEL character, noncharacter code points, are
341     now converted to the U+FFFD character (cf. HTML5 revision 2138).
342    
343 wakaba 1.330 2008-09-21 Wakaba <wakaba@suika.fam.cx>
344    
345 wakaba 1.333 * ContentChecker.pm: |form=""| check support added.
346    
347     2008-09-21 Wakaba <wakaba@suika.fam.cx>
348    
349 wakaba 1.332 * ContentChecker.pm: |contextmenu| validness is now checked using
350     |id| and |id_type| properties, and |menu| property is removed.
351    
352     2008-09-21 Wakaba <wakaba@suika.fam.cx>
353    
354 wakaba 1.331 * ContentChecker.pm: Prepare for |form| |name| attribute's
355     duplication checking.
356    
357     2008-09-21 Wakaba <wakaba@suika.fam.cx>
358    
359 wakaba 1.330 * HTML.pm.src (parse_byte_stream): Support (or non-support) for
360     unsupported charset="" parameter value (HTML5 revision 2131).
361    
362 wakaba 1.325 2008-09-20 Wakaba <wakaba@suika.fam.cx>
363    
364 wakaba 1.329 * HTML.pm.src: Reminding places where U+000B is allowed as a space
365     character is fixed (cf. HTML5 revision 1738).
366    
367     * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
368     space characters (HTML5 revision 1738).
369    
370     2008-09-20 Wakaba <wakaba@suika.fam.cx>
371    
372 wakaba 1.328 * HTML.pm.src: The "anything else" case for the "after after body"
373     insertion mode was not updated to swtich to the "in body"
374     insertion mode. U+000B is no longer a space character for the
375     purpose of tree construction phase (HTML5 revision 1738).
376    
377     2008-09-20 Wakaba <wakaba@suika.fam.cx>
378    
379 wakaba 1.327 * HTML.pm.src: U+000B is no longer a space character (HTML5
380     revision 1738).
381    
382     2008-09-20 Wakaba <wakaba@suika.fam.cx>
383    
384 wakaba 1.326 * ContentType.pm: 0x0B is no longer a space character (HTML5
385     revision 1738).
386    
387     * HTML.pm.src: U+000B is no longer a space character for the
388     algorithm for extracting an encoding from a Content-Type (HTML5
389     revision 1738).
390    
391     2008-09-20 Wakaba <wakaba@suika.fam.cx>
392    
393 wakaba 1.325 * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
394    
395 wakaba 1.320 2008-09-18 Wakaba <wakaba@suika.fam.cx>
396    
397 wakaba 1.324 * LangTag.pm: Add checks for remaining requirements from RFC 4646.
398    
399     * mklangreg.pl: Sort 'Prefix' values by their length, to ease
400     matching.
401    
402     2008-09-18 Wakaba <wakaba@suika.fam.cx>
403    
404 wakaba 1.323 * LangTag.pm: Warn for private use language subtags. Error level
405     typos fixed. Support for Suppress-Script field.
406    
407     * mklangreg.pl: Support for dumping of nested structure.
408    
409     2008-09-18 Wakaba <wakaba@suika.fam.cx>
410    
411 wakaba 1.322 * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
412     recommended case as per RFC 4646.
413    
414     2008-09-18 Wakaba <wakaba@suika.fam.cx>
415    
416 wakaba 1.321 * LangTag.pm (check_rfc4646_langtag): New method.
417    
418     2008-09-18 Wakaba <wakaba@suika.fam.cx>
419    
420 wakaba 1.320 * mklangreg.pl: New script.
421    
422     * Makefile: Updated for creation of the module for language subtag
423     registry.
424    
425 wakaba 1.315 2008-09-16 Wakaba <wakaba@suika.fam.cx>
426    
427 wakaba 1.319 * Makefile: WebIDL.html added.
428    
429     * WebIDL.pod: New documentation.
430    
431     2008-09-16 Wakaba <wakaba@suika.fam.cx>
432    
433 wakaba 1.318 * WebIDL.pm: Checker's error types are redefined.
434    
435     2008-09-16 Wakaba <wakaba@suika.fam.cx>
436    
437 wakaba 1.317 * WebIDL.pm: Parser's error types are redefined. Some forward
438     compatible parsing bugs are fixed. Some unreachable codes are
439     commented out.
440    
441     2008-09-16 Wakaba <wakaba@suika.fam.cx>
442    
443 wakaba 1.316 * WebIDL.pm: Support for the reminding extended attributes are
444     added. It does not satisfy the definition that a forward
445     interface declaration has an extended attribute. It seems that
446     unless explicitly allowed multiple extended attributes with the
447     same name is not allowed, though it is not explicitly mentioned in
448     the spec.
449    
450     2008-09-16 Wakaba <wakaba@suika.fam.cx>
451    
452 wakaba 1.315 * WebIDL.pm: Unescapes extended attribute names and extended
453     attribute identifiers. Preserve whether an extended attribute has
454     an argument list of not. Support for extended attributes:
455     Constructor, ExceptionConsts, IndexGetter, IndexSetter,
456     NameGetter, NameSetter, and Null.
457     (has_argument_list): New attribute.
458     (idl_text): Stringifies argument lists, if any, even if it is
459     empty.
460    
461 wakaba 1.310 2008-09-15 Wakaba <wakaba@suika.fam.cx>
462    
463 wakaba 1.314 * HTML.pm.src: New state |PCDATA_STATE|. Use an empty string for
464     |{s_kwd}| in DATA_STATE as default.
465    
466     2008-09-15 Wakaba <wakaba@suika.fam.cx>
467    
468 wakaba 1.313 * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
469     by |{s_kwd}| in DATA_STATE.
470    
471     2008-09-15 Wakaba <wakaba@suika.fam.cx>
472    
473 wakaba 1.312 * HTML.pm.src: Shorten keys.
474    
475     2008-09-15 Wakaba <wakaba@suika.fam.cx>
476    
477 wakaba 1.311 * HTML.pm.src: Remove checking for control character, surrogate
478     pair, or noncharacter code points and non-Unicode code
479     points (they should be handled by Whatpm::Charset::UnicodeChecker).
480     (parse_char_stream): Support for the |$get_wrapper| argument and
481     character stream error handlers.
482    
483     2008-09-15 Wakaba <wakaba@suika.fam.cx>
484    
485 wakaba 1.310 * ContentChecker.pm: Don't call |loda_ns_module|
486     for null-namespace elements/attributes.
487    
488     * HTML.pm.src: Fact out $disallowed_control_chars
489     as a hash.
490    
491 wakaba 1.300 2008-09-14 Wakaba <wakaba@suika.fam.cx>
492    
493 wakaba 1.309 * HTML.pm.src: Regexp typo fixed. |{prev_char}|
494     and |{next_char}| initializations are moved to initialization
495     method. |{read_until}| now supports buffering. Sync |set_inner_html|
496     with |parse_char_stream|.
497    
498     2008-09-14 Wakaba <wakaba@suika.fam.cx>
499    
500 wakaba 1.308 * HTML.pm.src (parse_char_stream): Make |set_next_char|
501     invoke |manakai_read_until|, not only |read|, where
502     possible, to decrease the number of |read| method calls.
503    
504     * mkhtmlparser.pl: Related changes to the aforementioned
505     modification.
506    
507     2008-09-14 Wakaba <wakaba@suika.fam.cx>
508    
509 wakaba 1.307 * HTML.pm.src: Use |read| instead of |getc|. |set_inner_html|
510     would report character error from now.
511    
512     2008-09-14 Wakaba <wakaba@suika.fam.cx>
513    
514 wakaba 1.306 * HTML.pm.src: White-space-leaded non-white-space character
515     tokens in "before head insertion mode" was not
516     correctly handled.
517     (set_inner_html): Reimplemented using CharString decodehandle
518     class. Support for $get_wrapper argument. Support
519     for |{read_until}| feature.
520    
521     2008-09-14 Wakaba <wakaba@suika.fam.cx>
522    
523 wakaba 1.305 * HTML.pm.src: Make a "bare ero" error for unknown
524     entities point the "&" character.
525    
526     2008-09-14 Wakaba <wakaba@suika.fam.cx>
527    
528 wakaba 1.304 * HTML.pm.src: It turns out that U+FFFD don't have to
529     be added to the list of excluded characters.
530    
531     2008-09-14 Wakaba <wakaba@suika.fam.cx>
532    
533 wakaba 1.303 * HTML.pm.src ($char_onerror): Have character decoder's |line|
534     and |column| a higher priority than the one set by the
535     tokenizer's input handler.
536     ($self->{read_until}): Exclude U+FFFD (but this might
537     not be necessary, since now we do line/column fixup in
538     the character decode handle).
539    
540     2008-09-14 Wakaba <wakaba@suika.fam.cx>
541    
542 wakaba 1.302 * HTML.pm.src: Use |{read_until}| where possible.
543    
544     2008-09-14 Wakaba <wakaba@suika.fam.cx>
545    
546 wakaba 1.301 * HTML.pm.src: Change |{getc_until}| to |{read_until}|
547     and |manakai_getc_until| to |manakai_read_until| to
548     reduce the number of string copies.
549    
550     2008-09-14 Wakaba <wakaba@suika.fam.cx>
551    
552 wakaba 1.300 * HTML.pm.src (parse_char_string): Use newly created
553     |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
554     standard feature to |open| a string as a filehandle,
555     since Perl's string filehandle seems not supporting |ungetc|
556     method correctly.
557     (parse_char_stream): Define |{getc_until}| method.
558     (DATA_STATE): Experimental support for |getc_until| feature.
559    
560 wakaba 1.292 2008-09-13 Wakaba <wakaba@suika.fam.cx>
561    
562 wakaba 1.299 * HTML.pm.src: Check points added to newly added branches.
563    
564     2008-09-13 Wakaba <wakaba@suika.fam.cx>
565    
566 wakaba 1.298 * HTML.pm.src: Remove |{char}|, which is no longer used.
567     Remove |{entity_in_attr}| and |{last_attribute_value_state}|
568     and replaced by |{prev_state}|.
569    
570     * mkhtmlparser.pl: Remove |{char}| feature.
571     Remove |!!!back-next-input-character;| macro.
572    
573     2008-09-13 Wakaba <wakaba@suika.fam.cx>
574    
575 wakaba 1.297 * HTML.pm.src: Finally we get rid of all the inner loops. Remove
576     entity related tokenizer states in favor of new states
577     implementing the consume character reference algorithm.
578    
579     2008-09-13 Wakaba <wakaba@suika.fam.cx>
580    
581     * HTML.pm.src: "Consume a character reference" algorithm is
582 wakaba 1.296 now implemented as a tokenizer's state, rather than
583     a method, with minimum changes (more changes will
584     be made, in due course). "Bogus comment state"'s inner
585     loop gets removed.
586    
587     2008-09-13 Wakaba <wakaba@suika.fam.cx>
588    
589 wakaba 1.297 * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
590 wakaba 1.295 into their own tokenizer states.
591    
592     2008-09-13 Wakaba <wakaba@suika.fam.cx>
593    
594 wakaba 1.294 * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
595     is split into three states.
596    
597     2008-09-13 Wakaba <wakaba@suika.fam.cx>
598    
599 wakaba 1.293 * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
600     itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
601     no longer does the tokenizer have to push back next input
602     characters in those states.
603    
604     2008-09-13 Wakaba <wakaba@suika.fam.cx>
605    
606 wakaba 1.292 * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
607     into four states so that no longer does the tokenizer have to push
608     back next input characters in that state.
609    
610 wakaba 1.291 2008-09-11 Wakaba <wakaba@suika.fam.cx>
611    
612     * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
613     which can be used to insert some wrapper between the character
614     stream handle and the tokenizer. (It is currently not supported
615     for |set_inner_html| for |Element|s).
616    
617 wakaba 1.289 2008-09-10 Wakaba <wakaba@suika.fam.cx>
618    
619 wakaba 1.290 * HTML.pm.src: Ignore punctuations in charset names.
620    
621     2008-09-10 Wakaba <wakaba@suika.fam.cx>
622    
623 wakaba 1.289 * ContentChecker.pm: Support for charset-layer error levels.
624    
625     * HTML.pm.src: Don't specify |text| argument for the
626     |chardecode:fallback| error, since it is not the encoding
627     being used alternatively.
628    
629 wakaba 1.288 2008-09-06 Wakaba <wakaba@suika.fam.cx>
630    
631     * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
632    
633 wakaba 1.286 2008-08-31 Wakaba <wakaba@suika.fam.cx>
634    
635 wakaba 1.287 * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
636    
637     2008-08-31 Wakaba <wakaba@suika.fam.cx>
638    
639 wakaba 1.286 * HTML.pm.src: Bug fix and sync with the spec with regard
640     to after after frameset insertion mode processing (HTML5
641     revision 1909). Note that the implementation was wrong
642     per the old spec before the r1909 changes.
643    
644 wakaba 1.278 2008-08-30 Wakaba <wakaba@suika.fam.cx>
645    
646 wakaba 1.285 * HTMLTable.pm: scope=auto algorithm fix synced with the
647     spec (HTML5 revision 2093).
648     ($process_row): Algorithm step numbers synced with the
649     spec (HTML5 revision 2092).
650    
651     2008-08-30 Wakaba <wakaba@suika.fam.cx>
652    
653 wakaba 1.284 * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
654     revision 2094).
655    
656     2008-08-30 Wakaba <wakaba@suika.fam.cx>
657    
658     * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
659    
660     2008-08-30 Wakaba <wakaba@suika.fam.cx>
661    
662 wakaba 1.283 * HTML.pm.src: '"' and "'" at the end of attribute
663     name (after another attribute) now raise parse error (HTML5
664     revision 2123). Empty unquoted attribute values are no
665     longer allowed (HTML5 revision 2122).
666    
667     2008-08-30 Wakaba <wakaba@suika.fam.cx>
668    
669 wakaba 1.282 * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
670     revision 2130).
671    
672     2008-08-30 Wakaba <wakaba@suika.fam.cx>
673    
674 wakaba 1.281 * ContentChecker.pm: |xml:lang| attribute value must be same
675     as |lang| attribute value for HTML elements (HTML5 revision 2062
676     and so on).
677    
678     2008-08-30 Wakaba <wakaba@suika.fam.cx>
679    
680 wakaba 1.280 * ContentChecker.pm: Error level definition for |xml_id_error|
681     was missing.
682    
683     * URIChecker.pm: The end of the URL should be marked as the
684     error location for an empty path error. The position
685     between the userinfo and the port components should be
686     marked as the error location for an empty host error.
687    
688     2008-08-30 Wakaba <wakaba@suika.fam.cx>
689    
690 wakaba 1.279 * URIChecker.pm: Set parameters representing where in the
691     value the error occurs for errors. Report unknown
692     address format error in warning level, since address
693     formats are rarely added. Path segments starting with "/.."
694     were misinterpreted as a dot-segment.
695    
696     2008-08-30 Wakaba <wakaba@suika.fam.cx>
697    
698 wakaba 1.278 * URIChecker.pm (check_iri_reference): Requires
699     |Message::DOM::DOMImplementation|.
700    
701 wakaba 1.277 2008-08-29 Wakaba <wakaba@suika.fam.cx>
702    
703     * IMTChecker.pm: Updated for the new error reporting architecture.
704    
705     * ContentChecker.pm: Error levels for IMTs are added.
706    
707 wakaba 1.276 2008-08-17 Wakaba <wakaba@suika.fam.cx>
708    
709     * H2H.pm (_shift_token): Support for unquoted HTML attribute
710     values.
711    
712 wakaba 1.274 2008-08-16 Wakaba <wakaba@suika.fam.cx>
713    
714 wakaba 1.275 * CacheManifest.pm: Support for new style of error
715     reports.
716    
717     * HTML.pm.src: Set line=1, column=1 to the document node.
718    
719     2008-08-16 Wakaba <wakaba@suika.fam.cx>
720    
721 wakaba 1.274 * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
722     and URL checkers. Support for more error levels for bogus
723     langauge tag and URL "standards".
724    
725     * LangTag.pm, URIChecker.pm: Support for new style error
726     level reporting.
727    
728 wakaba 1.271 2008-08-15 Wakaba <wakaba@suika.fam.cx>
729    
730 wakaba 1.273 * ContentChecker.pm: Support for RDF/XML error levels.
731    
732     * HTMLTable.pm, RDFXML.pm: Support for new style of error level
733     specifying. Error types are revised.
734    
735     2008-08-15 Wakaba <wakaba@suika.fam.cx>
736    
737 wakaba 1.272 * ContentChecker.pm: All error reporting method calls are
738     renewed.
739    
740     2008-08-15 Wakaba <wakaba@suika.fam.cx>
741    
742 wakaba 1.271 * HTML.pm.src: All error type names and "text" parameters
743     are revised. Use new style for "level" specification.
744    
745     * mkhtmlparser.pl: Use new style for "level" specification.
746    
747 wakaba 1.268 2008-08-03 Wakaba <wakaba@suika.fam.cx>
748    
749 wakaba 1.270 * WebIDL.pm (parse_char_string): Simplified error
750     reporting process for broken ignored valuetype definition.
751     (Valuetype idl_text): Support for special "DOMString" name.
752    
753     2008-08-03 Wakaba <wakaba@suika.fam.cx>
754    
755 wakaba 1.269 * WebIDL.pm ($get_scoped_name): Append "::::" if the last
756     terminal of the ScopedName is "DOMString", such that whether
757     the last part of the scoped name is "DOMString" or "_DOMString"
758     later. It is necessary to determine whether a |typedef|
759     definition should be ignored or not.
760     (parse_char_string): Unescape the identifier of
761     exception members.
762     ($resolve): Return undef for builtin types and sequence<T>
763     types (we might not have to do this, however...).
764     (check): Support checking for Exceptions, Valuetypes,
765     and Typedefs.
766     ($serialize_type): Support for "DOMString::::" syntax.
767     (Typedef idl_text): Output Type as "DOMString" if it
768     is really "DOMString" (i.e. its internal representation
769     is "::DOMString::").
770    
771     2008-08-03 Wakaba <wakaba@suika.fam.cx>
772    
773 wakaba 1.268 * WebIDL.pm ($resolve): New code, based on resolve code
774     for constant types in the |check| method.
775     (check): Support for checking of attributes, operations, and
776     arguments.
777     (Attribute/Operation idl_text): Exception names in getraises,
778     setraises, and raises clauses is serizlied by |$serialize_type|
779     code.
780    
781 wakaba 1.264 2008-08-02 Wakaba <wakaba@suika.fam.cx>
782    
783 wakaba 1.267 * WebIDL.pm ($integer): Order of selections are changed to match
784     hexadecimal numbers (the original pattern, taken from the spec,
785     was not work for hexadecimal numbers, because the "0" prefix
786     matches to the [0-7]* part (as an empty string) and therefore
787     it does not match with remaining "x..." part of a "0x..." integer
788     literal.
789     ($get_type): It now returns a string, not an array reference,
790     for regular types and |sequence| types (i.e. it in any case
791     returns a string).
792     ($get_next_token): The second item in the array that represents
793     a integer or float token is now a Perl number value, not the
794     original string representation of the number.
795     (check): Support for const value consistency checking.
796     No extended attribute is defined for constants.
797     (Node subclasses): Use simple strings rather than array references
798     for default data type values.
799     ($serialize_type): Type values are now simple strings.
800     (value): If the new attribute value is a false value, then
801     a FALSE value is set to the attribute.
802    
803     2008-08-02 Wakaba <wakaba@suika.fam.cx>
804    
805 wakaba 1.266 * WebIDL.pm ($get_scoped_name): Now scoped names are stored
806     in its stringified format ("scoped name" as defined in the
807     spec). Note that future version of this module should not use
808     array references for type values and the |type_text| attribute
809     should be made obsolete.
810     (parse_char_string): Unescape attribute names.
811     (check): Support for checking of whether inherited interfaces
812     are actually defined or not. Support for checking of whether
813     interface member identifiers are duplicated or not.
814     ($serialize_type): Scoped names are returned as is. A future
815     version of this code should escape identifiers other than "DOMString",
816     otherwise the idl_text would be non-conforming.
817    
818     2008-08-02 Wakaba <wakaba@suika.fam.cx>
819    
820 wakaba 1.264 * WebIDL.pm (parse_char_string): Set line/column numbers
821 wakaba 1.265 to generated nodes. Unescape identifiers. Extended attributes
822     for Definition's were ignored.
823     (append_child): Set |parent_node| attribute.
824     (parent_node): New attribute.
825     (check): Support interface/exception members. Support
826     extended attributes. Support definition identifier uniqueness
827     constraint.
828     (qualified_name): New attribute.
829     (Interface/Exception idl_text): Extended attributes were
830     not prepended to the returned text.
831    
832     2008-08-02 Wakaba <wakaba@suika.fam.cx>
833    
834     * WebIDL.pm (parse_char_string): Set line/column numbers
835 wakaba 1.264 to interface object experimentally. s/shift/pop/g, shift
836     would make things wrong. Support for interface forward
837     declarations was missing. Broken interface declarations
838     with no block were not ignored entirely.
839     (Whatpm::WebIDL::Node): New abstract class. This class
840     makes things easier.
841     (child_nodes): New attribute. Unlike DOM's attribute with
842     same name, this attribute returns a dead list of nodes for
843     simplicity.
844     (get_user_data, set_user_data): New methods.
845     (Module idl_text): A SPACE character should be inserted
846     before the |{| character.
847     (Interface idl_text): Support for interface forward declarations.
848     (is_forward_declaration): New attribute.
849    
850 wakaba 1.259 2008-07-19 Wakaba <wakaba@suika.fam.cx>
851    
852 wakaba 1.263 * WebIDL.pm (type_text): Better serializer.
853    
854     2008-07-19 Wakaba <wakaba@suika.fam.cx>
855    
856 wakaba 1.262 * WebIDL.pm: Revise forward-compatible parsing so that
857     it now can handle broken extended attributes and as such.
858    
859     2008-07-19 Wakaba <wakaba@suika.fam.cx>
860    
861 wakaba 1.261 * WebIDL.pm: Real support for extended attributes.
862     Support for extended attributes with arguments.
863    
864     2008-07-19 Wakaba <wakaba@suika.fam.cx>
865    
866 wakaba 1.260 * WebIDL.pm: Support for |exception| syntax.
867     (Interface->idl_text): Tentative support for inheritances.
868    
869     2008-07-19 Wakaba <wakaba@suika.fam.cx>
870    
871 wakaba 1.259 * WebIDL.pm: Hierarchical scoped name support was broken.
872     Support for raises, setraises, and getraises syntaxes.
873    
874 wakaba 1.258 2008-07-18 Wakaba <wakaba@suika.fam.cx>
875    
876     * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
877     proper support for types, extended attributes, and exceptions yet).
878     WebIDL parser, version 1 (no support for exceptions yet,
879     no proper support for extended attributes yet).
880    
881 wakaba 1.257 2008-07-09 Wakaba <wakaba@suika.fam.cx>
882    
883     * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
884    
885 wakaba 1.255 2008-06-29 Wakaba <wakaba@suika.fam.cx>
886    
887 wakaba 1.256 * WebIDL.pm: Support for valuetype and const.
888    
889     2008-06-29 Wakaba <wakaba@suika.fam.cx>
890    
891 wakaba 1.255 * WebIDL.pm: New module.
892    
893 wakaba 1.254 2008-06-15 Wakaba <wakaba@suika.fam.cx>
894    
895     * Makefile (Entities.html): URI changed.
896    
897 wakaba 1.253 2008-06-08 Wakaba <wakaba@suika.fam.cx>
898    
899     * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
900    
901 wakaba 1.251 2008-06-01 Wakaba <wakaba@suika.fam.cx>
902    
903 wakaba 1.252 * HTML.pm.src (_get_next_token): A parse error was missing.
904    
905     2008-06-01 Wakaba <wakaba@suika.fam.cx>
906    
907 wakaba 1.251 * mklinktypelist.pl: rel=contact is no longer part of the HTML5
908     spec (commented out). (HTML5 revision 1711).
909    
910 wakaba 1.248 2008-05-25 Wakaba <wakaba@suika.fam.cx>
911    
912 wakaba 1.250 * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
913    
914     * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
915     as UTF-16 (HTML5 revision 1701).
916    
917     2008-05-25 Wakaba <wakaba@suika.fam.cx>
918    
919 wakaba 1.249 * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
920     1692).
921    
922     2008-05-25 Wakaba <wakaba@suika.fam.cx>
923    
924     * HTML.pm.src: The secondary insertion mode used when switching
925     to foreign content is the "in body" insertion mode (HTML5 revision
926     1696).
927    
928     2008-05-25 Wakaba <wakaba@suika.fam.cx>
929    
930 wakaba 1.248 * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
931     revision 1697).
932    
933 wakaba 1.239 2008-05-24 Wakaba <wakaba@suika.fam.cx>
934    
935 wakaba 1.247 * HTML.pm.src: Support for end-of-file token in foreign content
936     insertion mode (HTML5 revision 1693). Update SVG camelCase
937     attribute list (HTML5 revision 1700). <textarea> closes
938     </select> (HTML5 revision 1699). More start tags close in
939     foreign content insertion mode (HTML5 revision 1698).
940    
941     2008-05-24 Wakaba <wakaba@suika.fam.cx>
942    
943 wakaba 1.246 * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
944    
945     2008-05-24 Wakaba <wakaba@suika.fam.cx>
946    
947 wakaba 1.245 * HTML.pm.src: More robust charset parameter detection (HTML5
948     revision 1674).
949    
950     2008-05-24 Wakaba <wakaba@suika.fam.cx>
951    
952 wakaba 1.244 * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
953     revision 1676).
954    
955     2008-05-24 Wakaba <wakaba@suika.fam.cx>
956    
957 wakaba 1.243 * HTML.pm.src: Ignore language part of public identifiers for
958     quriks mode detection (HTML5 revision 1679).
959    
960     2008-05-24 Wakaba <wakaba@suika.fam.cx>
961    
962 wakaba 1.242 * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
963     revision 1685).
964    
965     2008-05-24 Wakaba <wakaba@suika.fam.cx>
966    
967 wakaba 1.241 * HTML.pm.src: Support for EOF in new states for tags (HTML5
968     revision 1684).
969    
970     2008-05-24 Wakaba <wakaba@suika.fam.cx>
971    
972 wakaba 1.240 * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
973     work (HTML5 revision 1690).
974    
975     2008-05-24 Wakaba <wakaba@suika.fam.cx>
976    
977 wakaba 1.239 * HTML.pm.src (_tree_construction_main): Change handling of
978     end tags in head insertion modes (HTML5 revision 1686).
979     (parse_char_string): Bug fix for non-utf8 character string handlings.
980     (parse_char_stream): |ungetc| does not work well for this context.
981    
982 wakaba 1.237 2008-05-18 Wakaba <wakaba@suika.fam.cx>
983    
984 wakaba 1.238 * HTML.pm.src (parse_byte_string): Redefined to invoke
985     |parse_byte_stream|.
986     (parse_byte_stream): New method.
987    
988     2008-05-18 Wakaba <wakaba@suika.fam.cx>
989    
990 wakaba 1.237 * HTML.pm.src (parse_byte_string): Fix the column number reported
991     by encoding layer error reporter.
992    
993 wakaba 1.233 2008-05-17 Wakaba <wakaba@suika.fam.cx>
994    
995 wakaba 1.236 * HTML.pm.src (parse_byte_string): Use streaming decoder
996     rather than converting the whole byte string and then parsing.
997     Propagate errors in character encoding layer.
998     (get_next_token): Precise error reporting for |bare stago| error.
999    
1000     2008-05-17 Wakaba <wakaba@suika.fam.cx>
1001    
1002 wakaba 1.235 * HTML.pm.src (parse_char_stream): New method.
1003     (parse_char_string): This method is now defined as an invocation
1004     of the |parse_char_stream| method.
1005    
1006     2008-05-17 Wakaba <wakaba@suika.fam.cx>
1007    
1008 wakaba 1.234 * HTML.pm.src (parse_byte_string): Report various status
1009     of the sniffing as info-level errors. Support for new
1010     decoding framework in parser resestting.
1011     (new): Various default error levels were not set.
1012    
1013     2008-05-17 Wakaba <wakaba@suika.fam.cx>
1014    
1015 wakaba 1.233 * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
1016     algorithm, except for the actual sniffing, is implemented
1017     with new framework with Message::Charset::Info.
1018    
1019 wakaba 1.232 2008-05-16 Wakaba <wakaba@suika.fam.cx>
1020    
1021     * CacheManifest.pm (_parse): Drop fragment identifiers from
1022     URIs in fallback section (HTML5 revision 1596).
1023    
1024 wakaba 1.230 2008-05-10 Wakaba <wakaba@suika.fam.cx>
1025    
1026 wakaba 1.231 * Makefile (Entities.html): URI has changed.
1027    
1028     2008-05-10 Wakaba <wakaba@suika.fam.cx>
1029    
1030 wakaba 1.230 * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
1031     1553).
1032    
1033 wakaba 1.228 2008-05-06 Wakaba <wakaba@suika.fam.cx>
1034    
1035 wakaba 1.229 * ContentChecker.pm: Noted that those returned in |table| are
1036     no longer table elements, but table objects returned
1037     by Whatpm::HTMLTable.
1038    
1039     * HTMLTable.pm (form_table): Return table element node
1040     as |$table->{element}|.
1041     (assign_header): Support for the |headers=""| attribute.
1042    
1043     2008-05-06 Wakaba <wakaba@suika.fam.cx>
1044    
1045 wakaba 1.228 * HTMLTable.pm (assign_header): New function; first version
1046     with no support for headers="".
1047     (form_table): Include table width and height to the returned
1048     table object for covenience. Indexing in column assignement was wrong.
1049     Set whether a data cell is empty or not for convenience.
1050    
1051 wakaba 1.222 2008-05-05 Wakaba <wakaba@suika.fam.cx>
1052    
1053 wakaba 1.227 * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
1054    
1055     2008-05-05 Wakaba <wakaba@suika.fam.cx>
1056    
1057 wakaba 1.226 * HTMLTable.pm: How table model errors are detected is
1058     changed (HTML5 revision 1387).
1059    
1060     2008-05-05 Wakaba <wakaba@suika.fam.cx>
1061    
1062 wakaba 1.225 * HTMLTable.pm: The algorithm now moves |tfoot| elements
1063     to the end of the table (HTML5 revision 1380).
1064    
1065     2008-05-05 Wakaba <wakaba@suika.fam.cx>
1066    
1067 wakaba 1.224 * HTMLTable.pm: The algorithm is now 0-based indexing, instead
1068     of 1-based (HTML5 revision 1376).
1069    
1070     2008-05-05 Wakaba <wakaba@suika.fam.cx>
1071    
1072 wakaba 1.223 * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
1073     and "Content-Encoding" no longer prevent sniffing (HTML5
1074     revision 1288).
1075    
1076     2008-05-05 Wakaba <wakaba@suika.fam.cx>
1077    
1078 wakaba 1.222 * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
1079     1282).
1080    
1081 wakaba 1.220 2008-05-03 Wakaba <wakaba@suika.fam.cx>
1082    
1083 wakaba 1.221 * ContentChecker.pm: Support for global attributes.
1084     Status of XML specs are added.
1085    
1086     2008-05-03 Wakaba <wakaba@suika.fam.cx>
1087    
1088 wakaba 1.220 * ContentChecker.pm (check_element): Support for |template=""|
1089     and |ref=""| attribute (referent element type checking).
1090    
1091 wakaba 1.218 2008-04-29 Wakaba <wakaba@suika.fam.cx>
1092    
1093 wakaba 1.219 * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
1094     1500) is implemented (except for IDNA part and URI-scheme-specific
1095 wakaba 1.218 knowledge). Line number counting was wrong for LF-only
1096     documents.
1097    
1098 wakaba 1.211 2008-04-13 Wakaba <wakaba@suika.fam.cx>
1099    
1100 wakaba 1.217 * HTML.pm.src: Raise an parse error for any disallowed
1101     character (HTML5 revision 1263).
1102    
1103     2008-04-13 Wakaba <wakaba@suika.fam.cx>
1104    
1105 wakaba 1.216 * mkentitylist.pl: Support for new HTML5 entity table format (the
1106     definition for |AElig;| was missing).
1107    
1108     2008-04-13 Wakaba <wakaba@suika.fam.cx>
1109    
1110 wakaba 1.215 * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
1111     name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
1112    
1113     2008-04-13 Wakaba <wakaba@suika.fam.cx>
1114    
1115 wakaba 1.214 * HTML.pm.src: List of element names that close foreign content
1116     insertion mode is added (HTML5 revisions 1412 and 1418).
1117    
1118     2008-04-13 Wakaba <wakaba@suika.fam.cx>
1119    
1120 wakaba 1.213 * HTML.pm.src: Support for |mglyph| and |malignmark|
1121     elements (HTML5 revision 1410).
1122    
1123     2008-04-13 Wakaba <wakaba@suika.fam.cx>
1124    
1125 wakaba 1.212 * HTML.pm.src: Support for new long MathML entities (HTML5
1126     revision 1406).
1127    
1128     2008-04-13 Wakaba <wakaba@suika.fam.cx>
1129    
1130 wakaba 1.211 * HTML.pm.src: CDATA section support for MathML and SVG
1131     elements (HTML5 revision 1404 and 1420).
1132    
1133 wakaba 1.209 2008-04-12 Wakaba <wakaba@suika.fam.cx>
1134    
1135 wakaba 1.210 * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
1136     elements (HTML5 revision 1404). Unused !!!macro definitions
1137     are removed.
1138    
1139     2008-04-12 Wakaba <wakaba@suika.fam.cx>
1140    
1141 wakaba 1.209 * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
1142     are raised is changed (HTML5 revision 1404).
1143    
1144 wakaba 1.206 2008-04-06 Wakaba <wakaba@suika.fam.cx>
1145    
1146 wakaba 1.208 * HTML.pm.src: Category sets that are no longer used are removed.
1147    
1148     2008-04-06 Wakaba <wakaba@suika.fam.cx>
1149    
1150 wakaba 1.207 * HTML.pm.src: The ->[1] property of stack entries are now
1151     replaced by constants representing element category.
1152    
1153     2008-04-06 Wakaba <wakaba@suika.fam.cx>
1154    
1155 wakaba 1.206 * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
1156     for error reporting. (This is a preparation for using constant
1157     value for ->[1].)
1158    
1159 wakaba 1.204 2008-03-22 Wakaba <wakaba@suika.fam.cx>
1160    
1161 wakaba 1.205 * RDFXML.pm: Typo fixed.
1162    
1163     2008-03-22 Wakaba <wakaba@suika.fam.cx>
1164    
1165 wakaba 1.204 * ContentChecker.pm: |fact_level| is now treated
1166     as same as |must_level|, i.e. level = |m|.
1167     (check_element): Make list of URIs in the DOM.
1168    
1169 wakaba 1.197 2008-03-21 Wakaba <wakaba@suika.fam.cx>
1170    
1171 wakaba 1.203 * RDFXML.pm: Language accessor implemented.
1172     Local (null-namespace) attribute support.
1173    
1174     2008-03-21 Wakaba <wakaba@suika.fam.cx>
1175    
1176 wakaba 1.202 * RDFXML.pm: Factored out ID checking code.
1177    
1178     2008-03-21 Wakaba <wakaba@suika.fam.cx>
1179    
1180 wakaba 1.201 * RDFXML.pm: TODO items noted. Validation of ID
1181     and URI attributes is implemented. Warn if unknown
1182     value is used in rdf:parseType="" attribute.
1183    
1184     * URIChecker.pm (check_rdf_uri_reference): New function.
1185    
1186     2008-03-21 Wakaba <wakaba@suika.fam.cx>
1187    
1188 wakaba 1.200 * RDFXML.pm: bnodeid implemented. Relative references
1189     are now resolved.
1190    
1191     2008-03-21 Wakaba <wakaba@suika.fam.cx>
1192    
1193 wakaba 1.199 * ContentChecker.pm: RDF reification implemented.
1194    
1195     * RDFXML.pm: undef vs false bug fixed.
1196     Reification implemented.
1197    
1198     2008-03-21 Wakaba <wakaba@suika.fam.cx>
1199    
1200 wakaba 1.198 * RDFXML.pm: s/id/ID/ for attribute name.
1201     The |node| arguments are added for |ontriple| calls.
1202     Too many "attribute not allowed" errors were raised.
1203    
1204     * ContentChecker.pm: Initial experimental support for rdf:RDF
1205     element.
1206    
1207     2008-03-21 Wakaba <wakaba@suika.fam.cx>
1208    
1209 wakaba 1.197 * RDFXML.pm: New module.
1210    
1211 wakaba 1.192 2008-03-20 Wakaba <wakaba@suika.fam.cx>
1212    
1213 wakaba 1.196 * HTML.pm.src (set_inner_html): Line/column number
1214     code was old one yet.
1215    
1216     2008-03-20 Wakaba <wakaba@suika.fam.cx>
1217    
1218 wakaba 1.195 * HTML.pm.src: Better line/column reporting for "duplicate attribute"
1219     errors. Line/column markings for DOCTYPE, comment, and
1220     character tokens are reintroduced; otherwise,
1221     error location for "not HTML5" error and errors
1222     for implied elements are not attached.
1223    
1224     2008-03-20 Wakaba <wakaba@suika.fam.cx>
1225    
1226 wakaba 1.194 * HTML.pm.src: Set line/column numbers to attributes.
1227    
1228     * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
1229     (value): Setter implemented.
1230    
1231     * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
1232    
1233     2008-03-20 Wakaba <wakaba@suika.fam.cx>
1234    
1235 wakaba 1.193 * HTML.pm.src: Unused line/column markings are removed.
1236    
1237     2008-03-20 Wakaba <wakaba@suika.fam.cx>
1238    
1239 wakaba 1.192 * HTML.pm.src (_get_next_token): Remove |first_start_tag|
1240     flag, which is no longer used.
1241    
1242 wakaba 1.190 2008-03-17 Wakaba <wakaba@suika.fam.cx>
1243    
1244 wakaba 1.191 * HTML.pm.src: Set line/column information to element nodes.
1245    
1246     * mkhtmlparser.pl (!!!create-element, !!!insert-element,
1247     and !!!insert-element-t): Set line/column information to
1248     element nodes.
1249    
1250     2008-03-17 Wakaba <wakaba@suika.fam.cx>
1251    
1252 wakaba 1.190 * HTML.pm.src (_get_next_token): The first "<" character
1253     in "<?", "<>", or "</>" should be the error point.
1254    
1255 wakaba 1.187 2008-03-16 Wakaba <wakaba@suika.fam.cx>
1256    
1257 wakaba 1.189 * HTML.pm.src: Some more fixes on error position reporting.
1258    
1259     2008-03-16 Wakaba <wakaba@suika.fam.cx>
1260    
1261 wakaba 1.188 * HTML.pm.src: Token-level precious error reporting.
1262    
1263     2008-03-16 Wakaba <wakaba@suika.fam.cx>
1264    
1265 wakaba 1.187 * HTML.pm.src: Preparation for more precious error point
1266     reporting.
1267    
1268 wakaba 1.184 2008-03-11 Wakaba <wakaba@suika.fam.cx>
1269    
1270 wakaba 1.186 * HTML.pm.src: Error type revised.
1271    
1272     2008-03-11 Wakaba <wakaba@suika.fam.cx>
1273    
1274 wakaba 1.185 * HTML.pm.src: Similar codes are merged together, again.
1275    
1276     2008-03-11 Wakaba <wakaba@suika.fam.cx>
1277    
1278 wakaba 1.184 * HTML.pm.src: Similar codes are merged together.
1279    
1280 wakaba 1.183 2008-03-10 Wakaba <wakaba@suika.fam.cx>
1281    
1282     * mkhtmlparser.pl: Set "level" parameter to parse errors.
1283    
1284     * HTML.pm.src: Code refined.
1285    
1286 wakaba 1.175 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1287    
1288 wakaba 1.182 * HTML.pm.src: |</body>| treatement has been changed (HTML5
1289     revision 1348). Note that I really don't know this makes
1290     any difference in the black-box behavior of the parser.
1291    
1292     2008-03-09 Wakaba <wakaba@suika.fam.cx>
1293    
1294 wakaba 1.181 * HTML.pm.src: New end-of-file token implementation (HTML5
1295     revision 1348).
1296    
1297     2008-03-09 Wakaba <wakaba@suika.fam.cx>
1298    
1299 wakaba 1.180 * HTML.pm.src: |applet| support (HTML5 revision 1347).
1300    
1301     2008-03-09 Wakaba <wakaba@suika.fam.cx>
1302    
1303 wakaba 1.179 * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
1304    
1305     2008-03-09 Wakaba <wakaba@suika.fam.cx>
1306    
1307 wakaba 1.178 * HTML.pm.src: Support for |<input>| in the "in select" insertion
1308     mode and support for the "in select in table" insertion mode (HTML5
1309     revision 1342).
1310    
1311     2008-03-09 Wakaba <wakaba@suika.fam.cx>
1312    
1313 wakaba 1.177 * HTML.pm.src: No foster parenting for <script> and <script>
1314     in non-tainted <table>s (HTML5 revision 1336).
1315    
1316     2008-03-09 Wakaba <wakaba@suika.fam.cx>
1317    
1318 wakaba 1.176 * HTML.pm.src: Ignore white space characters between <html>
1319     and <head> (HTML5 revision 1332).
1320    
1321     2008-03-09 Wakaba <wakaba@suika.fam.cx>
1322    
1323 wakaba 1.175 * HTML.pm.src: Treat <input type=hidden> as if it were a
1324     white space (HTML5 revision 1331).
1325    
1326 wakaba 1.164 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1327    
1328 wakaba 1.174 * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
1329     element (HTML5 revision 1330).
1330    
1331     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1332    
1333 wakaba 1.173 * HTML.pm.src: <title> is always appended to the current
1334     element (HTML5 revision 1328).
1335    
1336     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1337    
1338 wakaba 1.172 * HTML.pm.src: White space in tainted tables are moved
1339     into foster parents (HTML5 revision 1326).
1340    
1341     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1342    
1343 wakaba 1.171 * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
1344     revision 1321).
1345    
1346     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1347    
1348 wakaba 1.170 * HTML.pm.src: |</h/n/>| case code rearranged to align with
1349     the spec (HTML5 revision 1320). Note that we finally complete
1350     all of HTML5 revision 1320 changes.
1351    
1352     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1353    
1354 wakaba 1.169 * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
1355     tags (HTML5 revision 1320).
1356    
1357     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1358    
1359 wakaba 1.168 * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
1360    
1361     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1362    
1363 wakaba 1.167 * HTML.pm.src: A "generate implied end tags" code (t409.1)
1364     could not be reached so that it is now removed (HTML5 revision
1365     1320).
1366    
1367     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1368    
1369 wakaba 1.166 * HTML.pm.src: Code for the case of |</div>| and so on are revised
1370     to align with new spec text (HTML5 revision 1320).
1371    
1372     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1373    
1374 wakaba 1.165 * HTML.pm.src: Remove strange |if| condition; however, it should
1375     have had no harm in theory.
1376    
1377     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1378    
1379 wakaba 1.164 * HTML.pm.src (_tree_construction_main): '</p>' in body
1380     case is split from other end tags for the preparation
1381     of implementing HTML revision 1320.
1382    
1383 wakaba 1.162 2008-03-07 Wakaba <wakaba@suika.fam.cx>
1384    
1385 wakaba 1.163 * HTML.pm.src: Simplified "generate implied end tag" (HTML5
1386     revision 1320).
1387    
1388     2008-03-07 Wakaba <wakaba@suika.fam.cx>
1389    
1390 wakaba 1.162 * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
1391     and "div" (HTML5 revision 1318). Add comments to where
1392     |form| pointer association codes should be inserted (HTML5 revision
1393     1319).
1394    
1395 wakaba 1.161 2008-03-06 Wakaba <wakaba@suika.fam.cx>
1396    
1397     * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
1398    
1399 wakaba 1.159 2008-03-05 Wakaba <wakaba@suika.fam.cx>
1400    
1401 wakaba 1.160 * HTML.pm.src: Since the case t268 should never be reached (no
1402     other token type, there are), it is replaced by a |die| statement.
1403    
1404     2008-03-05 Wakaba <wakaba@suika.fam.cx>
1405    
1406 wakaba 1.159 * HTML.pm.src: Typo fixed.
1407    
1408 wakaba 1.158 2008-03-04 Wakaba <wakaba@suika.fam.cx>
1409    
1410     * HTML.pm.src (_tree_construction_initial): Some limited quirks
1411     doctypes were not uppercased for comparison.
1412    
1413 wakaba 1.152 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1414    
1415 wakaba 1.157 * HTML.pm.src (tree construction and set_inner_html): Checkpoints
1416     are added.
1417    
1418     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1419    
1420 wakaba 1.156 * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
1421     are set. Cases that are unlikely reached are noted as so.
1422    
1423     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1424    
1425 wakaba 1.155 * HTML.pm.src: Checkpoints for debugging are added.
1426    
1427     * mkhtmlparser.pl: Support for |!!!cp| syntax.
1428    
1429     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1430    
1431 wakaba 1.154 * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
1432     for simplicity.
1433    
1434     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1435    
1436 wakaba 1.153 * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
1437     revision 1307).
1438    
1439     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1440    
1441 wakaba 1.152 * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
1442     is set are changed (HTML5 revision 1305).
1443    
1444 wakaba 1.149 2008-03-02 Wakaba <wakaba@suika.fam.cx>
1445    
1446 wakaba 1.151 * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
1447     1292). Entities are not parsed in comment-like part in RCDATA
1448     elements (HTML5 revision 1294). Allow bare & at the end
1449     of attribute value literals (HTML5 revision 1296). More
1450     quirks mode doctypes (HTML5 revision 1302). Requires spaces
1451     between attributes and ban attribute names or unquoted
1452     attribute values containing single or double quotes (HTML5
1453     revision 1303).
1454    
1455     2008-03-02 Wakaba <wakaba@suika.fam.cx>
1456    
1457 wakaba 1.150 * ContentChecker.pm: Typo fixed. Don't raise "character encoding"
1458     and related errors unless it is an HTML document (though the spec
1459     is unclear on whether it is applied to XHTML document).
1460    
1461     * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
1462     are added.
1463    
1464     2008-03-02 Wakaba <wakaba@suika.fam.cx>
1465    
1466 wakaba 1.149 * HTML.pm.src: s/local_name/manakai_local_name/g.
1467    
1468 wakaba 1.148 2008-03-01 Wakaba <wakaba@suika.fam.cx>
1469    
1470     * _NamedEntityList.pm: Updated (HTML5 revision 1286).
1471    
1472     * HTML.pm.src: |charset| in |content| attribute is
1473     case-insensitive (HTML5 revision 1270).
1474    
1475 wakaba 1.147 2008-02-26 Wakaba <wakaba@suika.fam.cx>
1476    
1477     * ContentChecker.pm: New status constants are added.
1478     ($ElementDefault): |status| added.
1479     (check_element): Err for non-standard or deprecated elements.
1480     (_attr_status_info): For non-standard or deprecated attributes.
1481    
1482 wakaba 1.142 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1483    
1484 wakaba 1.146 * ContentChecker.pm (_attr_status_info): New internal method.
1485    
1486     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1487    
1488 wakaba 1.145 * ContentChecker.pm (check_element): Element standardized
1489     status information is now dispatched.
1490    
1491     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1492    
1493 wakaba 1.144 * ContentChecker.pm (check_element): Fix |del|-and-significant
1494     problem by adding some more arguments.
1495    
1496     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1497    
1498 wakaba 1.143 * ContentChecker.pm (check_element): Use context of
1499     container-for-the-purpose-of-content-model element (not transparent
1500     element) for |check_child_element| calling and significant
1501     text flag marking. This reintroduces |<del>|-and-significant
1502     problem again.
1503    
1504     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1505    
1506 wakaba 1.142 * ContentChecker.pm (check_element): Make semi-transparent
1507     elements ignored for the purpose of phase changes in
1508     content model checking.
1509    
1510 wakaba 1.138 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1511    
1512 wakaba 1.141 * ContentChecker.pm (check_element): In-element state
1513     was not properly managed for transparent cases.
1514    
1515     2008-02-23 Wakaba <wakaba@suika.fam.cx>
1516    
1517 wakaba 1.140 * ContentChecker.pm (check_element): Support for |video|
1518     and |audio| as semi-transparent elements.
1519    
1520     2008-02-23 Wakaba <wakaba@suika.fam.cx>
1521    
1522 wakaba 1.139 * ContentChecker.pm ($HTMLSemiTransparentElements): New.
1523     (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
1524     Support for |html:object| as a semi-transparent element.
1525    
1526     2008-02-23 Wakaba <wakaba@suika.fam.cx>
1527    
1528 wakaba 1.138 * ContentChecker.pm (check_element): The way to traverse
1529     the tree is entirely revised to make it easier to track
1530     the state of ancestors/descendants. As a result of this
1531     revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1532     support for content model checking for HTML elements |figure|,
1533     |object|, |video|, and |audio| and checking for XML elements (and
1534     some XMLNS checkings) are dropped for now. They will be
1535     reimplemented in due cource.
1536    
1537 wakaba 1.134 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1538    
1539 wakaba 1.137 * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1540     token (HTML5 revision 1225).
1541    
1542     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1543    
1544 wakaba 1.136 * ContentChecker.pm ({unsupported_level}): New value.
1545    
1546     * HTML.pm.src: Save whether |meta| |content| attribute
1547     contains character references or not.
1548    
1549     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1550    
1551 wakaba 1.135 * ContentChecker.pm (_get_children): (Incompleted) attempt to
1552     imlement significant content checking for contents
1553     with |del| elements.
1554    
1555     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1556    
1557 wakaba 1.134 * ContenteChecker.pm ($HTMLTransparentElements): More
1558     elements are added.
1559     (_get_children): HTML |object| elements are now semi-transparent.
1560    
1561     * NanoDOM.pm (manakai_html, manakai_head): New methods.
1562    
1563 wakaba 1.133 2008-02-16 Wakaba <wakaba@suika.fam.cx>
1564    
1565     * CacheManifest.pm: HTML5 revision 1211 implemented.
1566    
1567     * CacheManifest.pod: Updated.
1568    
1569 wakaba 1.132 2008-02-10 Wakaba <wakaba@suika.fam.cx>
1570    
1571     * ContentChecker.pm (check_document, check_element): Support
1572     for second argument ($onsubdoc).
1573     (_get_css_parser): Removed (now it is part of WDCC).
1574    
1575 wakaba 1.131 2008-02-09 Wakaba <wakaba@suika.fam.cx>
1576    
1577     * ContentChecker.pm (_get_css_parser): New.
1578    
1579 wakaba 1.129 2007-11-25 Wakaba <wakaba@suika.fam.cx>
1580    
1581 wakaba 1.130 * ContentChecker.pm ($AnyChecker): Old way to add child elements
1582     for checking had been used.
1583    
1584     2007-11-25 Wakaba <wakaba@suika.fam.cx>
1585    
1586 wakaba 1.129 * ContentChecker.pm (check_element): New todo item type |descendant|.
1587    
1588 wakaba 1.125 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1589    
1590 wakaba 1.128 * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1591     specced in RFC 2046 and RFC 4288.
1592     (application/atom+xml): Definition added.
1593    
1594     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1595    
1596 wakaba 1.127 * URIChecker.pm: Make RFC 3986 should-level errors
1597     warnings (rather than SHOULD-level errors).
1598    
1599     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1600    
1601 wakaba 1.126 * NanoDOM.pm (get_user_data, set_user_data): New methods.
1602    
1603     * HTML.pm.src: A flag for character references in attribute
1604     values are added. Set |manakai_has_reference| user data
1605     to |charset| attribute.
1606    
1607     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1608    
1609 wakaba 1.125 * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1610     attributes.
1611    
1612     * ContentChecker.pm (check_document): Warn if charset requirements
1613     cannot be tested.
1614    
1615 wakaba 1.124 2007-11-19 Wakaba <wakaba@suika.fam.cx>
1616    
1617     * HTML.pm.src (parse_byte_string): Detect charset
1618     by universalchardet if charset parameter is not specified.
1619    
1620     * Makefile (Charset-all, Charset-clean): New rules.
1621    
1622 wakaba 1.120 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1623    
1624 wakaba 1.123 * ContentChecker.pm (check_document): Check the existence
1625     of character encoding declaration and selection of encoding
1626     for HTML document.
1627    
1628     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1629    
1630 wakaba 1.122 * ContentType.pm (get_sniffed_type): Return also the official
1631     type in list context.
1632    
1633     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1634    
1635 wakaba 1.121 * ContentType.pm: Sniffing with leading white space
1636     ignoring (HTML5 revisions 1013 and 1016).
1637    
1638     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1639    
1640 wakaba 1.120 * ContentType.pm: HTML5 revision 1013 changes, except for leading
1641     white spaces, are implemented.
1642    
1643 wakaba 1.117 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1644    
1645 wakaba 1.119 * HTML.pm.src (parse_char_string): Set |inner_encoding|
1646     attribute if possible.
1647    
1648     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1649    
1650 wakaba 1.118 * HTML.pm.src (parse_byte_string): New method.
1651     (parse_char_string): New alias for |parse_string|.
1652     (main phase): Invoking "change the encoding" algorithm if desired.
1653    
1654     * HTML.pod: Updated.
1655    
1656     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1657    
1658 wakaba 1.117 * HTML.pod (get_inner_html): Removed.
1659    
1660     * Makefile (HTML-all, HTML-clean): New.
1661    
1662     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1663    
1664     * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1665    
1666 wakaba 1.116 2007-11-08 Wakaba <wakaba@suika.fam.cx>
1667    
1668     * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1669    
1670 wakaba 1.112 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1671    
1672 wakaba 1.115 * Makefile: |CacheManifest.html| is added.
1673    
1674     * CacheManifest.pod: New file.
1675    
1676     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1677    
1678 wakaba 1.114 * CacheManifest.pm: New module.
1679    
1680     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1681    
1682 wakaba 1.113 * HTML.pm.src: Support for application cache selection algorithm
1683     callback.
1684    
1685     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1686    
1687 wakaba 1.112 * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1688    
1689 wakaba 1.110 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1690    
1691 wakaba 1.111 * Makefile (clean): New rule.
1692    
1693     * NanoDOM.pm (public_id, system_id): New attributes.
1694    
1695     2007-10-17 Wakaba <wakaba@suika.fam.cx>
1696    
1697 wakaba 1.110 * Makefile (CSS-all, CSS-clean, clean): New rules.
1698    
1699 wakaba 1.109 2007-10-14 Wakaba <wakaba@suika.fam.cx>
1700    
1701     * ContentChecker.pm (check_document): Support for
1702     new |is_xml_root| flag.
1703     (check_element): Support for new |pluses| state.
1704     (_add_pluses): New method.
1705     (_remove_minuses): Support for new |minus| item.
1706    
1707 wakaba 1.106 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1708    
1709 wakaba 1.107 * ContentChecker.pm: Raise specific error for invalid
1710     root element.
1711    
1712     2007-09-24 Wakaba <wakaba@suika.fam.cx>
1713    
1714 wakaba 1.106 * ContentChecker.pm: Set level values for later uses.
1715    
1716 wakaba 1.103 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1717    
1718 wakaba 1.105 * ContentChecker.pm: Support for language tag validation.
1719    
1720     2007-09-09 Wakaba <wakaba@suika.fam.cx>
1721    
1722 wakaba 1.104 * LangTag.pm (check_rfc3066_language_tag): New method.
1723    
1724     2007-09-09 Wakaba <wakaba@suika.fam.cx>
1725    
1726 wakaba 1.103 * LangTag.pm: New module.
1727    
1728 wakaba 1.102 2007-09-04 Wakaba <wakaba@suika.fam.cx>
1729    
1730     * HTML.pm.src: Some error types were wrong.
1731    
1732 wakaba 1.100 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1733    
1734 wakaba 1.101 * CSS/: New directory.
1735    
1736     2007-08-17 Wakaba <wakaba@suika.fam.cx>
1737    
1738 wakaba 1.100 * ContentChecker.pm (_check_get_children): Support
1739     for |noscript| in |head|.
1740    
1741 wakaba 1.99 2007-08-12 Wakaba <wakaba@suika.fam.cx>
1742    
1743     * URI/: New directory.
1744    
1745 wakaba 1.94 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1746    
1747 wakaba 1.98 * HTML.pm.src: Tokenizer's states are now represented in
1748     number.
1749    
1750     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1751    
1752 wakaba 1.97 * HTML.pm.src: |or|s for insertion modes are replaced
1753     by |&|s.
1754    
1755     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1756    
1757 wakaba 1.96 * HTML.pm.src: Token types are now represented in number.
1758    
1759     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1760    
1761 wakaba 1.95 * ContentType.pm (SEE ALSO): Updated.
1762    
1763     * HTML.pm.src: Insertion modes are now represented in number.
1764    
1765     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1766    
1767 wakaba 1.94 * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1768     999) is implemented.
1769    
1770 wakaba 1.93 2007-08-08 Wakaba <wakaba@suika.fam.cx>
1771    
1772     * ContentType.pm: Sniffing for |<script| (HTML5 revision
1773     983) is implemented.
1774    
1775 wakaba 1.92 2007-08-06 Wakaba <wakaba@suika.fam.cx>
1776    
1777     * ContentChecker.pod: New documentation.
1778    
1779     * Makefile: A rule for |ContentChecker.html| is added.
1780    
1781     * ContentChecker.pm: A pod "LICENSE" section is added.
1782    
1783     * NanoDOM.pm ($VERSION): New variable.
1784    
1785 wakaba 1.90 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1786    
1787 wakaba 1.91 * H2H.pm: |b|, |i|, and |sub| are added to the
1788     list of allowed HTML elements.
1789    
1790     2007-08-05 Wakaba <wakaba@suika.fam.cx>
1791    
1792 wakaba 1.90 * H2H.pm: |samp| is added to the list of allowed
1793     HTML elements.
1794    
1795     * URIChecker.pm (check_iri): New.
1796     (check_iri_reference): Error type for IRI reference
1797     syntax error is changed.
1798    
1799 wakaba 1.87 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1800    
1801 wakaba 1.89 * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1802     is added.
1803     (check_document): Load appropriate module before validation.
1804    
1805     2007-08-04 Wakaba <wakaba@suika.fam.cx>
1806    
1807 wakaba 1.88 * ContentChecker/: New directory.
1808    
1809     2007-08-04 Wakaba <wakaba@suika.fam.cx>
1810    
1811 wakaba 1.87 * ContentChecker.pm: HTML |time| element is implemented.
1812    
1813     * HTMLTable.pm: Comments are updated as HTML5 is revised.
1814    
1815 wakaba 1.71 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1816    
1817 wakaba 1.86 * ContentChecker.pm (check_document): Return value
1818     even if no document element is found.
1819    
1820     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1821    
1822 wakaba 1.85 * HTML.pm.src: |$in_body| is no longer a function.
1823    
1824     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1825    
1826 wakaba 1.84 * HTML.pm.src: The |$in_body| code has been moved down.
1827    
1828     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1829    
1830 wakaba 1.83 * HTML.pm.src: The "trailing end" insertion mode
1831     is split into "after html body" and "after html frameset"
1832     insertion modes. Their codes are merged with "after body"
1833     and "after frameset" codes. |$previous_insertion_mode|
1834     has been removed. "after frameset" code is
1835     merged with "in frameset" code.
1836    
1837     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1838    
1839 wakaba 1.82 * HTML.pm.src: The "before head" insertion mode is
1840     merged with the "in head" insertion mode.
1841    
1842     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1843    
1844 wakaba 1.81 * HTML.pm.src: Cases in "in head" insertion mode are
1845     reorganized.
1846    
1847     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1848    
1849 wakaba 1.80 * HTML.pm.src: Some cases in "in table" insertion mode
1850     are merged.
1851    
1852     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1853    
1854 wakaba 1.79 * HTML.pm.src: The "in row" insertion mode is merged
1855     with "in table" insertion mode.
1856    
1857     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1858    
1859 wakaba 1.78 * HTML.pm.src: The "in table" and "in table body" insertion
1860     modes are merged.
1861    
1862     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1863    
1864 wakaba 1.77 * HTML.pm.src: There is no "in table head" or "in table foot"
1865     insertion mode!
1866    
1867     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1868    
1869 wakaba 1.76 * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
1870     now directly invoke the handler.
1871    
1872     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1873    
1874 wakaba 1.75 * HTML.pm.src: Codes for "in cell" insertion mode
1875     is merged to the "in body" insertion mode code.
1876    
1877     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1878    
1879 wakaba 1.74 * HTML.pm.src: Codes for "in body" and "in caption"
1880     insertion modes are merged.
1881    
1882     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1883    
1884 wakaba 1.73 * HTML.pm.src: Two |!!!next-token|s were missing.
1885    
1886     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1887    
1888 wakaba 1.72 * HTML.pm.src: Use numeric constant for |{content_mode}|
1889     instead of string constant for |{content_model_flag}|.
1890    
1891     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1892    
1893 wakaba 1.71 * HTML.pm.src: Add the name of the attribute
1894     to the "duplicate attribute" error.
1895    
1896 wakaba 1.69 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1897    
1898 wakaba 1.70 * ContentChecker.pm: Return the |class| node list.
1899    
1900     2007-07-17 Wakaba <wakaba@suika.fam.cx>
1901    
1902 wakaba 1.69 * ContentChecker.pm: Return the |id| node list.
1903    
1904     * HTML.pm.src: A typo is fixed.
1905    
1906 wakaba 1.63 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1907    
1908 wakaba 1.68 * ContentChecker.pm: Drop wrong |level => 'error'| specification
1909     from "in HTML:xml:lang" error. Character position
1910     is now the last part of the error type in the URI error
1911     description. Report "unsupported" status for language
1912     tags, media queries, script codes, and style sheets.
1913    
1914     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1915    
1916 wakaba 1.67 * ContentChecker.pm: Report error if |xml:lang|
1917     in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
1918     in XML.
1919    
1920     * NanoDOM.pm (Attr.owner_document): New attribute.
1921    
1922     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1923    
1924 wakaba 1.66 * HTML.pm.src: The character immediately following
1925     a bare |hcro| was discarded. Fix handling of
1926     entity references in attribute values.
1927    
1928     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1929    
1930 wakaba 1.65 * HTML.pm.src (main and trailin end phases): Token
1931     types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
1932     factored out. Error types |in html:#DOCTYPE|
1933     and |after html:#DOCTYPE| are merged
1934     into |DOCTYPE in the middle|. |</frameset>|
1935     in fragment parsing mode changed the
1936     insertion mode.
1937    
1938     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1939    
1940 wakaba 1.64 * HTML.pm.src: |$phase| has been removed; The |trailing end|
1941     phase is now an insertion mode. Treatments for white
1942     space character tokens were incorrect for some
1943     insertion modes. An old |meta| case was not removed.
1944    
1945     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1946    
1947 wakaba 1.63 * HTML.pm.src: |meta| charset declaration extraction
1948     implemented (but changing the encoding is not yet:-).
1949    
1950 wakaba 1.60 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1951    
1952 wakaba 1.62 * Charset/: New directory.
1953    
1954     2007-07-15 Wakaba <wakaba@suika.fam.cx>
1955    
1956 wakaba 1.61 * H2H.pm: New Perl module (created from
1957     manakai's H2H.dis).
1958    
1959     2007-07-15 Wakaba <wakaba@suika.fam.cx>
1960    
1961 wakaba 1.60 * XMLSerializer.pm: New Perl module (created from
1962     manakai's SimpleLS.dis).
1963    
1964 wakaba 1.59 2007-07-07 Wakaba <wakaba@suika.fam.cx>
1965    
1966     * HTML.pm.src: |<!---x-->| was not processed correctly.
1967    
1968 wakaba 1.57 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1969    
1970 wakaba 1.58 * HTML.pm.src: Report correct error message
1971     for |<body></div></body>|.
1972    
1973     2007-07-01 Wakaba <wakaba@suika.fam.cx>
1974    
1975 wakaba 1.57 * HTMLTable.pm: An error description was incorrect.
1976    
1977     2007-06-30 Wakaba <wakaba@suika.fam.cx>
1978    
1979     * ContentChecker.pm: Return |{term}| list.
1980    
1981 wakaba 1.55 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1982    
1983 wakaba 1.56 * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
1984     nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
1985     and <title> outside of head).
1986    
1987     2007-06-30 Wakaba <wakaba@suika.fam.cx>
1988    
1989 wakaba 1.55 * IMTChecker.pm: Report warning for unregistered
1990     and private types/subtypes.
1991    
1992     * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
1993     URIChecker.pm, HTMLTable.pm: Error messages are now
1994     consistent; they are all listed in
1995     <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
1996    
1997 wakaba 1.52 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1998    
1999 wakaba 1.54 * ContentChecker.pm: |<img ismap>| not in |<a></a>|
2000     is now erred. |<datalist>| is implemented.
2001     Attribute checker for |<command>| and |<menu>| are
2002     added. Support for |contextmenu| global attribute
2003     is added.
2004    
2005     2007-06-25 Wakaba <wakaba@suika.fam.cx>
2006    
2007 wakaba 1.53 * HTML.pm.src (_reset_insertion_mode): Interpretation
2008     of Step 3 has been changed.
2009    
2010     2007-06-25 Wakaba <wakaba@suika.fam.cx>
2011    
2012 wakaba 1.52 * HTML.pm.src: Late |<html>| parse error is implemented.
2013    
2014 wakaba 1.47 2007-06-24 Wakaba <wakaba@suika.fam.cx>
2015    
2016 wakaba 1.51 * URIChecker.pm (check_iri_reference): A |decode| method name was
2017     incorrect.
2018    
2019     * ContentChecker.pm: Support for the |footer| element.
2020     Check URI syntax for space-separated URI attributes.
2021     Support for the |tabindex| attribute. Support
2022     for |datetime| attribute.
2023    
2024     2007-06-24 Wakaba <wakaba@suika.fam.cx>
2025    
2026 wakaba 1.50 * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
2027     character references). HTML5 revision 1.146 (white space
2028     characters before root start tag). HTML5 revision
2029     1.148 (named character references in attribute values).
2030     HTML5 revision 1.152 (<plaintext>.innerHTML get).
2031    
2032     2007-06-24 Wakaba <wakaba@suika.fam.cx>
2033    
2034     * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
2035 wakaba 1.49 in <head>).
2036    
2037     2007-06-24 Wakaba <wakaba@suika.fam.cx>
2038    
2039 wakaba 1.48 * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>
2040     in body).
2041    
2042     * ContentChecker.pm: HTML5 revision 938 (scoped="").
2043    
2044     2007-06-24 Wakaba <wakaba@suika.fam.cx>
2045    
2046 wakaba 1.47 * HTML.pm.src: HTML5 revision 923 (matching end tag in CDATA
2047     or RCDATA in fragment parsing mode). HTML5 revision
2048     924 (<!--> and <!--->). HTML5 revision 926 (hn in hn).
2049    
2050     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2051    
2052     * HTML.pm.src (get_inner_html): HTML5 revision 922 (inner_html
2053     for <pre> and <textarea>).
2054    
2055 wakaba 1.33 2007-06-23 Wakaba <wakaba@suika.fam.cx>
2056    
2057 wakaba 1.46 * HTML.pm.src: HTML5 revision 920 (<isindex>).
2058    
2059     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2060    
2061 wakaba 1.45 * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).
2062    
2063     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2064    
2065 wakaba 1.44 * HTML.pm.src: HTML5 revision 916 (</body>).
2066     HTML5 revision 917 (conforming bare &).
2067    
2068     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2069    
2070 wakaba 1.43 * NanoDOM.pm (manakai_is_html): Setting to false did not work.
2071    
2072     * HTML.pm.src: HTML5 revision 914 (</ in CDATA, RCDATA).
2073     HTML5 revision 915 (<nobr>).
2074    
2075     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2076    
2077 wakaba 1.42 * HTML.pm.src: HTML5 revisions 908, 909, 912, and 913 (quirks mode).
2078    
2079     * NanoDOM.pm (manakai_is_html, manakai_compat_mode, compat_mode):
2080     New attributes.
2081    
2082     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2083    
2084 wakaba 1.41 * HTML.pm.src: HTML5 revisions 900, 901, 902, and 911 (<
2085     in tags).
2086    
2087     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2088    
2089 wakaba 1.40 * .cvsignore: |Entities.html| is added.
2090    
2091     * HTML.pm.src: |$entity_char| is removed and
2092     requires |Whatpm::_NamedEntityList| instead.
2093     HTML5 revision 898 (refc), except that lack of refc
2094     is parse error.
2095    
2096     * mkentitylist.pl: New script.
2097    
2098     * Makefile (all): |_NamedEntityList.pm| is added.
2099     (_NamedEntityList.pm, Entities.html): New rules.
2100    
2101     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2102    
2103 wakaba 1.39 * HTML.pm.src: Parse errors immediately after U+000D
2104     were ignored and U+000D immediately following another
2105     U+000D was not converted to U+000A.
2106    
2107     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2108    
2109 wakaba 1.38 * HTML.pm.src (set_inner_html): HTML5 revision 892 (adopt
2110     nodes before appended). Parser was not ready for NULL
2111     parse error and escape flag.
2112    
2113     * NanoDOM.pm (adopt_node): New.
2114    
2115     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2116    
2117 wakaba 1.37 * HTML.pm.src: HTML5 revision 886 (insane comment in
2118     CDATA and RCDATA). Note that current implementation
2119     is simply repeating what the spec says and it is maybe not
2120     a best way to do it.
2121    
2122     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2123    
2124 wakaba 1.36 * HTML.pm.src: HTML5 revision 884 (</form> don't close
2125     the form element if a descendant element without implied end tag has
2126     still been open).
2127    
2128     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2129    
2130 wakaba 1.35 * ContentChecker.pm: HTML5 revision 881 (Make |id| attribute with
2131     space characters non-conforming).
2132    
2133     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2134    
2135 wakaba 1.34 * HTML.pm.src: An error message was incorrect.
2136     HTML5 revision 869 (C1 character references).
2137    
2138     2007-06-23 Wakaba <wakaba@suika.fam.cx>
2139    
2140 wakaba 1.33 * HTML.pm.src: HTML5 revision 867 (a LF at the beginning of
2141     a |textarea| is removed).
2142    
2143 wakaba 1.32 2007-06-05 Wakaba <wakaba@suika.fam.cx>
2144    
2145     * NanoDOM.pm (get_attribute_node_ns): New method.
2146    
2147     * ContentChecker.pm: |script| |async| and |defer|
2148     no longer require |src|. |async| MUST NOT be
2149     specified if |defer|. (HTML5 revision 858).
2150    
2151 wakaba 1.31 2007-05-30 Wakaba <wakaba@suika.fam.cx>
2152    
2153     * HTML.pm.src: |<form><form>| went to inifinite loop.
2154    
2155 wakaba 1.27 2007-05-27 Wakaba <wakaba@suika.fam.cx>
2156    
2157 wakaba 1.30 * ContentChecker.pm (html): Set |is_root| (allowed
2158     as a document element) flag on.
2159     (new): Removed.
2160     (check_document): New method.
2161    
2162     2007-05-27 Wakaba <wakaba@suika.fam.cx>
2163    
2164 wakaba 1.29 * ContentChecker.pm (thead, tfoot): Checker specifications
2165     were incorrect.
2166    
2167     2007-05-27 Wakaba <wakaba@suika.fam.cx>
2168    
2169 wakaba 1.28 * ContentChecker.pm ($HTMLURIAttrChecker): Include
2170     error position in the |type| option of the error.
2171    
2172     * HTMLTable.pm (form_table): The |$onerror| parameter
2173     is now optional. Some bugs are fixed.
2174    
2175     2007-05-27 Wakaba <wakaba@suika.fam.cx>
2176    
2177 wakaba 1.27 * HTMLTable.pm: New module.
2178    
2179     * ContentChecker.pm (table): Invoke table model error checker.
2180    
2181     * NanoDOM.pm (first_child, get_attribute_ns): New.
2182    
2183 wakaba 1.25 2007-05-26 Wakaba <wakaba@suika.fam.cx>
2184    
2185 wakaba 1.26 * ContentChecker.pm ($HTMLLinkTypesAttrChecker): New checker.
2186     (link/@rel, a/@rel, area/@rel): Use new checker.
2187    
2188     * Makefile (_LinkTypeList.pm, RelExtensions.html): New rules.
2189    
2190     * _LinkTypeList.pm: New file.
2191    
2192     * mklinktypelist.pl: New file.
2193    
2194     * .cvsignore: |RelExtensions.html| added.
2195    
2196     * NanoDOM.pm (child_nodes): Returns an empty array
2197     for non-child-containing node types.
2198     (text_content): New attribute.
2199    
2200     2007-05-26 Wakaba <wakaba@suika.fam.cx>
2201    
2202 wakaba 1.25 * IMTChecker.pm: New module.
2203    
2204     * ContentChecker.pm ($HTMLIMTAttrChecker): Call IMTChecker
2205     to test parameter value validity.
2206    
2207     * HTML.pm.src ($style_start_tag): Attributes were
2208     discarded.
2209    
2210 wakaba 1.22 2007-05-25 Wakaba <wakaba@suika.fam.cx>
2211    
2212 wakaba 1.24 * ContentChecker.pm ($HTMLURIAttrChecker): Implemented.
2213    
2214     2007-05-25 Wakaba <wakaba@suika.fam.cx>
2215    
2216 wakaba 1.23 * URIChecker.pm: All recommendations from RFC 3986
2217     and RFC 3987 are listed (not all testable items are checked yet).
2218    
2219     2007-05-25 Wakaba <wakaba@suika.fam.cx>
2220    
2221 wakaba 1.22 * URIChecker.pm: New module.
2222    
2223 wakaba 1.18 2007-05-20 Wakaba <wakaba@suika.fam.cx>
2224    
2225 wakaba 1.21 * ContentChecker.pm: Now most attributes are implemented
2226     or associated to some placeholder.
2227     ($ElementDefault): Warn unknown attributes
2228     for unknown elements as "attribute not supported".
2229     ($HTMLLanguageTagAttrChecker, $HTMLMQAttrChecker): New placeholders.
2230     ($HTMLUsemapAttrChecker, $HTMLTargetAttrChecker): New checkers.
2231     (|a| attribute checker): Reimplemented.
2232    
2233     2007-05-20 Wakaba <wakaba@suika.fam.cx>
2234    
2235 wakaba 1.20 * ContentChecker.pm ($HTMLEventHandlerAttrChecker): New placeholder.
2236     ($HTMLAttrChecker): Event handler content attributes
2237     are added.
2238     (link, embed): Required attribute is now checked.
2239     (embed): Unknown local attributes are no longer warned.
2240    
2241     2007-05-20 Wakaba <wakaba@suika.fam.cx>
2242    
2243 wakaba 1.19 * ContentChecker.pm ($HTMLSpaceURIsAttrChecker): New placeholder.
2244     ($HTMLIMTAttrChecker): New checker.
2245     (link@rel, link@href, link@type, style@type,
2246     a@href, a@ping, a@ping, a@type, embed@src, embed@type,
2247     object@data, object@type, source@src, source@type, area@alt,
2248     area@shape, area@coords, area@href,
2249     area@ping, area@rel, area@type, script@src,
2250     script@defer, script@async, script@type): Checkers added.
2251    
2252     2007-05-20 Wakaba <wakaba@suika.fam.cx>
2253    
2254 wakaba 1.18 * ContentChecker.pm: Descendant checking was incorrect.
2255    
2256 wakaba 1.12 2007-05-19 Wakaba <wakaba@suika.fam.cx>
2257    
2258 wakaba 1.17 * ContentChecker.pm: Support |xml:*| and |xmlns:*|
2259     attributes. Report an error if |Element.prefix|
2260     is |xmlns|.
2261    
2262     * NanoDOM.pm (prefix): New attribute.
2263    
2264     2007-05-19 Wakaba <wakaba@suika.fam.cx>
2265    
2266 wakaba 1.16 * HTML.pm.src: In |main| phase, |in body| insertion
2267     mode, action for |<iframe>| was missing.
2268    
2269     2007-05-19 Wakaba <wakaba@suika.fam.cx>
2270    
2271 wakaba 1.15 * ContentChecker.pm: Support for many of HTML5 elements.
2272     ($GetHTMLNonNegativeIntegerAttrChecker): New.
2273    
2274     2007-05-19 Wakaba <wakaba@suika.fam.cx>
2275    
2276 wakaba 1.14 * ContentChecker.pm: Support for most elements up to |progress|.
2277     ($HTMLURIAttrChecker): Placeholder.
2278     ($HTMLIntegerAttrChecker, $GetHTMLFloatingPointNumberAttrChecker):
2279     New.
2280    
2281     2007-05-19 Wakaba <wakaba@suika.fam.cx>
2282    
2283 wakaba 1.13 * ContentChecker.pm: Attribute checkers
2284     for global attributes, |html|, |base|, |style|, and |meta|.
2285    
2286     * NanoDOM.pm (insert_before): Weaken reference
2287     to the parent node.
2288     (Attr::new): Set |owner_element| attribute.
2289     (namespace_uri, manakai_local_name): New attribute implementations.
2290     (owner_element): New attribute.
2291    
2292     2007-05-19 Wakaba <wakaba@suika.fam.cx>
2293    
2294 wakaba 1.12 * ContentChecker.pm ($AttrChecker, $HTMLAttrChecker,
2295     $AnyChecker->{attr_checker}, $HTMLAttrsChecker,
2296     $Element->{$HTML_NS}->{''}): New.
2297     (check_element): Invoke attrs_checker for each element.
2298    
2299 wakaba 1.6 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2300    
2301 wakaba 1.11 * ContentChecker.pm: Don't use |manakai_element_type_match|.
2302    
2303     2007-05-13 Wakaba <wakaba@suika.fam.cx>
2304    
2305 wakaba 1.10 * ContentChecker.pm: Use hashs rather than lists for
2306     element type testings.
2307    
2308     2007-05-13 Wakaba <wakaba@suika.fam.cx>
2309    
2310 wakaba 1.9 * ContentChecker.pm: Don't generate duplicate
2311     error when an element type is put in the "minus" list
2312     and the element type is not allowed explicitly in the particular
2313     element content model.
2314     (html:a checker): New checker.
2315     (html:details, html:datagrid): New checkers.
2316     (html:legend): New checker.
2317    
2318     2007-05-13 Wakaba <wakaba@suika.fam.cx>
2319    
2320 wakaba 1.8 * ContentChecker.pm (html:li checker): Implemented.
2321    
2322     2007-05-13 Wakaba <wakaba@suika.fam.cx>
2323    
2324 wakaba 1.7 * ContentChecker.pm ($HTMLInlineOrStriclyInlineChecker): New
2325     checker.
2326     (html:dd checker): New checker.
2327     (html:q, html:em, html:strong, html:small,
2328     html:m, html:dfn, html:code, html:samp, html:span): New checkers.
2329    
2330     2007-05-13 Wakaba <wakaba@suika.fam.cx>
2331    
2332 wakaba 1.6 * ContentChecker.pm ($AnyChecker): Renamed
2333     from |$ElementDefault->{checker}|.
2334     ($ElementDefault->{checker}): Throw an error that
2335     the element type is not supported by the checker.
2336     ($HTMLMetadataElement): |html:base| was missing.
2337     ($HTMLEmptyChecker): Don't throw an error
2338     for inter-element whitespace nodes.
2339     (html:html checker): Errors were not
2340     thrown even if |html:head| and/or |html:body|
2341     children were missing.
2342     (html:head checker): An error was not
2343     thrown if <meta charset> appered after other
2344     elements.
2345    
2346 wakaba 1.5 2007-05-05 Wakaba <wakaba@suika.fam.cx>
2347    
2348     * ContentChecker.pm: |footer|, |video|, |audio|, |script|,
2349     and |noscript| elements are implemented.
2350     (new): New method.
2351    
2352 wakaba 1.3 2007-05-04 Wakaba <wakaba@suika.fam.cx>
2353    
2354 wakaba 1.4 * ContentChecker.pm: New module.
2355    
2356     2007-05-04 Wakaba <wakaba@suika.fam.cx>
2357    
2358 wakaba 1.3 * NanoDOM.pm (manakai_parent_element,
2359     document_element, manakai_local_name,
2360     manakai_element_type_match): New method.
2361    
2362     2007-05-03 Wakaba <wakaba@suika.fam.cx>
2363    
2364     * HTML.pm.src: Replace decimal and hexadecimal numeric
2365     entities in C1 range using Windows-1252 mapping. Bare LF
2366     did not count as new line for error reporting.
2367    
2368 wakaba 1.2 2007-05-02 Wakaba <wakaba@suika.fam.cx>
2369    
2370     * NanoDOM.pm (DOMImplementation): New class.
2371     (append_child): Weaken the |parent_node| reference.
2372     (create_element_ns, Element new): Set the |owner_document|
2373     reference.
2374     (implementation): New attribute.
2375     (owner_document, local_name, namespace_uri): New attributes.
2376    
2377     * HTML.pm.src (parse_string): Line and column numbers
2378     are now provided to error handler.
2379     (!!!parse-error): Short descriptions are added.
2380     (_construct_tree): Split into three methods; support
2381     for innerHTML mode.
2382     (set_inner_html): New method.
2383    
2384 wakaba 1.1 2007-05-01 Wakaba <wakaba@suika.fam.cx>
2385    
2386     * NanoDOM.html: Documentation is added.
2387    
2388     * HTML.pod, ContentType.html: Documentation is revised.
2389    
2390     * .cvsignore: Pod2html temporary files are added.
2391    
2392     * Makefile: Make |NanoDOM.html|.
2393    
2394     2007-05-01 Wakaba <wakaba@suika.fam.cx>
2395    
2396     * HTML.pm.src (parse_string): New method.
2397     (get_inner_html): Renamed from |inner_html|.
2398    
2399     * Makefile: A rule for |HTML.html| is added.
2400    
2401     * HTML.pod: New documentation.
2402    
2403     2007-05-01 Wakaba <wakaba@suika.fam.cx>
2404    
2405     * NanoDOM.pm (last_child, previous_sibling): New attributes.
2406     (clone_node): Attribute nodes were not completely copied.
2407    
2408     * HTML.pm.src: Many bugs are fixed.
2409    
2410     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2411    
2412     * HTML.pm.src: Some typos are fixed.
2413    
2414     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2415    
2416     * mkhtmlparser.pl, Makefile: References to the |HTML-consume-entity.src|
2417     are removed.
2418    
2419     * HTML.pm.src: Tokenizer's handling on named entities are rewritten.
2420    
2421     * HTML-consume-entity.src: Removed.
2422    
2423     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2424    
2425     * HTML.pm.src: Tokenizer's handling on hexadecimal
2426     numeric entities are rewritten.
2427    
2428     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2429    
2430     * HTML.pm.src: Some tokenizer bugs are fixed.
2431    
2432     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2433    
2434     * HTML.pm.src: The tree construction stage is implemented.
2435    
2436     * mkhtmlparser.pl: New macros are added.
2437    
2438     2007-04-28 Wakaba <wakaba@suika.fam.cx>
2439    
2440     * ContentType.pm: A note on bug in the specification
2441     is removed since it's been now fixed.
2442    
2443     * .cvsignore: New file.
2444    
2445     2007-04-28 Wakaba <wakaba@suika.fam.cx>
2446    
2447     * HTML.pm.src, HTML-consume-entity.src: New files.
2448    
2449     * Makefile (HTML.pm): New rule.
2450    
2451     * mkhtmlparser.pl: New script.
2452    
2453     2007-04-25 Wakaba <wakaba@suika.fam.cx>
2454    
2455     * Makefile: New file.
2456    
2457     2007-04-24 Wakaba <wakaba@suika.fam.cx>
2458    
2459     * ContentType.pm: An error in pod is fixed.
2460    
2461     2007-04-24 Wakaba <wakaba@suika.fam.cx>
2462    
2463     * ContentType.pm: Documentation is added.
2464    
2465     2007-04-24 Wakaba <wakaba@suika.fam.cx>
2466    
2467     * ContentType.pm: New Perl module.
2468    
2469     * ChangeLog: New file.
2470    
2471    

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24