/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Contents of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.380 - (show annotations) (download)
Sun Aug 16 03:41:52 2009 UTC (15 years, 2 months ago) by wakaba
Branch: MAIN
Changes since 1.379: +6 -0 lines
++ whatpm/t/ChangeLog	16 Aug 2009 03:41:40 -0000
2009-08-16  Wakaba  <wakaba@suika.fam.cx>

	* tree-test-form.dat: Added test cases for <select> in "in select"
	parsing modes.

++ whatpm/Whatpm/ChangeLog	16 Aug 2009 03:41:19 -0000
2009-08-16  Wakaba  <wakaba@suika.fam.cx>

	* HTML.pm.src (_tree_construction_main): Improved error reporting
	on "select" start tag in "in select" and "in select in table"
	parsing modes (c.f. HTML5 revision 3604).

1 2009-08-16 Wakaba <wakaba@suika.fam.cx>
2
3 * HTML.pm.src (_tree_construction_main): Improved error reporting
4 on "select" start tag in "in select" and "in select in table"
5 parsing modes (c.f. HTML5 revision 3604).
6
7 2009-07-26 Wakaba <wakaba@suika.fam.cx>
8
9 * mkhtmlclass.pl: Made xmlns:xlink with wrong namespace URL
10 invalid (HTML5 revision 2701).
11
12 2009-07-26 Wakaba <wakaba@suika.fam.cx>
13
14 * HTML.pm.src: </rt>, </rp>, </option>, </optgroup> become
15 optional before </body> (HTML5 revision 2806).
16
17 2009-07-25 Wakaba <wakaba@suika.fam.cx>
18
19 * HTML.pm.src: Marked where SVG |script| execution should be
20 implemented (HTML5 revision 2904).
21
22 2009-07-25 Wakaba <wakaba@suika.fam.cx>
23
24 * HTML.pm.src (set_inner_html): The fragment parsing algorithm
25 inherits the document quirksness flag of the main document (HTML5
26 revision 2935).
27
28 2009-07-05 Wakaba <wakaba@suika.fam.cx>
29
30 * HTML.pm.src (_tree_construction_main): Put |table| in |p| in
31 quirks mode (The Hixie Quirk, HTML5 revision 3132).
32
33 2009-07-03 Wakaba <wakaba@suika.fam.cx>
34
35 * HTML.pm.src: <kegten> breaks out of <select> (HTML5 revision
36 3163).
37
38 2009-07-03 Wakaba <wakaba@suika.fam.cx>
39
40 * HTML.pm.src: Don't change encoding to UTF-8 if the input
41 encoding is UTF-16 (HTML5 revision 3204).
42
43 2009-07-03 Wakaba <wakaba@suika.fam.cx>
44
45 * HTML.pm.src (parse_byte_stream): Don't change to another
46 encoding if the input is UTF-16 (HTML5 revision 3205).
47
48 2009-06-28 Wakaba <wakaba@suika.fam.cx>
49
50 * HTML.pm.src: |isindex| should no longer generate |p| element
51 node (HTML5 revision 3236).
52
53 2008-12-12 Wakaba <wakaba@suika.fam.cx>
54
55 * ContentChecker.pm: Introduced new flag |has_autofocus|. Added
56 support for new |id_type| value of |any|, which means any kind of
57 element is allowed as a referenced element.
58
59 2008-12-12 Wakaba <wakaba@suika.fam.cx>
60
61 * ContentChecker.pm: Introduced |check_attrs2| method for the ease
62 of defining codes for checking required attribute and as such.
63
64 2008-12-12 Wakaba <wakaba@suika.fam.cx>
65
66 * IMTChecker.pm: Added more definitions for subtypes.
67
68 2008-12-11 Wakaba <wakaba@suika.fam.cx>
69
70 * URIChecker.pm: Some of |pos_end| values were wrong.
71
72 2008-12-06 Wakaba <wakaba@suika.fam.cx>
73
74 * ContentChecker.pm (check_element): Added support for "no
75 referenced datalist" error.
76
77 2008-12-06 Wakaba <wakaba@suika.fam.cx>
78
79 * URIChecker.pm: Bug fix: It did not work unless
80 Message::DOM::DOMImplementation has been |require|d.
81
82 2008-12-06 Wakaba <wakaba@suika.fam.cx>
83
84 * NanoDOM.pm (document_uri): New attribute.
85
86 * ContentChecker.pm: Don't use methods not implemented by NanoDOM.
87
88 2008-11-07 Wakaba <wakaba@suika.fam.cx>
89
90 * NanoDOM.pm (text_content): Don't create a Text node if the new
91 value is empty.
92
93 2008-11-06 Wakaba <wakaba@suika.fam.cx>
94
95 * SWML/: New directory.
96
97 2008-10-20 Wakaba <wakaba@suika.fam.cx>
98
99 * NanoDOM.pm (specified, all_declarations_processed,
100 manakai_attribute_type): New attributes.
101
102 2008-10-19 Wakaba <wakaba@suika.fam.cx>
103
104 * NanoDOM.pm (Entity->new): Initialize ->child_nodes as an empty
105 array.
106
107 2008-10-19 Wakaba <wakaba@suika.fam.cx>
108
109 * NanoDOM.pm (notation_name): New attribute.
110
111 2008-10-18 Wakaba <wakaba@suika.fam.cx>
112
113 * NanoDOM.pm (public_id, system_id): New attributes.a
114
115 2008-10-18 Wakaba <wakaba@suika.fam.cx>
116
117 * NanoDOM.pm (text_content): Moved to Node from Element. Setter
118 implemented.
119 (allowed_tokens, default_type, declared_type): Implemented.
120
121 2008-10-17 Wakaba <wakaba@suika.fam.cx>
122
123 * NanoDOM.pm (node_name): New attribute.
124 (ELEMENT_TYPE_DEFINITION_NODE, ATTRIBUTE_DEFINITION_NODE): New
125 constants.
126 (create_element_type_definition_node, create_attribute_definition,
127 create_notation, create_general_entity,
128 get_element_type_definition_node,
129 set_element_type_definition_node, get_general_entity_node,
130 set_general_entity_node, get_notation_node, set_notation_node,
131 get_attribute_definition_node, set_attribute_definition_node): New
132 methods.
133 (element_types, entities, notations, attribute_definitions): New
134 attributes.
135 (DocumentType): Support for child nodes, entities, notations, and
136 element types.
137 (Entity, Notation, ElementTypeDefinition, AttributeDefinition):
138 New classes.
139
140 * Dumper.pm: Support for general entities, notations, element type
141 definitions, and attribute definitions.
142
143 2008-10-15 Wakaba <wakaba@suika.fam.cx>
144
145 * NanoDOM.pm (create_processing_instruction): New method.
146 (xml_version, xml_encoding, xml_standalone): New attributes.
147 (ProcessingInstruction): New class.
148
149 2008-10-14 Wakaba <wakaba@suika.fam.cx>
150
151 * HTML.pm.src: Handling of end tags in the foreign content
152 insertion mode was partially wrong, because of wrong bit
153 operations.
154
155 2008-10-14 Wakaba <wakaba@suika.fam.cx>
156
157 * NanoDOM.pm (dom_config): New attribute (do nothing), for
158 Whatpm::XML::Parser support.
159
160 2008-10-14 Wakaba <wakaba@suika.fam.cx>
161
162 * Makefile: New rule to make HTML/Tokenizer.pm is added.
163
164 * HTML.pm.src: Tokenizer part moved to another file.
165
166 2008-10-13 Wakaba <wakaba@suika.fam.cx>
167
168 * HTML.pm.src: Merge |DT_EL| and |DD_EL| as |DTDD_EL|.
169
170 2008-10-13 Wakaba <wakaba@suika.fam.cx>
171
172 * HTML.pm.src: Element category constants redefined.
173
174 2008-10-13 Wakaba <wakaba@suika.fam.cx>
175
176 * HTML.pm.src: Steps for CDATA/RCDATA elements in tree
177 construction stage synced with the spec (HTML5 revisions 2139 and
178 2302).
179
180 2008-10-07 Wakaba <wakaba@suika.fam.cx>
181
182 * ContentChecker.pm: New error level "html5_fact" added, which
183 should be tentatively used until all of requirements are properly
184 specced as RFC 2119 "MUST" in HTML5.
185
186 2008-10-05 Wakaba <wakaba@suika.fam.cx>
187
188 * ContentChecker.pod: Note on internal flags is added.
189
190 2008-10-05 Wakaba <wakaba@suika.fam.cx>
191
192 * HTML.pm.src: An AAA bug fixed.
193
194 2008-10-04 Wakaba <wakaba@suika.fam.cx>
195
196 * HTML.pm.src: If another node is inserted by the parser, don't
197 reuse existing Text node to append a character (HTML5 revision
198 2124).
199
200 2008-10-04 Wakaba <wakaba@suika.fam.cx>
201
202 * HTML.pm.src: Support for <option> and <optgroup> in body (HTML5
203 revisions 1731 and 2128).
204
205 2008-10-04 Wakaba <wakaba@suika.fam.cx>
206
207 * HTML.pm.src: Make <foreignObject> scoping (HTML5 revision 1837).
208 Support for end tags of camelCase SVG elements were broken. A
209 wrong error type text fixed.
210
211 2008-10-04 Wakaba <wakaba@suika.fam.cx>
212
213 * HTML.pm.src: Drop redundant code (HTML5 revision 1731).
214
215 2008-10-04 Wakaba <wakaba@suika.fam.cx>
216
217 * HTML.pm.src: Support for new definition of |param| and |source|
218 start tag parsing (HTML5 revision 1731).
219
220 2008-10-04 Wakaba <wakaba@suika.fam.cx>
221
222 * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
223
224 2008-10-04 Wakaba <wakaba@suika.fam.cx>
225
226 * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
227 revisions 1731 and 1831).
228
229 2008-10-04 Wakaba <wakaba@suika.fam.cx>
230
231 * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
232 revisions 1731 and 1778). Support for the </sarcasm> end tag (HTML5
233 revision 1731).
234
235 2008-10-04 Wakaba <wakaba@suika.fam.cx>
236
237 * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
238 revision 1731). End tags of |option| and |optgroup| elements are
239 now optional (HTML5 revision 1731).
240
241 2008-10-04 Wakaba <wakaba@suika.fam.cx>
242
243 * HTML.pm.src: New "special" elements added to the list (HTML5
244 revision 1778). "strile" -> "strike".
245
246 2008-10-02 Wakaba <wakaba@suika.fam.cx>
247
248 * ContentType.pm (get_sniffed_type): Support for the "better"
249 content sniffing (HTML5 revision 1927). In a case the official
250 type was not returned when the method is invoked in the list
251 context.
252
253 2008-09-22 Wakaba <wakaba@suika.fam.cx>
254
255 * HTML.pm.src: Character references for non-space C0 characters,
256 including U+000B VT, DEL character, noncharacter code points, are
257 now converted to the U+FFFD character (cf. HTML5 revision 2138).
258
259 2008-09-21 Wakaba <wakaba@suika.fam.cx>
260
261 * ContentChecker.pm: |form=""| check support added.
262
263 2008-09-21 Wakaba <wakaba@suika.fam.cx>
264
265 * ContentChecker.pm: |contextmenu| validness is now checked using
266 |id| and |id_type| properties, and |menu| property is removed.
267
268 2008-09-21 Wakaba <wakaba@suika.fam.cx>
269
270 * ContentChecker.pm: Prepare for |form| |name| attribute's
271 duplication checking.
272
273 2008-09-21 Wakaba <wakaba@suika.fam.cx>
274
275 * HTML.pm.src (parse_byte_stream): Support (or non-support) for
276 unsupported charset="" parameter value (HTML5 revision 2131).
277
278 2008-09-20 Wakaba <wakaba@suika.fam.cx>
279
280 * HTML.pm.src: Reminding places where U+000B is allowed as a space
281 character is fixed (cf. HTML5 revision 1738).
282
283 * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
284 space characters (HTML5 revision 1738).
285
286 2008-09-20 Wakaba <wakaba@suika.fam.cx>
287
288 * HTML.pm.src: The "anything else" case for the "after after body"
289 insertion mode was not updated to swtich to the "in body"
290 insertion mode. U+000B is no longer a space character for the
291 purpose of tree construction phase (HTML5 revision 1738).
292
293 2008-09-20 Wakaba <wakaba@suika.fam.cx>
294
295 * HTML.pm.src: U+000B is no longer a space character (HTML5
296 revision 1738).
297
298 2008-09-20 Wakaba <wakaba@suika.fam.cx>
299
300 * ContentType.pm: 0x0B is no longer a space character (HTML5
301 revision 1738).
302
303 * HTML.pm.src: U+000B is no longer a space character for the
304 algorithm for extracting an encoding from a Content-Type (HTML5
305 revision 1738).
306
307 2008-09-20 Wakaba <wakaba@suika.fam.cx>
308
309 * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
310
311 2008-09-18 Wakaba <wakaba@suika.fam.cx>
312
313 * LangTag.pm: Add checks for remaining requirements from RFC 4646.
314
315 * mklangreg.pl: Sort 'Prefix' values by their length, to ease
316 matching.
317
318 2008-09-18 Wakaba <wakaba@suika.fam.cx>
319
320 * LangTag.pm: Warn for private use language subtags. Error level
321 typos fixed. Support for Suppress-Script field.
322
323 * mklangreg.pl: Support for dumping of nested structure.
324
325 2008-09-18 Wakaba <wakaba@suika.fam.cx>
326
327 * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
328 recommended case as per RFC 4646.
329
330 2008-09-18 Wakaba <wakaba@suika.fam.cx>
331
332 * LangTag.pm (check_rfc4646_langtag): New method.
333
334 2008-09-18 Wakaba <wakaba@suika.fam.cx>
335
336 * mklangreg.pl: New script.
337
338 * Makefile: Updated for creation of the module for language subtag
339 registry.
340
341 2008-09-16 Wakaba <wakaba@suika.fam.cx>
342
343 * Makefile: WebIDL.html added.
344
345 * WebIDL.pod: New documentation.
346
347 2008-09-16 Wakaba <wakaba@suika.fam.cx>
348
349 * WebIDL.pm: Checker's error types are redefined.
350
351 2008-09-16 Wakaba <wakaba@suika.fam.cx>
352
353 * WebIDL.pm: Parser's error types are redefined. Some forward
354 compatible parsing bugs are fixed. Some unreachable codes are
355 commented out.
356
357 2008-09-16 Wakaba <wakaba@suika.fam.cx>
358
359 * WebIDL.pm: Support for the reminding extended attributes are
360 added. It does not satisfy the definition that a forward
361 interface declaration has an extended attribute. It seems that
362 unless explicitly allowed multiple extended attributes with the
363 same name is not allowed, though it is not explicitly mentioned in
364 the spec.
365
366 2008-09-16 Wakaba <wakaba@suika.fam.cx>
367
368 * WebIDL.pm: Unescapes extended attribute names and extended
369 attribute identifiers. Preserve whether an extended attribute has
370 an argument list of not. Support for extended attributes:
371 Constructor, ExceptionConsts, IndexGetter, IndexSetter,
372 NameGetter, NameSetter, and Null.
373 (has_argument_list): New attribute.
374 (idl_text): Stringifies argument lists, if any, even if it is
375 empty.
376
377 2008-09-15 Wakaba <wakaba@suika.fam.cx>
378
379 * HTML.pm.src: New state |PCDATA_STATE|. Use an empty string for
380 |{s_kwd}| in DATA_STATE as default.
381
382 2008-09-15 Wakaba <wakaba@suika.fam.cx>
383
384 * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
385 by |{s_kwd}| in DATA_STATE.
386
387 2008-09-15 Wakaba <wakaba@suika.fam.cx>
388
389 * HTML.pm.src: Shorten keys.
390
391 2008-09-15 Wakaba <wakaba@suika.fam.cx>
392
393 * HTML.pm.src: Remove checking for control character, surrogate
394 pair, or noncharacter code points and non-Unicode code
395 points (they should be handled by Whatpm::Charset::UnicodeChecker).
396 (parse_char_stream): Support for the |$get_wrapper| argument and
397 character stream error handlers.
398
399 2008-09-15 Wakaba <wakaba@suika.fam.cx>
400
401 * ContentChecker.pm: Don't call |loda_ns_module|
402 for null-namespace elements/attributes.
403
404 * HTML.pm.src: Fact out $disallowed_control_chars
405 as a hash.
406
407 2008-09-14 Wakaba <wakaba@suika.fam.cx>
408
409 * HTML.pm.src: Regexp typo fixed. |{prev_char}|
410 and |{next_char}| initializations are moved to initialization
411 method. |{read_until}| now supports buffering. Sync |set_inner_html|
412 with |parse_char_stream|.
413
414 2008-09-14 Wakaba <wakaba@suika.fam.cx>
415
416 * HTML.pm.src (parse_char_stream): Make |set_next_char|
417 invoke |manakai_read_until|, not only |read|, where
418 possible, to decrease the number of |read| method calls.
419
420 * mkhtmlparser.pl: Related changes to the aforementioned
421 modification.
422
423 2008-09-14 Wakaba <wakaba@suika.fam.cx>
424
425 * HTML.pm.src: Use |read| instead of |getc|. |set_inner_html|
426 would report character error from now.
427
428 2008-09-14 Wakaba <wakaba@suika.fam.cx>
429
430 * HTML.pm.src: White-space-leaded non-white-space character
431 tokens in "before head insertion mode" was not
432 correctly handled.
433 (set_inner_html): Reimplemented using CharString decodehandle
434 class. Support for $get_wrapper argument. Support
435 for |{read_until}| feature.
436
437 2008-09-14 Wakaba <wakaba@suika.fam.cx>
438
439 * HTML.pm.src: Make a "bare ero" error for unknown
440 entities point the "&" character.
441
442 2008-09-14 Wakaba <wakaba@suika.fam.cx>
443
444 * HTML.pm.src: It turns out that U+FFFD don't have to
445 be added to the list of excluded characters.
446
447 2008-09-14 Wakaba <wakaba@suika.fam.cx>
448
449 * HTML.pm.src ($char_onerror): Have character decoder's |line|
450 and |column| a higher priority than the one set by the
451 tokenizer's input handler.
452 ($self->{read_until}): Exclude U+FFFD (but this might
453 not be necessary, since now we do line/column fixup in
454 the character decode handle).
455
456 2008-09-14 Wakaba <wakaba@suika.fam.cx>
457
458 * HTML.pm.src: Use |{read_until}| where possible.
459
460 2008-09-14 Wakaba <wakaba@suika.fam.cx>
461
462 * HTML.pm.src: Change |{getc_until}| to |{read_until}|
463 and |manakai_getc_until| to |manakai_read_until| to
464 reduce the number of string copies.
465
466 2008-09-14 Wakaba <wakaba@suika.fam.cx>
467
468 * HTML.pm.src (parse_char_string): Use newly created
469 |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
470 standard feature to |open| a string as a filehandle,
471 since Perl's string filehandle seems not supporting |ungetc|
472 method correctly.
473 (parse_char_stream): Define |{getc_until}| method.
474 (DATA_STATE): Experimental support for |getc_until| feature.
475
476 2008-09-13 Wakaba <wakaba@suika.fam.cx>
477
478 * HTML.pm.src: Check points added to newly added branches.
479
480 2008-09-13 Wakaba <wakaba@suika.fam.cx>
481
482 * HTML.pm.src: Remove |{char}|, which is no longer used.
483 Remove |{entity_in_attr}| and |{last_attribute_value_state}|
484 and replaced by |{prev_state}|.
485
486 * mkhtmlparser.pl: Remove |{char}| feature.
487 Remove |!!!back-next-input-character;| macro.
488
489 2008-09-13 Wakaba <wakaba@suika.fam.cx>
490
491 * HTML.pm.src: Finally we get rid of all the inner loops. Remove
492 entity related tokenizer states in favor of new states
493 implementing the consume character reference algorithm.
494
495 2008-09-13 Wakaba <wakaba@suika.fam.cx>
496
497 * HTML.pm.src: "Consume a character reference" algorithm is
498 now implemented as a tokenizer's state, rather than
499 a method, with minimum changes (more changes will
500 be made, in due course). "Bogus comment state"'s inner
501 loop gets removed.
502
503 2008-09-13 Wakaba <wakaba@suika.fam.cx>
504
505 * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
506 into their own tokenizer states.
507
508 2008-09-13 Wakaba <wakaba@suika.fam.cx>
509
510 * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
511 is split into three states.
512
513 2008-09-13 Wakaba <wakaba@suika.fam.cx>
514
515 * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
516 itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
517 no longer does the tokenizer have to push back next input
518 characters in those states.
519
520 2008-09-13 Wakaba <wakaba@suika.fam.cx>
521
522 * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
523 into four states so that no longer does the tokenizer have to push
524 back next input characters in that state.
525
526 2008-09-11 Wakaba <wakaba@suika.fam.cx>
527
528 * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
529 which can be used to insert some wrapper between the character
530 stream handle and the tokenizer. (It is currently not supported
531 for |set_inner_html| for |Element|s).
532
533 2008-09-10 Wakaba <wakaba@suika.fam.cx>
534
535 * HTML.pm.src: Ignore punctuations in charset names.
536
537 2008-09-10 Wakaba <wakaba@suika.fam.cx>
538
539 * ContentChecker.pm: Support for charset-layer error levels.
540
541 * HTML.pm.src: Don't specify |text| argument for the
542 |chardecode:fallback| error, since it is not the encoding
543 being used alternatively.
544
545 2008-09-06 Wakaba <wakaba@suika.fam.cx>
546
547 * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
548
549 2008-08-31 Wakaba <wakaba@suika.fam.cx>
550
551 * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
552
553 2008-08-31 Wakaba <wakaba@suika.fam.cx>
554
555 * HTML.pm.src: Bug fix and sync with the spec with regard
556 to after after frameset insertion mode processing (HTML5
557 revision 1909). Note that the implementation was wrong
558 per the old spec before the r1909 changes.
559
560 2008-08-30 Wakaba <wakaba@suika.fam.cx>
561
562 * HTMLTable.pm: scope=auto algorithm fix synced with the
563 spec (HTML5 revision 2093).
564 ($process_row): Algorithm step numbers synced with the
565 spec (HTML5 revision 2092).
566
567 2008-08-30 Wakaba <wakaba@suika.fam.cx>
568
569 * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
570 revision 2094).
571
572 2008-08-30 Wakaba <wakaba@suika.fam.cx>
573
574 * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
575
576 2008-08-30 Wakaba <wakaba@suika.fam.cx>
577
578 * HTML.pm.src: '"' and "'" at the end of attribute
579 name (after another attribute) now raise parse error (HTML5
580 revision 2123). Empty unquoted attribute values are no
581 longer allowed (HTML5 revision 2122).
582
583 2008-08-30 Wakaba <wakaba@suika.fam.cx>
584
585 * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
586 revision 2130).
587
588 2008-08-30 Wakaba <wakaba@suika.fam.cx>
589
590 * ContentChecker.pm: |xml:lang| attribute value must be same
591 as |lang| attribute value for HTML elements (HTML5 revision 2062
592 and so on).
593
594 2008-08-30 Wakaba <wakaba@suika.fam.cx>
595
596 * ContentChecker.pm: Error level definition for |xml_id_error|
597 was missing.
598
599 * URIChecker.pm: The end of the URL should be marked as the
600 error location for an empty path error. The position
601 between the userinfo and the port components should be
602 marked as the error location for an empty host error.
603
604 2008-08-30 Wakaba <wakaba@suika.fam.cx>
605
606 * URIChecker.pm: Set parameters representing where in the
607 value the error occurs for errors. Report unknown
608 address format error in warning level, since address
609 formats are rarely added. Path segments starting with "/.."
610 were misinterpreted as a dot-segment.
611
612 2008-08-30 Wakaba <wakaba@suika.fam.cx>
613
614 * URIChecker.pm (check_iri_reference): Requires
615 |Message::DOM::DOMImplementation|.
616
617 2008-08-29 Wakaba <wakaba@suika.fam.cx>
618
619 * IMTChecker.pm: Updated for the new error reporting architecture.
620
621 * ContentChecker.pm: Error levels for IMTs are added.
622
623 2008-08-17 Wakaba <wakaba@suika.fam.cx>
624
625 * H2H.pm (_shift_token): Support for unquoted HTML attribute
626 values.
627
628 2008-08-16 Wakaba <wakaba@suika.fam.cx>
629
630 * CacheManifest.pm: Support for new style of error
631 reports.
632
633 * HTML.pm.src: Set line=1, column=1 to the document node.
634
635 2008-08-16 Wakaba <wakaba@suika.fam.cx>
636
637 * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
638 and URL checkers. Support for more error levels for bogus
639 langauge tag and URL "standards".
640
641 * LangTag.pm, URIChecker.pm: Support for new style error
642 level reporting.
643
644 2008-08-15 Wakaba <wakaba@suika.fam.cx>
645
646 * ContentChecker.pm: Support for RDF/XML error levels.
647
648 * HTMLTable.pm, RDFXML.pm: Support for new style of error level
649 specifying. Error types are revised.
650
651 2008-08-15 Wakaba <wakaba@suika.fam.cx>
652
653 * ContentChecker.pm: All error reporting method calls are
654 renewed.
655
656 2008-08-15 Wakaba <wakaba@suika.fam.cx>
657
658 * HTML.pm.src: All error type names and "text" parameters
659 are revised. Use new style for "level" specification.
660
661 * mkhtmlparser.pl: Use new style for "level" specification.
662
663 2008-08-03 Wakaba <wakaba@suika.fam.cx>
664
665 * WebIDL.pm (parse_char_string): Simplified error
666 reporting process for broken ignored valuetype definition.
667 (Valuetype idl_text): Support for special "DOMString" name.
668
669 2008-08-03 Wakaba <wakaba@suika.fam.cx>
670
671 * WebIDL.pm ($get_scoped_name): Append "::::" if the last
672 terminal of the ScopedName is "DOMString", such that whether
673 the last part of the scoped name is "DOMString" or "_DOMString"
674 later. It is necessary to determine whether a |typedef|
675 definition should be ignored or not.
676 (parse_char_string): Unescape the identifier of
677 exception members.
678 ($resolve): Return undef for builtin types and sequence<T>
679 types (we might not have to do this, however...).
680 (check): Support checking for Exceptions, Valuetypes,
681 and Typedefs.
682 ($serialize_type): Support for "DOMString::::" syntax.
683 (Typedef idl_text): Output Type as "DOMString" if it
684 is really "DOMString" (i.e. its internal representation
685 is "::DOMString::").
686
687 2008-08-03 Wakaba <wakaba@suika.fam.cx>
688
689 * WebIDL.pm ($resolve): New code, based on resolve code
690 for constant types in the |check| method.
691 (check): Support for checking of attributes, operations, and
692 arguments.
693 (Attribute/Operation idl_text): Exception names in getraises,
694 setraises, and raises clauses is serizlied by |$serialize_type|
695 code.
696
697 2008-08-02 Wakaba <wakaba@suika.fam.cx>
698
699 * WebIDL.pm ($integer): Order of selections are changed to match
700 hexadecimal numbers (the original pattern, taken from the spec,
701 was not work for hexadecimal numbers, because the "0" prefix
702 matches to the [0-7]* part (as an empty string) and therefore
703 it does not match with remaining "x..." part of a "0x..." integer
704 literal.
705 ($get_type): It now returns a string, not an array reference,
706 for regular types and |sequence| types (i.e. it in any case
707 returns a string).
708 ($get_next_token): The second item in the array that represents
709 a integer or float token is now a Perl number value, not the
710 original string representation of the number.
711 (check): Support for const value consistency checking.
712 No extended attribute is defined for constants.
713 (Node subclasses): Use simple strings rather than array references
714 for default data type values.
715 ($serialize_type): Type values are now simple strings.
716 (value): If the new attribute value is a false value, then
717 a FALSE value is set to the attribute.
718
719 2008-08-02 Wakaba <wakaba@suika.fam.cx>
720
721 * WebIDL.pm ($get_scoped_name): Now scoped names are stored
722 in its stringified format ("scoped name" as defined in the
723 spec). Note that future version of this module should not use
724 array references for type values and the |type_text| attribute
725 should be made obsolete.
726 (parse_char_string): Unescape attribute names.
727 (check): Support for checking of whether inherited interfaces
728 are actually defined or not. Support for checking of whether
729 interface member identifiers are duplicated or not.
730 ($serialize_type): Scoped names are returned as is. A future
731 version of this code should escape identifiers other than "DOMString",
732 otherwise the idl_text would be non-conforming.
733
734 2008-08-02 Wakaba <wakaba@suika.fam.cx>
735
736 * WebIDL.pm (parse_char_string): Set line/column numbers
737 to generated nodes. Unescape identifiers. Extended attributes
738 for Definition's were ignored.
739 (append_child): Set |parent_node| attribute.
740 (parent_node): New attribute.
741 (check): Support interface/exception members. Support
742 extended attributes. Support definition identifier uniqueness
743 constraint.
744 (qualified_name): New attribute.
745 (Interface/Exception idl_text): Extended attributes were
746 not prepended to the returned text.
747
748 2008-08-02 Wakaba <wakaba@suika.fam.cx>
749
750 * WebIDL.pm (parse_char_string): Set line/column numbers
751 to interface object experimentally. s/shift/pop/g, shift
752 would make things wrong. Support for interface forward
753 declarations was missing. Broken interface declarations
754 with no block were not ignored entirely.
755 (Whatpm::WebIDL::Node): New abstract class. This class
756 makes things easier.
757 (child_nodes): New attribute. Unlike DOM's attribute with
758 same name, this attribute returns a dead list of nodes for
759 simplicity.
760 (get_user_data, set_user_data): New methods.
761 (Module idl_text): A SPACE character should be inserted
762 before the |{| character.
763 (Interface idl_text): Support for interface forward declarations.
764 (is_forward_declaration): New attribute.
765
766 2008-07-19 Wakaba <wakaba@suika.fam.cx>
767
768 * WebIDL.pm (type_text): Better serializer.
769
770 2008-07-19 Wakaba <wakaba@suika.fam.cx>
771
772 * WebIDL.pm: Revise forward-compatible parsing so that
773 it now can handle broken extended attributes and as such.
774
775 2008-07-19 Wakaba <wakaba@suika.fam.cx>
776
777 * WebIDL.pm: Real support for extended attributes.
778 Support for extended attributes with arguments.
779
780 2008-07-19 Wakaba <wakaba@suika.fam.cx>
781
782 * WebIDL.pm: Support for |exception| syntax.
783 (Interface->idl_text): Tentative support for inheritances.
784
785 2008-07-19 Wakaba <wakaba@suika.fam.cx>
786
787 * WebIDL.pm: Hierarchical scoped name support was broken.
788 Support for raises, setraises, and getraises syntaxes.
789
790 2008-07-18 Wakaba <wakaba@suika.fam.cx>
791
792 * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
793 proper support for types, extended attributes, and exceptions yet).
794 WebIDL parser, version 1 (no support for exceptions yet,
795 no proper support for extended attributes yet).
796
797 2008-07-09 Wakaba <wakaba@suika.fam.cx>
798
799 * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
800
801 2008-06-29 Wakaba <wakaba@suika.fam.cx>
802
803 * WebIDL.pm: Support for valuetype and const.
804
805 2008-06-29 Wakaba <wakaba@suika.fam.cx>
806
807 * WebIDL.pm: New module.
808
809 2008-06-15 Wakaba <wakaba@suika.fam.cx>
810
811 * Makefile (Entities.html): URI changed.
812
813 2008-06-08 Wakaba <wakaba@suika.fam.cx>
814
815 * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
816
817 2008-06-01 Wakaba <wakaba@suika.fam.cx>
818
819 * HTML.pm.src (_get_next_token): A parse error was missing.
820
821 2008-06-01 Wakaba <wakaba@suika.fam.cx>
822
823 * mklinktypelist.pl: rel=contact is no longer part of the HTML5
824 spec (commented out). (HTML5 revision 1711).
825
826 2008-05-25 Wakaba <wakaba@suika.fam.cx>
827
828 * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
829
830 * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
831 as UTF-16 (HTML5 revision 1701).
832
833 2008-05-25 Wakaba <wakaba@suika.fam.cx>
834
835 * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
836 1692).
837
838 2008-05-25 Wakaba <wakaba@suika.fam.cx>
839
840 * HTML.pm.src: The secondary insertion mode used when switching
841 to foreign content is the "in body" insertion mode (HTML5 revision
842 1696).
843
844 2008-05-25 Wakaba <wakaba@suika.fam.cx>
845
846 * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
847 revision 1697).
848
849 2008-05-24 Wakaba <wakaba@suika.fam.cx>
850
851 * HTML.pm.src: Support for end-of-file token in foreign content
852 insertion mode (HTML5 revision 1693). Update SVG camelCase
853 attribute list (HTML5 revision 1700). <textarea> closes
854 </select> (HTML5 revision 1699). More start tags close in
855 foreign content insertion mode (HTML5 revision 1698).
856
857 2008-05-24 Wakaba <wakaba@suika.fam.cx>
858
859 * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
860
861 2008-05-24 Wakaba <wakaba@suika.fam.cx>
862
863 * HTML.pm.src: More robust charset parameter detection (HTML5
864 revision 1674).
865
866 2008-05-24 Wakaba <wakaba@suika.fam.cx>
867
868 * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
869 revision 1676).
870
871 2008-05-24 Wakaba <wakaba@suika.fam.cx>
872
873 * HTML.pm.src: Ignore language part of public identifiers for
874 quriks mode detection (HTML5 revision 1679).
875
876 2008-05-24 Wakaba <wakaba@suika.fam.cx>
877
878 * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
879 revision 1685).
880
881 2008-05-24 Wakaba <wakaba@suika.fam.cx>
882
883 * HTML.pm.src: Support for EOF in new states for tags (HTML5
884 revision 1684).
885
886 2008-05-24 Wakaba <wakaba@suika.fam.cx>
887
888 * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
889 work (HTML5 revision 1690).
890
891 2008-05-24 Wakaba <wakaba@suika.fam.cx>
892
893 * HTML.pm.src (_tree_construction_main): Change handling of
894 end tags in head insertion modes (HTML5 revision 1686).
895 (parse_char_string): Bug fix for non-utf8 character string handlings.
896 (parse_char_stream): |ungetc| does not work well for this context.
897
898 2008-05-18 Wakaba <wakaba@suika.fam.cx>
899
900 * HTML.pm.src (parse_byte_string): Redefined to invoke
901 |parse_byte_stream|.
902 (parse_byte_stream): New method.
903
904 2008-05-18 Wakaba <wakaba@suika.fam.cx>
905
906 * HTML.pm.src (parse_byte_string): Fix the column number reported
907 by encoding layer error reporter.
908
909 2008-05-17 Wakaba <wakaba@suika.fam.cx>
910
911 * HTML.pm.src (parse_byte_string): Use streaming decoder
912 rather than converting the whole byte string and then parsing.
913 Propagate errors in character encoding layer.
914 (get_next_token): Precise error reporting for |bare stago| error.
915
916 2008-05-17 Wakaba <wakaba@suika.fam.cx>
917
918 * HTML.pm.src (parse_char_stream): New method.
919 (parse_char_string): This method is now defined as an invocation
920 of the |parse_char_stream| method.
921
922 2008-05-17 Wakaba <wakaba@suika.fam.cx>
923
924 * HTML.pm.src (parse_byte_string): Report various status
925 of the sniffing as info-level errors. Support for new
926 decoding framework in parser resestting.
927 (new): Various default error levels were not set.
928
929 2008-05-17 Wakaba <wakaba@suika.fam.cx>
930
931 * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
932 algorithm, except for the actual sniffing, is implemented
933 with new framework with Message::Charset::Info.
934
935 2008-05-16 Wakaba <wakaba@suika.fam.cx>
936
937 * CacheManifest.pm (_parse): Drop fragment identifiers from
938 URIs in fallback section (HTML5 revision 1596).
939
940 2008-05-10 Wakaba <wakaba@suika.fam.cx>
941
942 * Makefile (Entities.html): URI has changed.
943
944 2008-05-10 Wakaba <wakaba@suika.fam.cx>
945
946 * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
947 1553).
948
949 2008-05-06 Wakaba <wakaba@suika.fam.cx>
950
951 * ContentChecker.pm: Noted that those returned in |table| are
952 no longer table elements, but table objects returned
953 by Whatpm::HTMLTable.
954
955 * HTMLTable.pm (form_table): Return table element node
956 as |$table->{element}|.
957 (assign_header): Support for the |headers=""| attribute.
958
959 2008-05-06 Wakaba <wakaba@suika.fam.cx>
960
961 * HTMLTable.pm (assign_header): New function; first version
962 with no support for headers="".
963 (form_table): Include table width and height to the returned
964 table object for covenience. Indexing in column assignement was wrong.
965 Set whether a data cell is empty or not for convenience.
966
967 2008-05-05 Wakaba <wakaba@suika.fam.cx>
968
969 * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
970
971 2008-05-05 Wakaba <wakaba@suika.fam.cx>
972
973 * HTMLTable.pm: How table model errors are detected is
974 changed (HTML5 revision 1387).
975
976 2008-05-05 Wakaba <wakaba@suika.fam.cx>
977
978 * HTMLTable.pm: The algorithm now moves |tfoot| elements
979 to the end of the table (HTML5 revision 1380).
980
981 2008-05-05 Wakaba <wakaba@suika.fam.cx>
982
983 * HTMLTable.pm: The algorithm is now 0-based indexing, instead
984 of 1-based (HTML5 revision 1376).
985
986 2008-05-05 Wakaba <wakaba@suika.fam.cx>
987
988 * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
989 and "Content-Encoding" no longer prevent sniffing (HTML5
990 revision 1288).
991
992 2008-05-05 Wakaba <wakaba@suika.fam.cx>
993
994 * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
995 1282).
996
997 2008-05-03 Wakaba <wakaba@suika.fam.cx>
998
999 * ContentChecker.pm: Support for global attributes.
1000 Status of XML specs are added.
1001
1002 2008-05-03 Wakaba <wakaba@suika.fam.cx>
1003
1004 * ContentChecker.pm (check_element): Support for |template=""|
1005 and |ref=""| attribute (referent element type checking).
1006
1007 2008-04-29 Wakaba <wakaba@suika.fam.cx>
1008
1009 * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
1010 1500) is implemented (except for IDNA part and URI-scheme-specific
1011 knowledge). Line number counting was wrong for LF-only
1012 documents.
1013
1014 2008-04-13 Wakaba <wakaba@suika.fam.cx>
1015
1016 * HTML.pm.src: Raise an parse error for any disallowed
1017 character (HTML5 revision 1263).
1018
1019 2008-04-13 Wakaba <wakaba@suika.fam.cx>
1020
1021 * mkentitylist.pl: Support for new HTML5 entity table format (the
1022 definition for |AElig;| was missing).
1023
1024 2008-04-13 Wakaba <wakaba@suika.fam.cx>
1025
1026 * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
1027 name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
1028
1029 2008-04-13 Wakaba <wakaba@suika.fam.cx>
1030
1031 * HTML.pm.src: List of element names that close foreign content
1032 insertion mode is added (HTML5 revisions 1412 and 1418).
1033
1034 2008-04-13 Wakaba <wakaba@suika.fam.cx>
1035
1036 * HTML.pm.src: Support for |mglyph| and |malignmark|
1037 elements (HTML5 revision 1410).
1038
1039 2008-04-13 Wakaba <wakaba@suika.fam.cx>
1040
1041 * HTML.pm.src: Support for new long MathML entities (HTML5
1042 revision 1406).
1043
1044 2008-04-13 Wakaba <wakaba@suika.fam.cx>
1045
1046 * HTML.pm.src: CDATA section support for MathML and SVG
1047 elements (HTML5 revision 1404 and 1420).
1048
1049 2008-04-12 Wakaba <wakaba@suika.fam.cx>
1050
1051 * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
1052 elements (HTML5 revision 1404). Unused !!!macro definitions
1053 are removed.
1054
1055 2008-04-12 Wakaba <wakaba@suika.fam.cx>
1056
1057 * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
1058 are raised is changed (HTML5 revision 1404).
1059
1060 2008-04-06 Wakaba <wakaba@suika.fam.cx>
1061
1062 * HTML.pm.src: Category sets that are no longer used are removed.
1063
1064 2008-04-06 Wakaba <wakaba@suika.fam.cx>
1065
1066 * HTML.pm.src: The ->[1] property of stack entries are now
1067 replaced by constants representing element category.
1068
1069 2008-04-06 Wakaba <wakaba@suika.fam.cx>
1070
1071 * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
1072 for error reporting. (This is a preparation for using constant
1073 value for ->[1].)
1074
1075 2008-03-22 Wakaba <wakaba@suika.fam.cx>
1076
1077 * RDFXML.pm: Typo fixed.
1078
1079 2008-03-22 Wakaba <wakaba@suika.fam.cx>
1080
1081 * ContentChecker.pm: |fact_level| is now treated
1082 as same as |must_level|, i.e. level = |m|.
1083 (check_element): Make list of URIs in the DOM.
1084
1085 2008-03-21 Wakaba <wakaba@suika.fam.cx>
1086
1087 * RDFXML.pm: Language accessor implemented.
1088 Local (null-namespace) attribute support.
1089
1090 2008-03-21 Wakaba <wakaba@suika.fam.cx>
1091
1092 * RDFXML.pm: Factored out ID checking code.
1093
1094 2008-03-21 Wakaba <wakaba@suika.fam.cx>
1095
1096 * RDFXML.pm: TODO items noted. Validation of ID
1097 and URI attributes is implemented. Warn if unknown
1098 value is used in rdf:parseType="" attribute.
1099
1100 * URIChecker.pm (check_rdf_uri_reference): New function.
1101
1102 2008-03-21 Wakaba <wakaba@suika.fam.cx>
1103
1104 * RDFXML.pm: bnodeid implemented. Relative references
1105 are now resolved.
1106
1107 2008-03-21 Wakaba <wakaba@suika.fam.cx>
1108
1109 * ContentChecker.pm: RDF reification implemented.
1110
1111 * RDFXML.pm: undef vs false bug fixed.
1112 Reification implemented.
1113
1114 2008-03-21 Wakaba <wakaba@suika.fam.cx>
1115
1116 * RDFXML.pm: s/id/ID/ for attribute name.
1117 The |node| arguments are added for |ontriple| calls.
1118 Too many "attribute not allowed" errors were raised.
1119
1120 * ContentChecker.pm: Initial experimental support for rdf:RDF
1121 element.
1122
1123 2008-03-21 Wakaba <wakaba@suika.fam.cx>
1124
1125 * RDFXML.pm: New module.
1126
1127 2008-03-20 Wakaba <wakaba@suika.fam.cx>
1128
1129 * HTML.pm.src (set_inner_html): Line/column number
1130 code was old one yet.
1131
1132 2008-03-20 Wakaba <wakaba@suika.fam.cx>
1133
1134 * HTML.pm.src: Better line/column reporting for "duplicate attribute"
1135 errors. Line/column markings for DOCTYPE, comment, and
1136 character tokens are reintroduced; otherwise,
1137 error location for "not HTML5" error and errors
1138 for implied elements are not attached.
1139
1140 2008-03-20 Wakaba <wakaba@suika.fam.cx>
1141
1142 * HTML.pm.src: Set line/column numbers to attributes.
1143
1144 * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
1145 (value): Setter implemented.
1146
1147 * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
1148
1149 2008-03-20 Wakaba <wakaba@suika.fam.cx>
1150
1151 * HTML.pm.src: Unused line/column markings are removed.
1152
1153 2008-03-20 Wakaba <wakaba@suika.fam.cx>
1154
1155 * HTML.pm.src (_get_next_token): Remove |first_start_tag|
1156 flag, which is no longer used.
1157
1158 2008-03-17 Wakaba <wakaba@suika.fam.cx>
1159
1160 * HTML.pm.src: Set line/column information to element nodes.
1161
1162 * mkhtmlparser.pl (!!!create-element, !!!insert-element,
1163 and !!!insert-element-t): Set line/column information to
1164 element nodes.
1165
1166 2008-03-17 Wakaba <wakaba@suika.fam.cx>
1167
1168 * HTML.pm.src (_get_next_token): The first "<" character
1169 in "<?", "<>", or "</>" should be the error point.
1170
1171 2008-03-16 Wakaba <wakaba@suika.fam.cx>
1172
1173 * HTML.pm.src: Some more fixes on error position reporting.
1174
1175 2008-03-16 Wakaba <wakaba@suika.fam.cx>
1176
1177 * HTML.pm.src: Token-level precious error reporting.
1178
1179 2008-03-16 Wakaba <wakaba@suika.fam.cx>
1180
1181 * HTML.pm.src: Preparation for more precious error point
1182 reporting.
1183
1184 2008-03-11 Wakaba <wakaba@suika.fam.cx>
1185
1186 * HTML.pm.src: Error type revised.
1187
1188 2008-03-11 Wakaba <wakaba@suika.fam.cx>
1189
1190 * HTML.pm.src: Similar codes are merged together, again.
1191
1192 2008-03-11 Wakaba <wakaba@suika.fam.cx>
1193
1194 * HTML.pm.src: Similar codes are merged together.
1195
1196 2008-03-10 Wakaba <wakaba@suika.fam.cx>
1197
1198 * mkhtmlparser.pl: Set "level" parameter to parse errors.
1199
1200 * HTML.pm.src: Code refined.
1201
1202 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1203
1204 * HTML.pm.src: |</body>| treatement has been changed (HTML5
1205 revision 1348). Note that I really don't know this makes
1206 any difference in the black-box behavior of the parser.
1207
1208 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1209
1210 * HTML.pm.src: New end-of-file token implementation (HTML5
1211 revision 1348).
1212
1213 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1214
1215 * HTML.pm.src: |applet| support (HTML5 revision 1347).
1216
1217 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1218
1219 * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
1220
1221 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1222
1223 * HTML.pm.src: Support for |<input>| in the "in select" insertion
1224 mode and support for the "in select in table" insertion mode (HTML5
1225 revision 1342).
1226
1227 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1228
1229 * HTML.pm.src: No foster parenting for <script> and <script>
1230 in non-tainted <table>s (HTML5 revision 1336).
1231
1232 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1233
1234 * HTML.pm.src: Ignore white space characters between <html>
1235 and <head> (HTML5 revision 1332).
1236
1237 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1238
1239 * HTML.pm.src: Treat <input type=hidden> as if it were a
1240 white space (HTML5 revision 1331).
1241
1242 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1243
1244 * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
1245 element (HTML5 revision 1330).
1246
1247 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1248
1249 * HTML.pm.src: <title> is always appended to the current
1250 element (HTML5 revision 1328).
1251
1252 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1253
1254 * HTML.pm.src: White space in tainted tables are moved
1255 into foster parents (HTML5 revision 1326).
1256
1257 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1258
1259 * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
1260 revision 1321).
1261
1262 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1263
1264 * HTML.pm.src: |</h/n/>| case code rearranged to align with
1265 the spec (HTML5 revision 1320). Note that we finally complete
1266 all of HTML5 revision 1320 changes.
1267
1268 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1269
1270 * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
1271 tags (HTML5 revision 1320).
1272
1273 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1274
1275 * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
1276
1277 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1278
1279 * HTML.pm.src: A "generate implied end tags" code (t409.1)
1280 could not be reached so that it is now removed (HTML5 revision
1281 1320).
1282
1283 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1284
1285 * HTML.pm.src: Code for the case of |</div>| and so on are revised
1286 to align with new spec text (HTML5 revision 1320).
1287
1288 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1289
1290 * HTML.pm.src: Remove strange |if| condition; however, it should
1291 have had no harm in theory.
1292
1293 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1294
1295 * HTML.pm.src (_tree_construction_main): '</p>' in body
1296 case is split from other end tags for the preparation
1297 of implementing HTML revision 1320.
1298
1299 2008-03-07 Wakaba <wakaba@suika.fam.cx>
1300
1301 * HTML.pm.src: Simplified "generate implied end tag" (HTML5
1302 revision 1320).
1303
1304 2008-03-07 Wakaba <wakaba@suika.fam.cx>
1305
1306 * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
1307 and "div" (HTML5 revision 1318). Add comments to where
1308 |form| pointer association codes should be inserted (HTML5 revision
1309 1319).
1310
1311 2008-03-06 Wakaba <wakaba@suika.fam.cx>
1312
1313 * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
1314
1315 2008-03-05 Wakaba <wakaba@suika.fam.cx>
1316
1317 * HTML.pm.src: Since the case t268 should never be reached (no
1318 other token type, there are), it is replaced by a |die| statement.
1319
1320 2008-03-05 Wakaba <wakaba@suika.fam.cx>
1321
1322 * HTML.pm.src: Typo fixed.
1323
1324 2008-03-04 Wakaba <wakaba@suika.fam.cx>
1325
1326 * HTML.pm.src (_tree_construction_initial): Some limited quirks
1327 doctypes were not uppercased for comparison.
1328
1329 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1330
1331 * HTML.pm.src (tree construction and set_inner_html): Checkpoints
1332 are added.
1333
1334 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1335
1336 * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
1337 are set. Cases that are unlikely reached are noted as so.
1338
1339 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1340
1341 * HTML.pm.src: Checkpoints for debugging are added.
1342
1343 * mkhtmlparser.pl: Support for |!!!cp| syntax.
1344
1345 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1346
1347 * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
1348 for simplicity.
1349
1350 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1351
1352 * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
1353 revision 1307).
1354
1355 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1356
1357 * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
1358 is set are changed (HTML5 revision 1305).
1359
1360 2008-03-02 Wakaba <wakaba@suika.fam.cx>
1361
1362 * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
1363 1292). Entities are not parsed in comment-like part in RCDATA
1364 elements (HTML5 revision 1294). Allow bare & at the end
1365 of attribute value literals (HTML5 revision 1296). More
1366 quirks mode doctypes (HTML5 revision 1302). Requires spaces
1367 between attributes and ban attribute names or unquoted
1368 attribute values containing single or double quotes (HTML5
1369 revision 1303).
1370
1371 2008-03-02 Wakaba <wakaba@suika.fam.cx>
1372
1373 * ContentChecker.pm: Typo fixed. Don't raise "character encoding"
1374 and related errors unless it is an HTML document (though the spec
1375 is unclear on whether it is applied to XHTML document).
1376
1377 * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
1378 are added.
1379
1380 2008-03-02 Wakaba <wakaba@suika.fam.cx>
1381
1382 * HTML.pm.src: s/local_name/manakai_local_name/g.
1383
1384 2008-03-01 Wakaba <wakaba@suika.fam.cx>
1385
1386 * _NamedEntityList.pm: Updated (HTML5 revision 1286).
1387
1388 * HTML.pm.src: |charset| in |content| attribute is
1389 case-insensitive (HTML5 revision 1270).
1390
1391 2008-02-26 Wakaba <wakaba@suika.fam.cx>
1392
1393 * ContentChecker.pm: New status constants are added.
1394 ($ElementDefault): |status| added.
1395 (check_element): Err for non-standard or deprecated elements.
1396 (_attr_status_info): For non-standard or deprecated attributes.
1397
1398 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1399
1400 * ContentChecker.pm (_attr_status_info): New internal method.
1401
1402 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1403
1404 * ContentChecker.pm (check_element): Element standardized
1405 status information is now dispatched.
1406
1407 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1408
1409 * ContentChecker.pm (check_element): Fix |del|-and-significant
1410 problem by adding some more arguments.
1411
1412 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1413
1414 * ContentChecker.pm (check_element): Use context of
1415 container-for-the-purpose-of-content-model element (not transparent
1416 element) for |check_child_element| calling and significant
1417 text flag marking. This reintroduces |<del>|-and-significant
1418 problem again.
1419
1420 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1421
1422 * ContentChecker.pm (check_element): Make semi-transparent
1423 elements ignored for the purpose of phase changes in
1424 content model checking.
1425
1426 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1427
1428 * ContentChecker.pm (check_element): In-element state
1429 was not properly managed for transparent cases.
1430
1431 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1432
1433 * ContentChecker.pm (check_element): Support for |video|
1434 and |audio| as semi-transparent elements.
1435
1436 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1437
1438 * ContentChecker.pm ($HTMLSemiTransparentElements): New.
1439 (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
1440 Support for |html:object| as a semi-transparent element.
1441
1442 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1443
1444 * ContentChecker.pm (check_element): The way to traverse
1445 the tree is entirely revised to make it easier to track
1446 the state of ancestors/descendants. As a result of this
1447 revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1448 support for content model checking for HTML elements |figure|,
1449 |object|, |video|, and |audio| and checking for XML elements (and
1450 some XMLNS checkings) are dropped for now. They will be
1451 reimplemented in due cource.
1452
1453 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1454
1455 * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1456 token (HTML5 revision 1225).
1457
1458 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1459
1460 * ContentChecker.pm ({unsupported_level}): New value.
1461
1462 * HTML.pm.src: Save whether |meta| |content| attribute
1463 contains character references or not.
1464
1465 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1466
1467 * ContentChecker.pm (_get_children): (Incompleted) attempt to
1468 imlement significant content checking for contents
1469 with |del| elements.
1470
1471 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1472
1473 * ContenteChecker.pm ($HTMLTransparentElements): More
1474 elements are added.
1475 (_get_children): HTML |object| elements are now semi-transparent.
1476
1477 * NanoDOM.pm (manakai_html, manakai_head): New methods.
1478
1479 2008-02-16 Wakaba <wakaba@suika.fam.cx>
1480
1481 * CacheManifest.pm: HTML5 revision 1211 implemented.
1482
1483 * CacheManifest.pod: Updated.
1484
1485 2008-02-10 Wakaba <wakaba@suika.fam.cx>
1486
1487 * ContentChecker.pm (check_document, check_element): Support
1488 for second argument ($onsubdoc).
1489 (_get_css_parser): Removed (now it is part of WDCC).
1490
1491 2008-02-09 Wakaba <wakaba@suika.fam.cx>
1492
1493 * ContentChecker.pm (_get_css_parser): New.
1494
1495 2007-11-25 Wakaba <wakaba@suika.fam.cx>
1496
1497 * ContentChecker.pm ($AnyChecker): Old way to add child elements
1498 for checking had been used.
1499
1500 2007-11-25 Wakaba <wakaba@suika.fam.cx>
1501
1502 * ContentChecker.pm (check_element): New todo item type |descendant|.
1503
1504 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1505
1506 * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1507 specced in RFC 2046 and RFC 4288.
1508 (application/atom+xml): Definition added.
1509
1510 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1511
1512 * URIChecker.pm: Make RFC 3986 should-level errors
1513 warnings (rather than SHOULD-level errors).
1514
1515 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1516
1517 * NanoDOM.pm (get_user_data, set_user_data): New methods.
1518
1519 * HTML.pm.src: A flag for character references in attribute
1520 values are added. Set |manakai_has_reference| user data
1521 to |charset| attribute.
1522
1523 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1524
1525 * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1526 attributes.
1527
1528 * ContentChecker.pm (check_document): Warn if charset requirements
1529 cannot be tested.
1530
1531 2007-11-19 Wakaba <wakaba@suika.fam.cx>
1532
1533 * HTML.pm.src (parse_byte_string): Detect charset
1534 by universalchardet if charset parameter is not specified.
1535
1536 * Makefile (Charset-all, Charset-clean): New rules.
1537
1538 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1539
1540 * ContentChecker.pm (check_document): Check the existence
1541 of character encoding declaration and selection of encoding
1542 for HTML document.
1543
1544 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1545
1546 * ContentType.pm (get_sniffed_type): Return also the official
1547 type in list context.
1548
1549 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1550
1551 * ContentType.pm: Sniffing with leading white space
1552 ignoring (HTML5 revisions 1013 and 1016).
1553
1554 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1555
1556 * ContentType.pm: HTML5 revision 1013 changes, except for leading
1557 white spaces, are implemented.
1558
1559 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1560
1561 * HTML.pm.src (parse_char_string): Set |inner_encoding|
1562 attribute if possible.
1563
1564 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1565
1566 * HTML.pm.src (parse_byte_string): New method.
1567 (parse_char_string): New alias for |parse_string|.
1568 (main phase): Invoking "change the encoding" algorithm if desired.
1569
1570 * HTML.pod: Updated.
1571
1572 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1573
1574 * HTML.pod (get_inner_html): Removed.
1575
1576 * Makefile (HTML-all, HTML-clean): New.
1577
1578 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1579
1580 * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1581
1582 2007-11-08 Wakaba <wakaba@suika.fam.cx>
1583
1584 * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1585
1586 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1587
1588 * Makefile: |CacheManifest.html| is added.
1589
1590 * CacheManifest.pod: New file.
1591
1592 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1593
1594 * CacheManifest.pm: New module.
1595
1596 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1597
1598 * HTML.pm.src: Support for application cache selection algorithm
1599 callback.
1600
1601 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1602
1603 * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1604
1605 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1606
1607 * Makefile (clean): New rule.
1608
1609 * NanoDOM.pm (public_id, system_id): New attributes.
1610
1611 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1612
1613 * Makefile (CSS-all, CSS-clean, clean): New rules.
1614
1615 2007-10-14 Wakaba <wakaba@suika.fam.cx>
1616
1617 * ContentChecker.pm (check_document): Support for
1618 new |is_xml_root| flag.
1619 (check_element): Support for new |pluses| state.
1620 (_add_pluses): New method.
1621 (_remove_minuses): Support for new |minus| item.
1622
1623 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1624
1625 * ContentChecker.pm: Raise specific error for invalid
1626 root element.
1627
1628 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1629
1630 * ContentChecker.pm: Set level values for later uses.
1631
1632 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1633
1634 * ContentChecker.pm: Support for language tag validation.
1635
1636 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1637
1638 * LangTag.pm (check_rfc3066_language_tag): New method.
1639
1640 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1641
1642 * LangTag.pm: New module.
1643
1644 2007-09-04 Wakaba <wakaba@suika.fam.cx>
1645
1646 * HTML.pm.src: Some error types were wrong.
1647
1648 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1649
1650 * CSS/: New directory.
1651
1652 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1653
1654 * ContentChecker.pm (_check_get_children): Support
1655 for |noscript| in |head|.
1656
1657 2007-08-12 Wakaba <wakaba@suika.fam.cx>
1658
1659 * URI/: New directory.
1660
1661 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1662
1663 * HTML.pm.src: Tokenizer's states are now represented in
1664 number.
1665
1666 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1667
1668 * HTML.pm.src: |or|s for insertion modes are replaced
1669 by |&|s.
1670
1671 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1672
1673 * HTML.pm.src: Token types are now represented in number.
1674
1675 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1676
1677 * ContentType.pm (SEE ALSO): Updated.
1678
1679 * HTML.pm.src: Insertion modes are now represented in number.
1680
1681 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1682
1683 * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1684 999) is implemented.
1685
1686 2007-08-08 Wakaba <wakaba@suika.fam.cx>
1687
1688 * ContentType.pm: Sniffing for |<script| (HTML5 revision
1689 983) is implemented.
1690
1691 2007-08-06 Wakaba <wakaba@suika.fam.cx>
1692
1693 * ContentChecker.pod: New documentation.
1694
1695 * Makefile: A rule for |ContentChecker.html| is added.
1696
1697 * ContentChecker.pm: A pod "LICENSE" section is added.
1698
1699 * NanoDOM.pm ($VERSION): New variable.
1700
1701 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1702
1703 * H2H.pm: |b|, |i|, and |sub| are added to the
1704 list of allowed HTML elements.
1705
1706 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1707
1708 * H2H.pm: |samp| is added to the list of allowed
1709 HTML elements.
1710
1711 * URIChecker.pm (check_iri): New.
1712 (check_iri_reference): Error type for IRI reference
1713 syntax error is changed.
1714
1715 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1716
1717 * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1718 is added.
1719 (check_document): Load appropriate module before validation.
1720
1721 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1722
1723 * ContentChecker/: New directory.
1724
1725 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1726
1727 * ContentChecker.pm: HTML |time| element is implemented.
1728
1729 * HTMLTable.pm: Comments are updated as HTML5 is revised.
1730
1731 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1732
1733 * ContentChecker.pm (check_document): Return value
1734 even if no document element is found.
1735
1736 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1737
1738 * HTML.pm.src: |$in_body| is no longer a function.
1739
1740 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1741
1742 * HTML.pm.src: The |$in_body| code has been moved down.
1743
1744 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1745
1746 * HTML.pm.src: The "trailing end" insertion mode
1747 is split into "after html body" and "after html frameset"
1748 insertion modes. Their codes are merged with "after body"
1749 and "after frameset" codes. |$previous_insertion_mode|
1750 has been removed. "after frameset" code is
1751 merged with "in frameset" code.
1752
1753 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1754
1755 * HTML.pm.src: The "before head" insertion mode is
1756 merged with the "in head" insertion mode.
1757
1758 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1759
1760 * HTML.pm.src: Cases in "in head" insertion mode are
1761 reorganized.
1762
1763 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1764
1765 * HTML.pm.src: Some cases in "in table" insertion mode
1766 are merged.
1767
1768 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1769
1770 * HTML.pm.src: The "in row" insertion mode is merged
1771 with "in table" insertion mode.
1772
1773 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1774
1775 * HTML.pm.src: The "in table" and "in table body" insertion
1776 modes are merged.
1777
1778 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1779
1780 * HTML.pm.src: There is no "in table head" or "in table foot"
1781 insertion mode!
1782
1783 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1784
1785 * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
1786 now directly invoke the handler.
1787
1788 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1789
1790 * HTML.pm.src: Codes for "in cell" insertion mode
1791 is merged to the "in body" insertion mode code.
1792
1793 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1794
1795 * HTML.pm.src: Codes for "in body" and "in caption"
1796 insertion modes are merged.
1797
1798 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1799
1800 * HTML.pm.src: Two |!!!next-token|s were missing.
1801
1802 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1803
1804 * HTML.pm.src: Use numeric constant for |{content_mode}|
1805 instead of string constant for |{content_model_flag}|.
1806
1807 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1808
1809 * HTML.pm.src: Add the name of the attribute
1810 to the "duplicate attribute" error.
1811
1812 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1813
1814 * ContentChecker.pm: Return the |class| node list.
1815
1816 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1817
1818 * ContentChecker.pm: Return the |id| node list.
1819
1820 * HTML.pm.src: A typo is fixed.
1821
1822 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1823
1824 * ContentChecker.pm: Drop wrong |level => 'error'| specification
1825 from "in HTML:xml:lang" error. Character position
1826 is now the last part of the error type in the URI error
1827 description. Report "unsupported" status for language
1828 tags, media queries, script codes, and style sheets.
1829
1830 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1831
1832 * ContentChecker.pm: Report error if |xml:lang|
1833 in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
1834 in XML.
1835
1836 * NanoDOM.pm (Attr.owner_document): New attribute.
1837
1838 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1839
1840 * HTML.pm.src: The character immediately following
1841 a bare |hcro| was discarded. Fix handling of
1842 entity references in attribute values.
1843
1844 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1845
1846 * HTML.pm.src (main and trailin end phases): Token
1847 types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
1848 factored out. Error types |in html:#DOCTYPE|
1849 and |after html:#DOCTYPE| are merged
1850 into |DOCTYPE in the middle|. |</frameset>|
1851 in fragment parsing mode changed the
1852 insertion mode.
1853
1854 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1855
1856 * HTML.pm.src: |$phase| has been removed; The |trailing end|
1857 phase is now an insertion mode. Treatments for white
1858 space character tokens were incorrect for some
1859 insertion modes. An old |meta| case was not removed.
1860
1861 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1862
1863 * HTML.pm.src: |meta| charset declaration extraction
1864 implemented (but changing the encoding is not yet:-).
1865
1866 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1867
1868 * Charset/: New directory.
1869
1870 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1871
1872 * H2H.pm: New Perl module (created from
1873 manakai's H2H.dis).
1874
1875 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1876
1877 * XMLSerializer.pm: New Perl module (created from
1878 manakai's SimpleLS.dis).
1879
1880 2007-07-07 Wakaba <wakaba@suika.fam.cx>
1881
1882 * HTML.pm.src: |<!---x-->| was not processed correctly.
1883
1884 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1885
1886 * HTML.pm.src: Report correct error message
1887 for |<body></div></body>|.
1888
1889 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1890
1891 * HTMLTable.pm: An error description was incorrect.
1892
1893 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1894
1895 * ContentChecker.pm: Return |{term}| list.
1896
1897 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1898
1899 * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
1900 nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
1901 and <title> outside of head).
1902
1903 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1904
1905 * IMTChecker.pm: Report warning for unregistered
1906 and private types/subtypes.
1907
1908 * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
1909 URIChecker.pm, HTMLTable.pm: Error messages are now
1910 consistent; they are all listed in
1911 <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
1912
1913 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1914
1915 * ContentChecker.pm: |<img ismap>| not in |<a></a>|
1916 is now erred. |<datalist>| is implemented.
1917 Attribute checker for |<command>| and |<menu>| are
1918 added. Support for |contextmenu| global attribute
1919 is added.
1920
1921 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1922
1923 * HTML.pm.src (_reset_insertion_mode): Interpretation
1924 of Step 3 has been changed.
1925
1926 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1927
1928 * HTML.pm.src: Late |<html>| parse error is implemented.
1929
1930 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1931
1932 * URIChecker.pm (check_iri_reference): A |decode| method name was
1933 incorrect.
1934
1935 * ContentChecker.pm: Support for the |footer| element.
1936 Check URI syntax for space-separated URI attributes.
1937 Support for the |tabindex| attribute. Support
1938 for |datetime| attribute.
1939
1940 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1941
1942 * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
1943 character references). HTML5 revision 1.146 (white space
1944 characters before root start tag). HTML5 revision
1945 1.148 (named character references in attribute values).
1946 HTML5 revision 1.152 (<plaintext>.innerHTML get).
1947
1948 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1949
1950 * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
1951 in <head>).
1952
1953 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1954
1955 * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>
1956 in body).
1957
1958 * ContentChecker.pm: HTML5 revision 938 (scoped="").
1959
1960 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1961
1962 * HTML.pm.src: HTML5 revision 923 (matching end tag in CDATA
1963 or RCDATA in fragment parsing mode). HTML5 revision
1964 924 (<!--> and <!--->). HTML5 revision 926 (hn in hn).
1965
1966 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1967
1968 * HTML.pm.src (get_inner_html): HTML5 revision 922 (inner_html
1969 for <pre> and <textarea>).
1970
1971 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1972
1973 * HTML.pm.src: HTML5 revision 920 (<isindex>).
1974
1975 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1976
1977 * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).
1978
1979 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1980
1981 * HTML.pm.src: HTML5 revision 916 (</body>).
1982 HTML5 revision 917 (conforming bare &).
1983
1984 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1985
1986 * NanoDOM.pm (manakai_is_html): Setting to false did not work.
1987
1988 * HTML.pm.src: HTML5 revision 914 (</ in CDATA, RCDATA).
1989 HTML5 revision 915 (<nobr>).
1990
1991 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1992
1993 * HTML.pm.src: HTML5 revisions 908, 909, 912, and 913 (quirks mode).
1994
1995 * NanoDOM.pm (manakai_is_html, manakai_compat_mode, compat_mode):
1996 New attributes.
1997
1998 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1999
2000 * HTML.pm.src: HTML5 revisions 900, 901, 902, and 911 (<
2001 in tags).
2002
2003 2007-06-23 Wakaba <wakaba@suika.fam.cx>
2004
2005 * .cvsignore: |Entities.html| is added.
2006
2007 * HTML.pm.src: |$entity_char| is removed and
2008 requires |Whatpm::_NamedEntityList| instead.
2009 HTML5 revision 898 (refc), except that lack of refc
2010 is parse error.
2011
2012 * mkentitylist.pl: New script.
2013
2014 * Makefile (all): |_NamedEntityList.pm| is added.
2015 (_NamedEntityList.pm, Entities.html): New rules.
2016
2017 2007-06-23 Wakaba <wakaba@suika.fam.cx>
2018
2019 * HTML.pm.src: Parse errors immediately after U+000D
2020 were ignored and U+000D immediately following another
2021 U+000D was not converted to U+000A.
2022
2023 2007-06-23 Wakaba <wakaba@suika.fam.cx>
2024
2025 * HTML.pm.src (set_inner_html): HTML5 revision 892 (adopt
2026 nodes before appended). Parser was not ready for NULL
2027 parse error and escape flag.
2028
2029 * NanoDOM.pm (adopt_node): New.
2030
2031 2007-06-23 Wakaba <wakaba@suika.fam.cx>
2032
2033 * HTML.pm.src: HTML5 revision 886 (insane comment in
2034 CDATA and RCDATA). Note that current implementation
2035 is simply repeating what the spec says and it is maybe not
2036 a best way to do it.
2037
2038 2007-06-23 Wakaba <wakaba@suika.fam.cx>
2039
2040 * HTML.pm.src: HTML5 revision 884 (</form> don't close
2041 the form element if a descendant element without implied end tag has
2042 still been open).
2043
2044 2007-06-23 Wakaba <wakaba@suika.fam.cx>
2045
2046 * ContentChecker.pm: HTML5 revision 881 (Make |id| attribute with
2047 space characters non-conforming).
2048
2049 2007-06-23 Wakaba <wakaba@suika.fam.cx>
2050
2051 * HTML.pm.src: An error message was incorrect.
2052 HTML5 revision 869 (C1 character references).
2053
2054 2007-06-23 Wakaba <wakaba@suika.fam.cx>
2055
2056 * HTML.pm.src: HTML5 revision 867 (a LF at the beginning of
2057 a |textarea| is removed).
2058
2059 2007-06-05 Wakaba <wakaba@suika.fam.cx>
2060
2061 * NanoDOM.pm (get_attribute_node_ns): New method.
2062
2063 * ContentChecker.pm: |script| |async| and |defer|
2064 no longer require |src|. |async| MUST NOT be
2065 specified if |defer|. (HTML5 revision 858).
2066
2067 2007-05-30 Wakaba <wakaba@suika.fam.cx>
2068
2069 * HTML.pm.src: |<form><form>| went to inifinite loop.
2070
2071 2007-05-27 Wakaba <wakaba@suika.fam.cx>
2072
2073 * ContentChecker.pm (html): Set |is_root| (allowed
2074 as a document element) flag on.
2075 (new): Removed.
2076 (check_document): New method.
2077
2078 2007-05-27 Wakaba <wakaba@suika.fam.cx>
2079
2080 * ContentChecker.pm (thead, tfoot): Checker specifications
2081 were incorrect.
2082
2083 2007-05-27 Wakaba <wakaba@suika.fam.cx>
2084
2085 * ContentChecker.pm ($HTMLURIAttrChecker): Include
2086 error position in the |type| option of the error.
2087
2088 * HTMLTable.pm (form_table): The |$onerror| parameter
2089 is now optional. Some bugs are fixed.
2090
2091 2007-05-27 Wakaba <wakaba@suika.fam.cx>
2092
2093 * HTMLTable.pm: New module.
2094
2095 * ContentChecker.pm (table): Invoke table model error checker.
2096
2097 * NanoDOM.pm (first_child, get_attribute_ns): New.
2098
2099 2007-05-26 Wakaba <wakaba@suika.fam.cx>
2100
2101 * ContentChecker.pm ($HTMLLinkTypesAttrChecker): New checker.
2102 (link/@rel, a/@rel, area/@rel): Use new checker.
2103
2104 * Makefile (_LinkTypeList.pm, RelExtensions.html): New rules.
2105
2106 * _LinkTypeList.pm: New file.
2107
2108 * mklinktypelist.pl: New file.
2109
2110 * .cvsignore: |RelExtensions.html| added.
2111
2112 * NanoDOM.pm (child_nodes): Returns an empty array
2113 for non-child-containing node types.
2114 (text_content): New attribute.
2115
2116 2007-05-26 Wakaba <wakaba@suika.fam.cx>
2117
2118 * IMTChecker.pm: New module.
2119
2120 * ContentChecker.pm ($HTMLIMTAttrChecker): Call IMTChecker
2121 to test parameter value validity.
2122
2123 * HTML.pm.src ($style_start_tag): Attributes were
2124 discarded.
2125
2126 2007-05-25 Wakaba <wakaba@suika.fam.cx>
2127
2128 * ContentChecker.pm ($HTMLURIAttrChecker): Implemented.
2129
2130 2007-05-25 Wakaba <wakaba@suika.fam.cx>
2131
2132 * URIChecker.pm: All recommendations from RFC 3986
2133 and RFC 3987 are listed (not all testable items are checked yet).
2134
2135 2007-05-25 Wakaba <wakaba@suika.fam.cx>
2136
2137 * URIChecker.pm: New module.
2138
2139 2007-05-20 Wakaba <wakaba@suika.fam.cx>
2140
2141 * ContentChecker.pm: Now most attributes are implemented
2142 or associated to some placeholder.
2143 ($ElementDefault): Warn unknown attributes
2144 for unknown elements as "attribute not supported".
2145 ($HTMLLanguageTagAttrChecker, $HTMLMQAttrChecker): New placeholders.
2146 ($HTMLUsemapAttrChecker, $HTMLTargetAttrChecker): New checkers.
2147 (|a| attribute checker): Reimplemented.
2148
2149 2007-05-20 Wakaba <wakaba@suika.fam.cx>
2150
2151 * ContentChecker.pm ($HTMLEventHandlerAttrChecker): New placeholder.
2152 ($HTMLAttrChecker): Event handler content attributes
2153 are added.
2154 (link, embed): Required attribute is now checked.
2155 (embed): Unknown local attributes are no longer warned.
2156
2157 2007-05-20 Wakaba <wakaba@suika.fam.cx>
2158
2159 * ContentChecker.pm ($HTMLSpaceURIsAttrChecker): New placeholder.
2160 ($HTMLIMTAttrChecker): New checker.
2161 (link@rel, link@href, link@type, style@type,
2162 a@href, a@ping, a@ping, a@type, embed@src, embed@type,
2163 object@data, object@type, source@src, source@type, area@alt,
2164 area@shape, area@coords, area@href,
2165 area@ping, area@rel, area@type, script@src,
2166 script@defer, script@async, script@type): Checkers added.
2167
2168 2007-05-20 Wakaba <wakaba@suika.fam.cx>
2169
2170 * ContentChecker.pm: Descendant checking was incorrect.
2171
2172 2007-05-19 Wakaba <wakaba@suika.fam.cx>
2173
2174 * ContentChecker.pm: Support |xml:*| and |xmlns:*|
2175 attributes. Report an error if |Element.prefix|
2176 is |xmlns|.
2177
2178 * NanoDOM.pm (prefix): New attribute.
2179
2180 2007-05-19 Wakaba <wakaba@suika.fam.cx>
2181
2182 * HTML.pm.src: In |main| phase, |in body| insertion
2183 mode, action for |<iframe>| was missing.
2184
2185 2007-05-19 Wakaba <wakaba@suika.fam.cx>
2186
2187 * ContentChecker.pm: Support for many of HTML5 elements.
2188 ($GetHTMLNonNegativeIntegerAttrChecker): New.
2189
2190 2007-05-19 Wakaba <wakaba@suika.fam.cx>
2191
2192 * ContentChecker.pm: Support for most elements up to |progress|.
2193 ($HTMLURIAttrChecker): Placeholder.
2194 ($HTMLIntegerAttrChecker, $GetHTMLFloatingPointNumberAttrChecker):
2195 New.
2196
2197 2007-05-19 Wakaba <wakaba@suika.fam.cx>
2198
2199 * ContentChecker.pm: Attribute checkers
2200 for global attributes, |html|, |base|, |style|, and |meta|.
2201
2202 * NanoDOM.pm (insert_before): Weaken reference
2203 to the parent node.
2204 (Attr::new): Set |owner_element| attribute.
2205 (namespace_uri, manakai_local_name): New attribute implementations.
2206 (owner_element): New attribute.
2207
2208 2007-05-19 Wakaba <wakaba@suika.fam.cx>
2209
2210 * ContentChecker.pm ($AttrChecker, $HTMLAttrChecker,
2211 $AnyChecker->{attr_checker}, $HTMLAttrsChecker,
2212 $Element->{$HTML_NS}->{''}): New.
2213 (check_element): Invoke attrs_checker for each element.
2214
2215 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2216
2217 * ContentChecker.pm: Don't use |manakai_element_type_match|.
2218
2219 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2220
2221 * ContentChecker.pm: Use hashs rather than lists for
2222 element type testings.
2223
2224 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2225
2226 * ContentChecker.pm: Don't generate duplicate
2227 error when an element type is put in the "minus" list
2228 and the element type is not allowed explicitly in the particular
2229 element content model.
2230 (html:a checker): New checker.
2231 (html:details, html:datagrid): New checkers.
2232 (html:legend): New checker.
2233
2234 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2235
2236 * ContentChecker.pm (html:li checker): Implemented.
2237
2238 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2239
2240 * ContentChecker.pm ($HTMLInlineOrStriclyInlineChecker): New
2241 checker.
2242 (html:dd checker): New checker.
2243 (html:q, html:em, html:strong, html:small,
2244 html:m, html:dfn, html:code, html:samp, html:span): New checkers.
2245
2246 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2247
2248 * ContentChecker.pm ($AnyChecker): Renamed
2249 from |$ElementDefault->{checker}|.
2250 ($ElementDefault->{checker}): Throw an error that
2251 the element type is not supported by the checker.
2252 ($HTMLMetadataElement): |html:base| was missing.
2253 ($HTMLEmptyChecker): Don't throw an error
2254 for inter-element whitespace nodes.
2255 (html:html checker): Errors were not
2256 thrown even if |html:head| and/or |html:body|
2257 children were missing.
2258 (html:head checker): An error was not
2259 thrown if <meta charset> appered after other
2260 elements.
2261
2262 2007-05-05 Wakaba <wakaba@suika.fam.cx>
2263
2264 * ContentChecker.pm: |footer|, |video|, |audio|, |script|,
2265 and |noscript| elements are implemented.
2266 (new): New method.
2267
2268 2007-05-04 Wakaba <wakaba@suika.fam.cx>
2269
2270 * ContentChecker.pm: New module.
2271
2272 2007-05-04 Wakaba <wakaba@suika.fam.cx>
2273
2274 * NanoDOM.pm (manakai_parent_element,
2275 document_element, manakai_local_name,
2276 manakai_element_type_match): New method.
2277
2278 2007-05-03 Wakaba <wakaba@suika.fam.cx>
2279
2280 * HTML.pm.src: Replace decimal and hexadecimal numeric
2281 entities in C1 range using Windows-1252 mapping. Bare LF
2282 did not count as new line for error reporting.
2283
2284 2007-05-02 Wakaba <wakaba@suika.fam.cx>
2285
2286 * NanoDOM.pm (DOMImplementation): New class.
2287 (append_child): Weaken the |parent_node| reference.
2288 (create_element_ns, Element new): Set the |owner_document|
2289 reference.
2290 (implementation): New attribute.
2291 (owner_document, local_name, namespace_uri): New attributes.
2292
2293 * HTML.pm.src (parse_string): Line and column numbers
2294 are now provided to error handler.
2295 (!!!parse-error): Short descriptions are added.
2296 (_construct_tree): Split into three methods; support
2297 for innerHTML mode.
2298 (set_inner_html): New method.
2299
2300 2007-05-01 Wakaba <wakaba@suika.fam.cx>
2301
2302 * NanoDOM.html: Documentation is added.
2303
2304 * HTML.pod, ContentType.html: Documentation is revised.
2305
2306 * .cvsignore: Pod2html temporary files are added.
2307
2308 * Makefile: Make |NanoDOM.html|.
2309
2310 2007-05-01 Wakaba <wakaba@suika.fam.cx>
2311
2312 * HTML.pm.src (parse_string): New method.
2313 (get_inner_html): Renamed from |inner_html|.
2314
2315 * Makefile: A rule for |HTML.html| is added.
2316
2317 * HTML.pod: New documentation.
2318
2319 2007-05-01 Wakaba <wakaba@suika.fam.cx>
2320
2321 * NanoDOM.pm (last_child, previous_sibling): New attributes.
2322 (clone_node): Attribute nodes were not completely copied.
2323
2324 * HTML.pm.src: Many bugs are fixed.
2325
2326 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2327
2328 * HTML.pm.src: Some typos are fixed.
2329
2330 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2331
2332 * mkhtmlparser.pl, Makefile: References to the |HTML-consume-entity.src|
2333 are removed.
2334
2335 * HTML.pm.src: Tokenizer's handling on named entities are rewritten.
2336
2337 * HTML-consume-entity.src: Removed.
2338
2339 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2340
2341 * HTML.pm.src: Tokenizer's handling on hexadecimal
2342 numeric entities are rewritten.
2343
2344 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2345
2346 * HTML.pm.src: Some tokenizer bugs are fixed.
2347
2348 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2349
2350 * HTML.pm.src: The tree construction stage is implemented.
2351
2352 * mkhtmlparser.pl: New macros are added.
2353
2354 2007-04-28 Wakaba <wakaba@suika.fam.cx>
2355
2356 * ContentType.pm: A note on bug in the specification
2357 is removed since it's been now fixed.
2358
2359 * .cvsignore: New file.
2360
2361 2007-04-28 Wakaba <wakaba@suika.fam.cx>
2362
2363 * HTML.pm.src, HTML-consume-entity.src: New files.
2364
2365 * Makefile (HTML.pm): New rule.
2366
2367 * mkhtmlparser.pl: New script.
2368
2369 2007-04-25 Wakaba <wakaba@suika.fam.cx>
2370
2371 * Makefile: New file.
2372
2373 2007-04-24 Wakaba <wakaba@suika.fam.cx>
2374
2375 * ContentType.pm: An error in pod is fixed.
2376
2377 2007-04-24 Wakaba <wakaba@suika.fam.cx>
2378
2379 * ContentType.pm: Documentation is added.
2380
2381 2007-04-24 Wakaba <wakaba@suika.fam.cx>
2382
2383 * ContentType.pm: New Perl module.
2384
2385 * ChangeLog: New file.
2386
2387

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24