Contents of /markup/html/whatpm/Whatpm/ChangeLog


Revision 1.372
Thu Jul 2 22:08:36 2009 UTC (16 years ago) by wakaba
Branch: MAIN
Changes since 1.371: +5 -0 lines
1 2009-07-03 Wakaba <wakaba@suika.fam.cx>
2
3 * HTML.pm.src (parse_byte_stream): Don't change to another
4 encoding if the input is UTF-16 (HTML5 revision 3205).
5
6 2009-06-28 Wakaba <wakaba@suika.fam.cx>
7
8 * HTML.pm.src: |isindex| should no longer generate |p| element
9 node (HTML5 revision 3236).
10
11 2008-12-12 Wakaba <wakaba@suika.fam.cx>
12
13 * ContentChecker.pm: Introduced new flag |has_autofocus|. Added
14 support for new |id_type| value of |any|, which means any kind of
15 element is allowed as a referenced element.
16
17 2008-12-12 Wakaba <wakaba@suika.fam.cx>
18
19 * ContentChecker.pm: Introduced |check_attrs2| method to ease
20 defining code for checking required attributes and the like.
21
22 2008-12-12 Wakaba <wakaba@suika.fam.cx>
23
24 * IMTChecker.pm: Added more definitions for subtypes.
25
26 2008-12-11 Wakaba <wakaba@suika.fam.cx>
27
28 * URIChecker.pm: Some of |pos_end| values were wrong.
29
30 2008-12-06 Wakaba <wakaba@suika.fam.cx>
31
32 * ContentChecker.pm (check_element): Added support for "no
33 referenced datalist" error.
34
35 2008-12-06 Wakaba <wakaba@suika.fam.cx>
36
37 * URIChecker.pm: Bug fix: It did not work unless
38 Message::DOM::DOMImplementation has been |require|d.
39
40 2008-12-06 Wakaba <wakaba@suika.fam.cx>
41
42 * NanoDOM.pm (document_uri): New attribute.
43
44 * ContentChecker.pm: Don't use methods not implemented by NanoDOM.
45
46 2008-11-07 Wakaba <wakaba@suika.fam.cx>
47
48 * NanoDOM.pm (text_content): Don't create a Text node if the new
49 value is empty.
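The |text_content| setter behaviour described above can be sketched as follows. This is a minimal Python illustration, not NanoDOM's Perl code; the `Node` and `Text` classes and the `set_text_content` name are hypothetical stand-ins.

```python
class Text:
    """Hypothetical stand-in for a DOM Text node."""

    def __init__(self, data):
        self.data = data


class Node:
    """Hypothetical stand-in for a DOM node that can have children."""

    def __init__(self):
        self.child_nodes = []

    def set_text_content(self, value):
        # Setting the text content discards all existing children.
        self.child_nodes = []
        # Create a Text node only when there is something to store.
        if value is not None and value != "":
            self.child_nodes.append(Text(value))
```

With this shape, setting an empty value leaves the node with no children rather than with one empty Text node.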
50
51 2008-11-06 Wakaba <wakaba@suika.fam.cx>
52
53 * SWML/: New directory.
54
55 2008-10-20 Wakaba <wakaba@suika.fam.cx>
56
57 * NanoDOM.pm (specified, all_declarations_processed,
58 manakai_attribute_type): New attributes.
59
60 2008-10-19 Wakaba <wakaba@suika.fam.cx>
61
62 * NanoDOM.pm (Entity->new): Initialize ->child_nodes as an empty
63 array.
64
65 2008-10-19 Wakaba <wakaba@suika.fam.cx>
66
67 * NanoDOM.pm (notation_name): New attribute.
68
69 2008-10-18 Wakaba <wakaba@suika.fam.cx>
70
71 * NanoDOM.pm (public_id, system_id): New attributes.
72
73 2008-10-18 Wakaba <wakaba@suika.fam.cx>
74
75 * NanoDOM.pm (text_content): Moved to Node from Element. Setter
76 implemented.
77 (allowed_tokens, default_type, declared_type): Implemented.
78
79 2008-10-17 Wakaba <wakaba@suika.fam.cx>
80
81 * NanoDOM.pm (node_name): New attribute.
82 (ELEMENT_TYPE_DEFINITION_NODE, ATTRIBUTE_DEFINITION_NODE): New
83 constants.
84 (create_element_type_definition_node, create_attribute_definition,
85 create_notation, create_general_entity,
86 get_element_type_definition_node,
87 set_element_type_definition_node, get_general_entity_node,
88 set_general_entity_node, get_notation_node, set_notation_node,
89 get_attribute_definition_node, set_attribute_definition_node): New
90 methods.
91 (element_types, entities, notations, attribute_definitions): New
92 attributes.
93 (DocumentType): Support for child nodes, entities, notations, and
94 element types.
95 (Entity, Notation, ElementTypeDefinition, AttributeDefinition):
96 New classes.
97
98 * Dumper.pm: Support for general entities, notations, element type
99 definitions, and attribute definitions.
100
101 2008-10-15 Wakaba <wakaba@suika.fam.cx>
102
103 * NanoDOM.pm (create_processing_instruction): New method.
104 (xml_version, xml_encoding, xml_standalone): New attributes.
105 (ProcessingInstruction): New class.
106
107 2008-10-14 Wakaba <wakaba@suika.fam.cx>
108
109 * HTML.pm.src: Handling of end tags in the foreign content
110 insertion mode was partially wrong, because of wrong bit
111 operations.
112
113 2008-10-14 Wakaba <wakaba@suika.fam.cx>
114
115 * NanoDOM.pm (dom_config): New attribute (do nothing), for
116 Whatpm::XML::Parser support.
117
118 2008-10-14 Wakaba <wakaba@suika.fam.cx>
119
120 * Makefile: New rule to make HTML/Tokenizer.pm is added.
121
122 * HTML.pm.src: Tokenizer part moved to another file.
123
124 2008-10-13 Wakaba <wakaba@suika.fam.cx>
125
126 * HTML.pm.src: Merge |DT_EL| and |DD_EL| as |DTDD_EL|.
127
128 2008-10-13 Wakaba <wakaba@suika.fam.cx>
129
130 * HTML.pm.src: Element category constants redefined.
131
132 2008-10-13 Wakaba <wakaba@suika.fam.cx>
133
134 * HTML.pm.src: Steps for CDATA/RCDATA elements in tree
135 construction stage synced with the spec (HTML5 revisions 2139 and
136 2302).
137
138 2008-10-07 Wakaba <wakaba@suika.fam.cx>
139
140 * ContentChecker.pm: New error level "html5_fact" added, which
141 should be tentatively used until all of requirements are properly
142 specced as RFC 2119 "MUST" in HTML5.
143
144 2008-10-05 Wakaba <wakaba@suika.fam.cx>
145
146 * ContentChecker.pod: Note on internal flags is added.
147
148 2008-10-05 Wakaba <wakaba@suika.fam.cx>
149
150 * HTML.pm.src: An AAA bug fixed.
151
152 2008-10-04 Wakaba <wakaba@suika.fam.cx>
153
154 * HTML.pm.src: If another node is inserted by the parser, don't
155 reuse existing Text node to append a character (HTML5 revision
156 2124).
157
158 2008-10-04 Wakaba <wakaba@suika.fam.cx>
159
160 * HTML.pm.src: Support for <option> and <optgroup> in body (HTML5
161 revisions 1731 and 2128).
162
163 2008-10-04 Wakaba <wakaba@suika.fam.cx>
164
165 * HTML.pm.src: Make <foreignObject> scoping (HTML5 revision 1837).
166 Support for end tags of camelCase SVG elements was broken. A
167 wrong error type text fixed.
168
169 2008-10-04 Wakaba <wakaba@suika.fam.cx>
170
171 * HTML.pm.src: Drop redundant code (HTML5 revision 1731).
172
173 2008-10-04 Wakaba <wakaba@suika.fam.cx>
174
175 * HTML.pm.src: Support for new definition of |param| and |source|
176 start tag parsing (HTML5 revision 1731).
177
178 2008-10-04 Wakaba <wakaba@suika.fam.cx>
179
180 * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
181
182 2008-10-04 Wakaba <wakaba@suika.fam.cx>
183
184 * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
185 revisions 1731 and 1831).
186
187 2008-10-04 Wakaba <wakaba@suika.fam.cx>
188
189 * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
190 revisions 1731 and 1778). Support for the </sarcasm> end tag (HTML5
191 revision 1731).
192
193 2008-10-04 Wakaba <wakaba@suika.fam.cx>
194
195 * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
196 revision 1731). End tags of |option| and |optgroup| elements are
197 now optional (HTML5 revision 1731).
198
199 2008-10-04 Wakaba <wakaba@suika.fam.cx>
200
201 * HTML.pm.src: New "special" elements added to the list (HTML5
202 revision 1778). "strile" -> "strike".
203
204 2008-10-02 Wakaba <wakaba@suika.fam.cx>
205
206 * ContentType.pm (get_sniffed_type): Support for the "better"
207 content sniffing (HTML5 revision 1927). In one case the official
208 type was not returned when the method was invoked in list
209 context.
210
211 2008-09-22 Wakaba <wakaba@suika.fam.cx>
212
213 * HTML.pm.src: Character references for non-space C0 characters,
214 including U+000B VT, DEL character, noncharacter code points, are
215 now converted to the U+FFFD character (cf. HTML5 revision 2138).
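The mapping described above can be sketched in Python as follows. This is an illustration of the entry's wording only, not the parser's code: the function names are hypothetical, U+0000 is assumed to count as a non-space C0 character, and the input is assumed to be a valid code point (surrogates and out-of-range values are not handled here).

```python
REPLACEMENT = "\uFFFD"


def is_noncharacter(cp):
    # Unicode noncharacters: U+FDD0..U+FDEF plus the last two
    # code points of every plane (U+xxFFFE and U+xxFFFF).
    return 0xFDD0 <= cp <= 0xFDEF or (cp & 0xFFFE) == 0xFFFE


def resolve_numeric_reference(cp):
    """Map the value of a numeric character reference to a character."""
    # C0 controls other than TAB, LF, FF, and CR; U+000B VT is included.
    non_space_c0 = cp <= 0x1F and cp not in (0x09, 0x0A, 0x0C, 0x0D)
    if non_space_c0 or cp == 0x7F or is_noncharacter(cp):
        return REPLACEMENT
    return chr(cp)
```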
216
217 2008-09-21 Wakaba <wakaba@suika.fam.cx>
218
219 * ContentChecker.pm: |form=""| check support added.
220
221 2008-09-21 Wakaba <wakaba@suika.fam.cx>
222
223 * ContentChecker.pm: |contextmenu| validness is now checked using
224 |id| and |id_type| properties, and |menu| property is removed.
225
226 2008-09-21 Wakaba <wakaba@suika.fam.cx>
227
228 * ContentChecker.pm: Prepare for |form| |name| attribute's
229 duplication checking.
230
231 2008-09-21 Wakaba <wakaba@suika.fam.cx>
232
233 * HTML.pm.src (parse_byte_stream): Support (or non-support) for
234 unsupported charset="" parameter value (HTML5 revision 2131).
235
236 2008-09-20 Wakaba <wakaba@suika.fam.cx>
237
238 * HTML.pm.src: Remaining places where U+000B was allowed as a space
239 character are fixed (cf. HTML5 revision 1738).
240
241 * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
242 space characters (HTML5 revision 1738).
243
244 2008-09-20 Wakaba <wakaba@suika.fam.cx>
245
246 * HTML.pm.src: The "anything else" case for the "after after body"
247 insertion mode was not updated to switch to the "in body"
248 insertion mode. U+000B is no longer a space character for the
249 purpose of tree construction phase (HTML5 revision 1738).
250
251 2008-09-20 Wakaba <wakaba@suika.fam.cx>
252
253 * HTML.pm.src: U+000B is no longer a space character (HTML5
254 revision 1738).
255
256 2008-09-20 Wakaba <wakaba@suika.fam.cx>
257
258 * ContentType.pm: 0x0B is no longer a space character (HTML5
259 revision 1738).
260
261 * HTML.pm.src: U+000B is no longer a space character for the
262 algorithm for extracting an encoding from a Content-Type (HTML5
263 revision 1738).
264
265 2008-09-20 Wakaba <wakaba@suika.fam.cx>
266
267 * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
268
269 2008-09-18 Wakaba <wakaba@suika.fam.cx>
270
271 * LangTag.pm: Add checks for remaining requirements from RFC 4646.
272
273 * mklangreg.pl: Sort 'Prefix' values by their length, to ease
274 matching.
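Why sorting 'Prefix' values by length helps matching can be sketched as follows. The entry does not say which direction the sort uses; this Python sketch assumes longest first, so that the first hit is the most specific one. The function names are hypothetical and the code is not taken from mklangreg.pl.

```python
def sort_prefixes(prefixes):
    # Longest first (assumed order); ties are broken alphabetically
    # so that the output is deterministic.
    return sorted(prefixes, key=lambda p: (-len(p), p))


def first_matching_prefix(tag, sorted_prefixes):
    """Return the first prefix matching `tag` on a subtag boundary."""
    tag = tag.lower()
    for prefix in sorted_prefixes:
        p = prefix.lower()
        if tag == p or tag.startswith(p + "-"):
            return prefix
    return None
```

Because the list is ordered by decreasing length, a caller can stop at the first match instead of collecting all matches and comparing them.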
275
276 2008-09-18 Wakaba <wakaba@suika.fam.cx>
277
278 * LangTag.pm: Warn for private use language subtags. Error level
279 typos fixed. Support for Suppress-Script field.
280
281 * mklangreg.pl: Support for dumping of nested structure.
282
283 2008-09-18 Wakaba <wakaba@suika.fam.cx>
284
285 * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
286 recommended case as per RFC 4646.
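A case check of this kind can be sketched as follows. The Python code uses the common language tag convention (lowercase everywhere, except that two-letter subtags are uppercase and four-letter subtags are title case when they are neither first nor after a singleton). Whether |check_rfc4646_langtag| applies exactly this rule is an assumption, and the function names are hypothetical.

```python
def recommended_case(tag):
    """Return `tag` rewritten in the conventional letter case."""
    out = []
    after_singleton = False
    for i, sub in enumerate(tag.split("-")):
        special = i > 0 and not after_singleton and sub.isalpha()
        if special and len(sub) == 2:
            out.append(sub.upper())        # region, e.g. "US"
        elif special and len(sub) == 4:
            out.append(sub.capitalize())   # script, e.g. "Hant"
        else:
            out.append(sub.lower())
        if len(sub) == 1:
            # Everything after a singleton ("x", "u", ...) stays lowercase.
            after_singleton = True
    return "-".join(out)


def is_recommended_case(tag):
    return tag == recommended_case(tag)
```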
287
288 2008-09-18 Wakaba <wakaba@suika.fam.cx>
289
290 * LangTag.pm (check_rfc4646_langtag): New method.
291
292 2008-09-18 Wakaba <wakaba@suika.fam.cx>
293
294 * mklangreg.pl: New script.
295
296 * Makefile: Updated for creation of the module for language subtag
297 registry.
298
299 2008-09-16 Wakaba <wakaba@suika.fam.cx>
300
301 * Makefile: WebIDL.html added.
302
303 * WebIDL.pod: New documentation.
304
305 2008-09-16 Wakaba <wakaba@suika.fam.cx>
306
307 * WebIDL.pm: Checker's error types are redefined.
308
309 2008-09-16 Wakaba <wakaba@suika.fam.cx>
310
311 * WebIDL.pm: Parser's error types are redefined. Some forward
312 compatible parsing bugs are fixed. Some unreachable code is
313 commented out.
314
315 2008-09-16 Wakaba <wakaba@suika.fam.cx>
316
317 * WebIDL.pm: Support for the remaining extended attributes is
318 added. It does not satisfy the definition that a forward
319 interface declaration has an extended attribute. It seems that,
320 unless explicitly allowed, multiple extended attributes with the
321 same name are not allowed, though it is not explicitly mentioned in
322 the spec.
323
324 2008-09-16 Wakaba <wakaba@suika.fam.cx>
325
326 * WebIDL.pm: Unescapes extended attribute names and extended
327 attribute identifiers. Preserve whether an extended attribute has
328 an argument list or not. Support for extended attributes:
329 Constructor, ExceptionConsts, IndexGetter, IndexSetter,
330 NameGetter, NameSetter, and Null.
331 (has_argument_list): New attribute.
332 (idl_text): Stringifies argument lists, if any, even if they are
333 empty.
334
335 2008-09-15 Wakaba <wakaba@suika.fam.cx>
336
337 * HTML.pm.src: New state |PCDATA_STATE|. Use an empty string for
338 |{s_kwd}| in DATA_STATE as default.
339
340 2008-09-15 Wakaba <wakaba@suika.fam.cx>
341
342 * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
343 by |{s_kwd}| in DATA_STATE.
344
345 2008-09-15 Wakaba <wakaba@suika.fam.cx>
346
347 * HTML.pm.src: Shorten keys.
348
349 2008-09-15 Wakaba <wakaba@suika.fam.cx>
350
351 * HTML.pm.src: Remove checking for control character, surrogate
352 pair, or noncharacter code points and non-Unicode code
353 points (they should be handled by Whatpm::Charset::UnicodeChecker).
354 (parse_char_stream): Support for the |$get_wrapper| argument and
355 character stream error handlers.
356
357 2008-09-15 Wakaba <wakaba@suika.fam.cx>
358
359 * ContentChecker.pm: Don't call |load_ns_module|
360 for null-namespace elements/attributes.
361
362 * HTML.pm.src: Factor out $disallowed_control_chars
363 as a hash.
364
365 2008-09-14 Wakaba <wakaba@suika.fam.cx>
366
367 * HTML.pm.src: Regexp typo fixed. |{prev_char}|
368 and |{next_char}| initializations are moved to initialization
369 method. |{read_until}| now supports buffering. Sync |set_inner_html|
370 with |parse_char_stream|.
371
372 2008-09-14 Wakaba <wakaba@suika.fam.cx>
373
374 * HTML.pm.src (parse_char_stream): Make |set_next_char|
375 invoke |manakai_read_until|, not only |read|, where
376 possible, to decrease the number of |read| method calls.
377
378 * mkhtmlparser.pl: Related changes to the aforementioned
379 modification.
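The saving described above, fetching a whole run of characters in one call instead of one character per call, can be sketched as follows. This Python code is not the |manakai_read_until| interface; `CharSource`, `read_char`, `read_while`, and `PLAIN_TEXT` are hypothetical names used only to show the idea.

```python
import re


class CharSource:
    """Hypothetical character source that counts read-style calls."""

    def __init__(self, text):
        self._text = text
        self._pos = 0
        self.calls = 0

    def read_char(self):
        """Return the next character, or "" at the end of input."""
        self.calls += 1
        ch = self._text[self._pos:self._pos + 1]
        self._pos += len(ch)
        return ch

    def read_while(self, allowed):
        """Return the longest run at the current position matching `allowed`."""
        self.calls += 1
        m = allowed.match(self._text, self._pos)
        run = m.group(0) if m else ""
        self._pos += len(run)
        return run


# Characters that need no special handling in a data-like state.
PLAIN_TEXT = re.compile(r"[^<&]*")
```

A tokenizer that calls `read_while(PLAIN_TEXT)` consumes an entire text run with one call and falls back to `read_char` only for the characters that need state changes.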
380
381 2008-09-14 Wakaba <wakaba@suika.fam.cx>
382
383 * HTML.pm.src: Use |read| instead of |getc|. |set_inner_html|
384 would report character error from now.
385
386 2008-09-14 Wakaba <wakaba@suika.fam.cx>
387
388 * HTML.pm.src: Non-white-space character tokens with leading
389 white space in the "before head" insertion mode were not
390 correctly handled.
391 (set_inner_html): Reimplemented using CharString decodehandle
392 class. Support for $get_wrapper argument. Support
393 for |{read_until}| feature.
394
395 2008-09-14 Wakaba <wakaba@suika.fam.cx>
396
397 * HTML.pm.src: Make a "bare ero" error for unknown
398 entities point the "&" character.
399
400 2008-09-14 Wakaba <wakaba@suika.fam.cx>
401
402 * HTML.pm.src: It turns out that U+FFFD don't have to
403 be added to the list of excluded characters.
404
405 2008-09-14 Wakaba <wakaba@suika.fam.cx>
406
407 * HTML.pm.src ($char_onerror): Have character decoder's |line|
408 and |column| a higher priority than the one set by the
409 tokenizer's input handler.
410 ($self->{read_until}): Exclude U+FFFD (but this might
411 not be necessary, since now we do line/column fixup in
412 the character decode handle).
413
414 2008-09-14 Wakaba <wakaba@suika.fam.cx>
415
416 * HTML.pm.src: Use |{read_until}| where possible.
417
418 2008-09-14 Wakaba <wakaba@suika.fam.cx>
419
420 * HTML.pm.src: Change |{getc_until}| to |{read_until}|
421 and |manakai_getc_until| to |manakai_read_until| to
422 reduce the number of string copies.
423
424 2008-09-14 Wakaba <wakaba@suika.fam.cx>
425
426 * HTML.pm.src (parse_char_string): Use newly created
427 |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
428 standard feature to |open| a string as a filehandle,
429 since Perl's string filehandle does not seem to support the |ungetc|
430 method correctly.
431 (parse_char_stream): Define |{getc_until}| method.
432 (DATA_STATE): Experimental support for |getc_until| feature.
433
434 2008-09-13 Wakaba <wakaba@suika.fam.cx>
435
436 * HTML.pm.src: Check points added to newly added branches.
437
438 2008-09-13 Wakaba <wakaba@suika.fam.cx>
439
440 * HTML.pm.src: Remove |{char}|, which is no longer used.
441 Replace |{entity_in_attr}| and |{last_attribute_value_state}|
442 with |{prev_state}|.
443
444 * mkhtmlparser.pl: Remove |{char}| feature.
445 Remove |!!!back-next-input-character;| macro.
446
447 2008-09-13 Wakaba <wakaba@suika.fam.cx>
448
449 * HTML.pm.src: Finally we get rid of all the inner loops. Remove
450 entity related tokenizer states in favor of new states
451 implementing the consume character reference algorithm.
452
453 2008-09-13 Wakaba <wakaba@suika.fam.cx>
454
455 * HTML.pm.src: "Consume a character reference" algorithm is
456 now implemented as a tokenizer's state, rather than
457 a method, with minimum changes (more changes will
458 be made, in due course). "Bogus comment state"'s inner
459 loop gets removed.
460
461 2008-09-13 Wakaba <wakaba@suika.fam.cx>
462
463 * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
464 into their own tokenizer states.
465
466 2008-09-13 Wakaba <wakaba@suika.fam.cx>
467
468 * HTML.pm.src: |CDATA_SECTION_STATE| (formerly |CDATA_BLOCK_STATE|)
469 is split into three states.
470
471 2008-09-13 Wakaba <wakaba@suika.fam.cx>
472
473 * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
474 itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
475 no longer does the tokenizer have to push back next input
476 characters in those states.
477
478 2008-09-13 Wakaba <wakaba@suika.fam.cx>
479
480 * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
481 into four states so that no longer does the tokenizer have to push
482 back next input characters in that state.
483
484 2008-09-11 Wakaba <wakaba@suika.fam.cx>
485
486 * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
487 which can be used to insert some wrapper between the character
488 stream handle and the tokenizer. (It is currently not supported
489 for |set_inner_html| for |Element|s).
490
491 2008-09-10 Wakaba <wakaba@suika.fam.cx>
492
493 * HTML.pm.src: Ignore punctuation in charset names.
494
495 2008-09-10 Wakaba <wakaba@suika.fam.cx>
496
497 * ContentChecker.pm: Support for charset-layer error levels.
498
499 * HTML.pm.src: Don't specify |text| argument for the
500 |chardecode:fallback| error, since it is not the encoding
501 being used alternatively.
502
503 2008-09-06 Wakaba <wakaba@suika.fam.cx>
504
505 * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
506
507 2008-08-31 Wakaba <wakaba@suika.fam.cx>
508
509 * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
510
511 2008-08-31 Wakaba <wakaba@suika.fam.cx>
512
513 * HTML.pm.src: Bug fix and sync with the spec with regard
514 to after after frameset insertion mode processing (HTML5
515 revision 1909). Note that the implementation was wrong
516 per the old spec before the r1909 changes.
517
518 2008-08-30 Wakaba <wakaba@suika.fam.cx>
519
520 * HTMLTable.pm: scope=auto algorithm fix synced with the
521 spec (HTML5 revision 2093).
522 ($process_row): Algorithm step numbers synced with the
523 spec (HTML5 revision 2092).
524
525 2008-08-30 Wakaba <wakaba@suika.fam.cx>
526
527 * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
528 revision 2094).
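The difference between the Zs general category and the White_Space property can be sketched in Python as follows. The `WHITE_SPACE` set lists the code points that carry the property in current Unicode data, which may differ slightly from the Unicode version in use in 2008; the helper names are hypothetical.

```python
import unicodedata

# Code points with the Unicode White_Space property (current data).
WHITE_SPACE = frozenset(
    list(range(0x0009, 0x000E))            # TAB, LF, VT, FF, CR
    + [0x0020, 0x0085, 0x00A0, 0x1680]
    + list(range(0x2000, 0x200B))          # EN QUAD .. HAIR SPACE
    + [0x2028, 0x2029, 0x202F, 0x205F, 0x3000]
)


def is_zs(ch):
    """True if `ch` is in general category Zs (space separator)."""
    return unicodedata.category(ch) == "Zs"


def is_white_space(ch):
    """True if `ch` has the White_Space property."""
    return ord(ch) in WHITE_SPACE
```

Zs covers only space separators, so line breaks and TAB fall outside it even though they are white space in the property sense.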
529
530 2008-08-30 Wakaba <wakaba@suika.fam.cx>
531
532 * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
533
534 2008-08-30 Wakaba <wakaba@suika.fam.cx>
535
536 * HTML.pm.src: '"' and "'" at the end of attribute
537 name (after another attribute) now raise parse error (HTML5
538 revision 2123). Empty unquoted attribute values are no
539 longer allowed (HTML5 revision 2122).
540
541 2008-08-30 Wakaba <wakaba@suika.fam.cx>
542
543 * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
544 revision 2130).
545
546 2008-08-30 Wakaba <wakaba@suika.fam.cx>
547
548 * ContentChecker.pm: |xml:lang| attribute value must be same
549 as |lang| attribute value for HTML elements (HTML5 revision 2062
550 and so on).
551
552 2008-08-30 Wakaba <wakaba@suika.fam.cx>
553
554 * ContentChecker.pm: Error level definition for |xml_id_error|
555 was missing.
556
557 * URIChecker.pm: The end of the URL should be marked as the
558 error location for an empty path error. The position
559 between the userinfo and the port components should be
560 marked as the error location for an empty host error.
561
562 2008-08-30 Wakaba <wakaba@suika.fam.cx>
563
564 * URIChecker.pm: Set parameters representing where in the
565 value the error occurs for errors. Report unknown
566 address format error in warning level, since address
567 formats are rarely added. Path segments starting with "/.."
568 were misinterpreted as a dot-segment.
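The dot-segment fix above can be illustrated as follows. This Python sketch is not URIChecker.pm's code: `naive_has_dot_segment` shows one plausible way such a misinterpretation can arise (a substring test), and `dot_segments` compares whole segments instead. Both names are hypothetical.

```python
def naive_has_dot_segment(path):
    # Substring test: also flags ordinary segments such as "..foo".
    return "/.." in path or "/." in path


def dot_segments(path):
    """Return the segments of `path` that are exactly "." or ".."."""
    return [seg for seg in path.split("/") if seg in (".", "..")]
```

Only a segment that is exactly "." or ".." is a dot-segment, so "/a/..foo/b" contains none.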
569
570 2008-08-30 Wakaba <wakaba@suika.fam.cx>
571
572 * URIChecker.pm (check_iri_reference): Requires
573 |Message::DOM::DOMImplementation|.
574
575 2008-08-29 Wakaba <wakaba@suika.fam.cx>
576
577 * IMTChecker.pm: Updated for the new error reporting architecture.
578
579 * ContentChecker.pm: Error levels for IMTs are added.
580
581 2008-08-17 Wakaba <wakaba@suika.fam.cx>
582
583 * H2H.pm (_shift_token): Support for unquoted HTML attribute
584 values.
585
586 2008-08-16 Wakaba <wakaba@suika.fam.cx>
587
588 * CacheManifest.pm: Support for new style of error
589 reports.
590
591 * HTML.pm.src: Set line=1, column=1 to the document node.
592
593 2008-08-16 Wakaba <wakaba@suika.fam.cx>
594
595 * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
596 and URL checkers. Support for more error levels for bogus
597 language tag and URL "standards".
598
599 * LangTag.pm, URIChecker.pm: Support for new style error
600 level reporting.
601
602 2008-08-15 Wakaba <wakaba@suika.fam.cx>
603
604 * ContentChecker.pm: Support for RDF/XML error levels.
605
606 * HTMLTable.pm, RDFXML.pm: Support for new style of error level
607 specifying. Error types are revised.
608
609 2008-08-15 Wakaba <wakaba@suika.fam.cx>
610
611 * ContentChecker.pm: All error reporting method calls are
612 renewed.
613
614 2008-08-15 Wakaba <wakaba@suika.fam.cx>
615
616 * HTML.pm.src: All error type names and "text" parameters
617 are revised. Use new style for "level" specification.
618
619 * mkhtmlparser.pl: Use new style for "level" specification.
620
621 2008-08-03 Wakaba <wakaba@suika.fam.cx>
622
623 * WebIDL.pm (parse_char_string): Simplified error
624 reporting process for broken ignored valuetype definition.
625 (Valuetype idl_text): Support for special "DOMString" name.
626
627 2008-08-03 Wakaba <wakaba@suika.fam.cx>
628
629 * WebIDL.pm ($get_scoped_name): Append "::::" if the last
630 terminal of the ScopedName is "DOMString", so that whether
631 the last part of the scoped name is "DOMString" or "_DOMString"
632 can be determined later. It is necessary to determine whether a |typedef|
633 definition should be ignored or not.
634 (parse_char_string): Unescape the identifier of
635 exception members.
636 ($resolve): Return undef for builtin types and sequence<T>
637 types (we might not have to do this, however...).
638 (check): Support checking for Exceptions, Valuetypes,
639 and Typedefs.
640 ($serialize_type): Support for "DOMString::::" syntax.
641 (Typedef idl_text): Output Type as "DOMString" if it
642 is really "DOMString" (i.e. its internal representation
643 is "::DOMString::").
644
645 2008-08-03 Wakaba <wakaba@suika.fam.cx>
646
647 * WebIDL.pm ($resolve): New code, based on resolve code
648 for constant types in the |check| method.
649 (check): Support for checking of attributes, operations, and
650 arguments.
651 (Attribute/Operation idl_text): Exception names in getraises,
652 setraises, and raises clauses are serialized by |$serialize_type|
653 code.
654
655 2008-08-02 Wakaba <wakaba@suika.fam.cx>
656
657 * WebIDL.pm ($integer): Order of alternatives is changed to match
658 hexadecimal numbers (the original pattern, taken from the spec,
659 did not work for hexadecimal numbers, because after the "0" prefix
660 the [0-7]* part matches an empty string and therefore
661 the remaining "x..." part of a "0x..." integer literal is
662 not matched).
663 ($get_type): It now returns a string, not an array reference,
664 for regular types and |sequence| types (i.e. it in any case
665 returns a string).
666 ($get_next_token): The second item in the array that represents
667 an integer or float token is now a Perl number value, not the
668 original string representation of the number.
669 (check): Support for const value consistency checking.
670 No extended attribute is defined for constants.
671 (Node subclasses): Use simple strings rather than array references
672 for default data type values.
673 ($serialize_type): Type values are now simple strings.
674 (value): If the new attribute value is a false value, then
675 a FALSE value is set to the attribute.
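The |$integer| ordering problem in the entry above can be reproduced with any leftmost-first regular expression engine. The Python patterns below are illustrative only and are not copied from the spec or from WebIDL.pm; `OCTAL_FIRST`, `HEX_FIRST`, and `leading_integer` are hypothetical names.

```python
import re

# Octal alternative first: on "0x1F" the match succeeds with just "0",
# because [0-7]* is satisfied by an empty string.
OCTAL_FIRST = re.compile(r"-?0(?:[0-7]*|[Xx][0-9A-Fa-f]+)|-?[1-9][0-9]*")

# Hexadecimal alternative first: the whole literal is consumed.
HEX_FIRST = re.compile(r"-?0(?:[Xx][0-9A-Fa-f]+|[0-7]*)|-?[1-9][0-9]*")


def leading_integer(pattern, text):
    """Return the integer literal matched at the start of `text`, or None."""
    m = pattern.match(text)
    return m.group(0) if m else None
```

A tokenizer that strips the matched prefix would be left with "x1F" under the first ordering, which is why trying the hexadecimal alternative first matters.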
676
677 2008-08-02 Wakaba <wakaba@suika.fam.cx>
678
679 * WebIDL.pm ($get_scoped_name): Now scoped names are stored
680 in its stringified format ("scoped name" as defined in the
681 spec). Note that future version of this module should not use
682 array references for type values and the |type_text| attribute
683 should be made obsolete.
684 (parse_char_string): Unescape attribute names.
685 (check): Support for checking of whether inherited interfaces
686 are actually defined or not. Support for checking of whether
687 interface member identifiers are duplicated or not.
688 ($serialize_type): Scoped names are returned as is. A future
689 version of this code should escape identifiers other than "DOMString",
690 otherwise the idl_text would be non-conforming.
691
692 2008-08-02 Wakaba <wakaba@suika.fam.cx>
693
694 * WebIDL.pm (parse_char_string): Set line/column numbers
695 to generated nodes. Unescape identifiers. Extended attributes
696 for Definition's were ignored.
697 (append_child): Set |parent_node| attribute.
698 (parent_node): New attribute.
699 (check): Support interface/exception members. Support
700 extended attributes. Support definition identifier uniqueness
701 constraint.
702 (qualified_name): New attribute.
703 (Interface/Exception idl_text): Extended attributes were
704 not prepended to the returned text.
705
706 2008-08-02 Wakaba <wakaba@suika.fam.cx>
707
708 * WebIDL.pm (parse_char_string): Set line/column numbers
709 to interface object experimentally. s/shift/pop/g, shift
710 would make things wrong. Support for interface forward
711 declarations was missing. Broken interface declarations
712 with no block were not ignored entirely.
713 (Whatpm::WebIDL::Node): New abstract class. This class
714 makes things easier.
715 (child_nodes): New attribute. Unlike DOM's attribute with
716 same name, this attribute returns a dead list of nodes for
717 simplicity.
718 (get_user_data, set_user_data): New methods.
719 (Module idl_text): A SPACE character should be inserted
720 before the |{| character.
721 (Interface idl_text): Support for interface forward declarations.
722 (is_forward_declaration): New attribute.
723
724 2008-07-19 Wakaba <wakaba@suika.fam.cx>
725
726 * WebIDL.pm (type_text): Better serializer.
727
728 2008-07-19 Wakaba <wakaba@suika.fam.cx>
729
730 * WebIDL.pm: Revise forward-compatible parsing so that
731 it now can handle broken extended attributes and the like.
732
733 2008-07-19 Wakaba <wakaba@suika.fam.cx>
734
735 * WebIDL.pm: Real support for extended attributes.
736 Support for extended attributes with arguments.
737
738 2008-07-19 Wakaba <wakaba@suika.fam.cx>
739
740 * WebIDL.pm: Support for |exception| syntax.
741 (Interface->idl_text): Tentative support for inheritances.
742
743 2008-07-19 Wakaba <wakaba@suika.fam.cx>
744
745 * WebIDL.pm: Hierarchical scoped name support was broken.
746 Support for raises, setraises, and getraises syntaxes.
747
748 2008-07-18 Wakaba <wakaba@suika.fam.cx>
749
750 * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
751 proper support for types, extended attributes, and exceptions yet).
752 WebIDL parser, version 1 (no support for exceptions yet,
753 no proper support for extended attributes yet).
754
755 2008-07-09 Wakaba <wakaba@suika.fam.cx>
756
757 * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
758
759 2008-06-29 Wakaba <wakaba@suika.fam.cx>
760
761 * WebIDL.pm: Support for valuetype and const.
762
763 2008-06-29 Wakaba <wakaba@suika.fam.cx>
764
765 * WebIDL.pm: New module.
766
767 2008-06-15 Wakaba <wakaba@suika.fam.cx>
768
769 * Makefile (Entities.html): URI changed.
770
771 2008-06-08 Wakaba <wakaba@suika.fam.cx>
772
773 * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
774
775 2008-06-01 Wakaba <wakaba@suika.fam.cx>
776
777 * HTML.pm.src (_get_next_token): A parse error was missing.
778
779 2008-06-01 Wakaba <wakaba@suika.fam.cx>
780
781 * mklinktypelist.pl: rel=contact is no longer part of the HTML5
782 spec (commented out). (HTML5 revision 1711).
783
784 2008-05-25 Wakaba <wakaba@suika.fam.cx>
785
786 * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
787
788 * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
789 as UTF-16 (HTML5 revision 1701).
790
791 2008-05-25 Wakaba <wakaba@suika.fam.cx>
792
793 * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
794 1692).
795
796 2008-05-25 Wakaba <wakaba@suika.fam.cx>
797
798 * HTML.pm.src: The secondary insertion mode used when switching
799 to foreign content is the "in body" insertion mode (HTML5 revision
800 1696).
801
802 2008-05-25 Wakaba <wakaba@suika.fam.cx>
803
804 * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
805 revision 1697).
806
807 2008-05-24 Wakaba <wakaba@suika.fam.cx>
808
809 * HTML.pm.src: Support for end-of-file token in foreign content
810 insertion mode (HTML5 revision 1693). Update SVG camelCase
811 attribute list (HTML5 revision 1700). <textarea> closes
812 </select> (HTML5 revision 1699). More start tags close in
813 foreign content insertion mode (HTML5 revision 1698).
814
815 2008-05-24 Wakaba <wakaba@suika.fam.cx>
816
817 * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
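Charset extraction with ";" treated as a terminator can be sketched as follows. This Python code approximates the kind of algorithm HTML5 describes for extracting an encoding from a Content-Type value; it is not the parser's code, the step details are assumptions, and `extract_charset` is a hypothetical name.

```python
import re

SPACE = "\t\n\x0c\r "  # U+000B is deliberately not treated as a space
CHARSET = re.compile("charset", re.IGNORECASE | re.ASCII)


def extract_charset(content):
    """Return the charset name in a Content-Type-like string, or None."""
    m = CHARSET.search(content)
    if m is None:
        return None
    pos, n = m.end(), len(content)
    while pos < n and content[pos] in SPACE:
        pos += 1
    if pos >= n or content[pos] != "=":
        return None
    pos += 1
    while pos < n and content[pos] in SPACE:
        pos += 1
    if pos >= n:
        return None
    if content[pos] in "\"'":
        # Quoted value: it ends at the matching quote, if there is one.
        end = content.find(content[pos], pos + 1)
        return content[pos + 1:end] if end >= 0 else None
    end = pos
    # Unquoted value: it ends at a space character or ";".
    while end < n and content[end] not in SPACE and content[end] != ";":
        end += 1
    return content[pos:end] or None
```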
818
819 2008-05-24 Wakaba <wakaba@suika.fam.cx>
820
821 * HTML.pm.src: More robust charset parameter detection (HTML5
822 revision 1674).
823
824 2008-05-24 Wakaba <wakaba@suika.fam.cx>
825
826 * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
827 revision 1676).
828
829 2008-05-24 Wakaba <wakaba@suika.fam.cx>
830
831 * HTML.pm.src: Ignore language part of public identifiers for
832 quirks mode detection (HTML5 revision 1679).
833
834 2008-05-24 Wakaba <wakaba@suika.fam.cx>
835
836 * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
837 revision 1685).
838
839 2008-05-24 Wakaba <wakaba@suika.fam.cx>
840
841 * HTML.pm.src: Support for EOF in new states for tags (HTML5
842 revision 1684).
843
844 2008-05-24 Wakaba <wakaba@suika.fam.cx>
845
846 * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
847 work (HTML5 revision 1690).
848
849 2008-05-24 Wakaba <wakaba@suika.fam.cx>
850
851 * HTML.pm.src (_tree_construction_main): Change handling of
852 end tags in head insertion modes (HTML5 revision 1686).
853 (parse_char_string): Bug fix for non-utf8 character string handling.
854 (parse_char_stream): |ungetc| does not work well for this context.
855
856 2008-05-18 Wakaba <wakaba@suika.fam.cx>
857
858 * HTML.pm.src (parse_byte_string): Redefined to invoke
859 |parse_byte_stream|.
860 (parse_byte_stream): New method.
861
862 2008-05-18 Wakaba <wakaba@suika.fam.cx>
863
864 * HTML.pm.src (parse_byte_string): Fix the column number reported
865 by encoding layer error reporter.
866
867 2008-05-17 Wakaba <wakaba@suika.fam.cx>
868
869 * HTML.pm.src (parse_byte_string): Use streaming decoder
870 rather than converting the whole byte string and then parsing.
871 Propagate errors in character encoding layer.
872 (get_next_token): Precise error reporting for |bare stago| error.
873
874 2008-05-17 Wakaba <wakaba@suika.fam.cx>
875
876 * HTML.pm.src (parse_char_stream): New method.
877 (parse_char_string): This method is now defined as an invocation
878 of the |parse_char_stream| method.
879
880 2008-05-17 Wakaba <wakaba@suika.fam.cx>
881
882 * HTML.pm.src (parse_byte_string): Report various status
883 of the sniffing as info-level errors. Support for new
884 decoding framework in parser resetting.
885 (new): Various default error levels were not set.
886
887 2008-05-17 Wakaba <wakaba@suika.fam.cx>
888
889 * HTML.pm.src (parse_byte_string): HTML5 encoding sniffing
890 algorithm, except for the actual sniffing, is implemented
891 with new framework with Message::Charset::Info.
892
893 2008-05-16 Wakaba <wakaba@suika.fam.cx>
894
895 * CacheManifest.pm (_parse): Drop fragment identifiers from
896 URIs in fallback section (HTML5 revision 1596).
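Dropping a fragment identifier from a URI can be sketched with Python's standard library. This only illustrates the operation named in the entry above; `drop_fragment` is a hypothetical helper and not part of CacheManifest.pm.

```python
from urllib.parse import urldefrag


def drop_fragment(uri):
    """Return `uri` without its fragment identifier, if it has one."""
    return urldefrag(uri).url
```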
897
898 2008-05-10 Wakaba <wakaba@suika.fam.cx>
899
900 * Makefile (Entities.html): URI has changed.
901
902 2008-05-10 Wakaba <wakaba@suika.fam.cx>
903
904 * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
905 1553).
906
907 2008-05-06 Wakaba <wakaba@suika.fam.cx>
908
909 * ContentChecker.pm: Noted that those returned in |table| are
910 no longer table elements, but table objects returned
911 by Whatpm::HTMLTable.
912
913 * HTMLTable.pm (form_table): Return table element node
914 as |$table->{element}|.
915 (assign_header): Support for the |headers=""| attribute.
916
917 2008-05-06 Wakaba <wakaba@suika.fam.cx>
918
919 * HTMLTable.pm (assign_header): New function; first version
920 with no support for headers="".
921 (form_table): Include table width and height to the returned
922 table object for convenience. Indexing in column assignment was wrong.
923 Set whether a data cell is empty or not for convenience.
924
925 2008-05-05 Wakaba <wakaba@suika.fam.cx>
926
927 * HTMLTable.pm: More robust caption support (HTML5 revision 1393).
928
929 2008-05-05 Wakaba <wakaba@suika.fam.cx>
930
931 * HTMLTable.pm: How table model errors are detected is
932 changed (HTML5 revision 1387).
933
934 2008-05-05 Wakaba <wakaba@suika.fam.cx>
935
936 * HTMLTable.pm: The algorithm now moves |tfoot| elements
937 to the end of the table (HTML5 revision 1380).
938
939 2008-05-05 Wakaba <wakaba@suika.fam.cx>
940
941 * HTMLTable.pm: The algorithm is now 0-based indexing, instead
942 of 1-based (HTML5 revision 1376).
943
944 2008-05-05 Wakaba <wakaba@suika.fam.cx>
945
946 * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
947 and "Content-Encoding" no longer prevent sniffing (HTML5
948 revision 1288).
949
950 2008-05-05 Wakaba <wakaba@suika.fam.cx>
951
952 * ContentType.pm: Skip BOMs in feed or HTML algorithm (HTML5 revision
953 1282).
954
955 2008-05-03 Wakaba <wakaba@suika.fam.cx>
956
957 * ContentChecker.pm: Support for global attributes.
958 Status of XML specs are added.
959
960 2008-05-03 Wakaba <wakaba@suika.fam.cx>
961
962 * ContentChecker.pm (check_element): Support for |template=""|
963 and |ref=""| attribute (referent element type checking).
964
965 2008-04-29 Wakaba <wakaba@suika.fam.cx>
966
967 * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
968 1500) is implemented (except for IDNA part and URI-scheme-specific
969 knowledge). Line number counting was wrong for LF-only
970 documents.
971
972 2008-04-13 Wakaba <wakaba@suika.fam.cx>
973
974 * HTML.pm.src: Raise a parse error for any disallowed
975 character (HTML5 revision 1263).
976
977 2008-04-13 Wakaba <wakaba@suika.fam.cx>
978
979 * mkentitylist.pl: Support for new HTML5 entity table format (the
980 definition for |AElig;| was missing).
981
982 2008-04-13 Wakaba <wakaba@suika.fam.cx>
983
984 * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
985 name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
986
987 2008-04-13 Wakaba <wakaba@suika.fam.cx>
988
989 * HTML.pm.src: List of element names that close foreign content
990 insertion mode is added (HTML5 revisions 1412 and 1418).
991
992 2008-04-13 Wakaba <wakaba@suika.fam.cx>
993
994 * HTML.pm.src: Support for |mglyph| and |malignmark|
995 elements (HTML5 revision 1410).
996
997 2008-04-13 Wakaba <wakaba@suika.fam.cx>
998
999 * HTML.pm.src: Support for new long MathML entities (HTML5
1000 revision 1406).
1001
1002 2008-04-13 Wakaba <wakaba@suika.fam.cx>
1003
1004 * HTML.pm.src: CDATA section support for MathML and SVG
1005 elements (HTML5 revisions 1404 and 1420).
1006
1007 2008-04-12 Wakaba <wakaba@suika.fam.cx>
1008
1009 * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
1010 elements (HTML5 revision 1404). Unused !!!macro definitions
1011 are removed.
1012
1013 2008-04-12 Wakaba <wakaba@suika.fam.cx>
1014
1015 * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
1016 are raised is changed (HTML5 revision 1404).
1017
1018 2008-04-06 Wakaba <wakaba@suika.fam.cx>
1019
1020 * HTML.pm.src: Category sets that are no longer used are removed.
1021
1022 2008-04-06 Wakaba <wakaba@suika.fam.cx>
1023
1024 * HTML.pm.src: The ->[1] property of stack entries is now
1025 replaced by constants representing element category.
1026
1027 2008-04-06 Wakaba <wakaba@suika.fam.cx>
1028
1029 * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
1030 for error reporting. (This is a preparation for using constant
1031 value for ->[1].)
1032
1033 2008-03-22 Wakaba <wakaba@suika.fam.cx>
1034
1035 * RDFXML.pm: Typo fixed.
1036
1037 2008-03-22 Wakaba <wakaba@suika.fam.cx>
1038
1039 * ContentChecker.pm: |fact_level| is now treated
1040 the same as |must_level|, i.e. level = |m|.
1041 (check_element): Make list of URIs in the DOM.
1042
1043 2008-03-21 Wakaba <wakaba@suika.fam.cx>
1044
1045 * RDFXML.pm: Language accessor implemented.
1046 Local (null-namespace) attribute support.
1047
1048 2008-03-21 Wakaba <wakaba@suika.fam.cx>
1049
1050 * RDFXML.pm: Factored out ID checking code.
1051
1052 2008-03-21 Wakaba <wakaba@suika.fam.cx>
1053
1054 * RDFXML.pm: TODO items noted. Validation of ID
1055 and URI attributes is implemented. Warn if unknown
1056 value is used in rdf:parseType="" attribute.
1057
1058 * URIChecker.pm (check_rdf_uri_reference): New function.
1059
1060 2008-03-21 Wakaba <wakaba@suika.fam.cx>
1061
1062 * RDFXML.pm: bnodeid implemented. Relative references
1063 are now resolved.
1064
1065 2008-03-21 Wakaba <wakaba@suika.fam.cx>
1066
1067 * ContentChecker.pm: RDF reification implemented.
1068
1069 * RDFXML.pm: undef vs false bug fixed.
1070 Reification implemented.
1071
1072 2008-03-21 Wakaba <wakaba@suika.fam.cx>
1073
1074 * RDFXML.pm: s/id/ID/ for attribute name.
1075 The |node| arguments are added for |ontriple| calls.
1076 Too many "attribute not allowed" errors were raised.
1077
1078 * ContentChecker.pm: Initial experimental support for rdf:RDF
1079 element.
1080
1081 2008-03-21 Wakaba <wakaba@suika.fam.cx>
1082
1083 * RDFXML.pm: New module.
1084
1085 2008-03-20 Wakaba <wakaba@suika.fam.cx>
1086
1087 * HTML.pm.src (set_inner_html): Line/column number
1088 code was still the old one.
1089
1090 2008-03-20 Wakaba <wakaba@suika.fam.cx>
1091
1092 * HTML.pm.src: Better line/column reporting for "duplicate attribute"
1093 errors. Line/column markings for DOCTYPE, comment, and
1094 character tokens are reintroduced; otherwise,
1095 error location for "not HTML5" error and errors
1096 for implied elements are not attached.
1097
1098 2008-03-20 Wakaba <wakaba@suika.fam.cx>
1099
1100 * HTML.pm.src: Set line/column numbers to attributes.
1101
1102 * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
1103 (value): Setter implemented.
1104
1105 * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
1106
1107 2008-03-20 Wakaba <wakaba@suika.fam.cx>
1108
1109 * HTML.pm.src: Unused line/column markings are removed.
1110
1111 2008-03-20 Wakaba <wakaba@suika.fam.cx>
1112
1113 * HTML.pm.src (_get_next_token): Remove |first_start_tag|
1114 flag, which is no longer used.
1115
1116 2008-03-17 Wakaba <wakaba@suika.fam.cx>
1117
1118 * HTML.pm.src: Set line/column information to element nodes.
1119
1120 * mkhtmlparser.pl (!!!create-element, !!!insert-element,
1121 and !!!insert-element-t): Set line/column information to
1122 element nodes.
1123
1124 2008-03-17 Wakaba <wakaba@suika.fam.cx>
1125
1126 * HTML.pm.src (_get_next_token): The first "<" character
1127 in "<?", "<>", or "</>" should be the error point.
1128
1129 2008-03-16 Wakaba <wakaba@suika.fam.cx>
1130
1131 * HTML.pm.src: Some more fixes on error position reporting.
1132
1133 2008-03-16 Wakaba <wakaba@suika.fam.cx>
1134
1135 * HTML.pm.src: Token-level precise error reporting.
1136
1137 2008-03-16 Wakaba <wakaba@suika.fam.cx>
1138
1139 * HTML.pm.src: Preparation for more precise error point
1140 reporting.
1141
1142 2008-03-11 Wakaba <wakaba@suika.fam.cx>
1143
1144 * HTML.pm.src: Error type revised.
1145
1146 2008-03-11 Wakaba <wakaba@suika.fam.cx>
1147
1148 * HTML.pm.src: Similar codes are merged together, again.
1149
1150 2008-03-11 Wakaba <wakaba@suika.fam.cx>
1151
1152 * HTML.pm.src: Similar codes are merged together.
1153
1154 2008-03-10 Wakaba <wakaba@suika.fam.cx>
1155
1156 * mkhtmlparser.pl: Set "level" parameter to parse errors.
1157
1158 * HTML.pm.src: Code refined.
1159
1160 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1161
1162 * HTML.pm.src: |</body>| treatment has been changed (HTML5
1163 revision 1348). Note that I really don't know whether this makes
1164 any difference in the black-box behavior of the parser.
1165
1166 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1167
1168 * HTML.pm.src: New end-of-file token implementation (HTML5
1169 revision 1348).
1170
1171 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1172
1173 * HTML.pm.src: |applet| support (HTML5 revision 1347).
1174
1175 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1176
1177 * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
1178
1179 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1180
1181 * HTML.pm.src: Support for |<input>| in the "in select" insertion
1182 mode and support for the "in select in table" insertion mode (HTML5
1183 revision 1342).
1184
1185 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1186
1187 * HTML.pm.src: No foster parenting for <style> and <script>
1188 in non-tainted <table>s (HTML5 revision 1336).
1189
1190 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1191
1192 * HTML.pm.src: Ignore white space characters between <html>
1193 and <head> (HTML5 revision 1332).
1194
1195 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1196
1197 * HTML.pm.src: Treat <input type=hidden> as if it were a
1198 white space (HTML5 revision 1331).
1199
1200 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1201
1202 * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
1203 element (HTML5 revision 1330).
1204
1205 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1206
1207 * HTML.pm.src: <title> is always appended to the current
1208 element (HTML5 revision 1328).
1209
1210 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1211
1212 * HTML.pm.src: White space in tainted tables is moved
1213 into foster parents (HTML5 revision 1326).
1214
1215 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1216
1217 * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
1218 revision 1321).
1219
1220 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1221
1222 * HTML.pm.src: |</h/n/>| case code rearranged to align with
1223 the spec (HTML5 revision 1320). Note that we finally complete
1224 all of HTML5 revision 1320 changes.
1225
1226 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1227
1228 * HTML.pm.src: |</form>| now works similarly to |</div>| for unclosed
1229 tags (HTML5 revision 1320).
1230
1231 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1232
1233 * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
1234
1235 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1236
1237 * HTML.pm.src: A "generate implied end tags" code (t409.1)
1238 could not be reached, so it is now removed (HTML5 revision
1239 1320).
1240
1241 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1242
1243 * HTML.pm.src: Code for the case of |</div>| and so on are revised
1244 to align with new spec text (HTML5 revision 1320).
1245
1246 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1247
1248 * HTML.pm.src: Remove strange |if| condition; however, it should
1249 have done no harm in theory.
1250
1251 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1252
1253 * HTML.pm.src (_tree_construction_main): '</p>' in body
1254 case is split from other end tags for the preparation
1255 of implementing HTML5 revision 1320.
1256
1257 2008-03-07 Wakaba <wakaba@suika.fam.cx>
1258
1259 * HTML.pm.src: Simplified "generate implied end tag" (HTML5
1260 revision 1320).
1261
1262 2008-03-07 Wakaba <wakaba@suika.fam.cx>
1263
1264 * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
1265 and "div" (HTML5 revision 1318). Add comments to where
1266 |form| pointer association codes should be inserted (HTML5 revision
1267 1319).
1268
1269 2008-03-06 Wakaba <wakaba@suika.fam.cx>
1270
1271 * HTML.pm.src: <html> treatment refined (HTML5 revision 1314).
1272
1273 2008-03-05 Wakaba <wakaba@suika.fam.cx>
1274
1275 * HTML.pm.src: Since the case t268 should never be reached (there
1276 are no other token types), it is replaced by a |die| statement.
1277
1278 2008-03-05 Wakaba <wakaba@suika.fam.cx>
1279
1280 * HTML.pm.src: Typo fixed.
1281
1282 2008-03-04 Wakaba <wakaba@suika.fam.cx>
1283
1284 * HTML.pm.src (_tree_construction_initial): Some limited quirks
1285 doctypes were not uppercased for comparison.
1286
1287 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1288
1289 * HTML.pm.src (tree construction and set_inner_html): Checkpoints
1290 are added.
1291
1292 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1293
1294 * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
1295 are set. Cases that are unlikely to be reached are noted as such.
1296
1297 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1298
1299 * HTML.pm.src: Checkpoints for debugging are added.
1300
1301 * mkhtmlparser.pl: Support for |!!!cp| syntax.
1302
1303 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1304
1305 * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
1306 for simplicity.
1307
1308 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1309
1310 * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
1311 revision 1307).
1312
1313 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1314
1315 * HTML.pm.src (_get_next_token): The places where the /incorrect/
1316 flag is set are changed (HTML5 revision 1305).
1317
1318 2008-03-02 Wakaba <wakaba@suika.fam.cx>
1319
1320 * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
1321 1292). Entities are not parsed in comment-like part in RCDATA
1322 elements (HTML5 revision 1294). Allow bare & at the end
1323 of attribute value literals (HTML5 revision 1296). More
1324 quirks mode doctypes (HTML5 revision 1302). Require spaces
1325 between attributes and ban attribute names or unquoted
1326 attribute values containing single or double quotes (HTML5
1327 revision 1303).
1328
1329 2008-03-02 Wakaba <wakaba@suika.fam.cx>
1330
1331 * ContentChecker.pm: Typo fixed. Don't raise "character encoding"
1332 and related errors unless it is an HTML document (though the spec
1333 is unclear on whether it is applied to XHTML document).
1334
1335 * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
1336 are added.
1337
1338 2008-03-02 Wakaba <wakaba@suika.fam.cx>
1339
1340 * HTML.pm.src: s/local_name/manakai_local_name/g.
1341
1342 2008-03-01 Wakaba <wakaba@suika.fam.cx>
1343
1344 * _NamedEntityList.pm: Updated (HTML5 revision 1286).
1345
1346 * HTML.pm.src: |charset| in |content| attribute is
1347 case-insensitive (HTML5 revision 1270).
1348
1349 2008-02-26 Wakaba <wakaba@suika.fam.cx>
1350
1351 * ContentChecker.pm: New status constants are added.
1352 ($ElementDefault): |status| added.
1353 (check_element): Err for non-standard or deprecated elements.
1354 (_attr_status_info): For non-standard or deprecated attributes.
1355
1356 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1357
1358 * ContentChecker.pm (_attr_status_info): New internal method.
1359
1360 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1361
1362 * ContentChecker.pm (check_element): Element standardized
1363 status information is now dispatched.
1364
1365 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1366
1367 * ContentChecker.pm (check_element): Fix |del|-and-significant
1368 problem by adding some more arguments.
1369
1370 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1371
1372 * ContentChecker.pm (check_element): Use context of
1373 container-for-the-purpose-of-content-model element (not transparent
1374 element) for |check_child_element| calling and significant
1375 text flag marking. This reintroduces |<del>|-and-significant
1376 problem again.
1377
1378 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1379
1380 * ContentChecker.pm (check_element): Make semi-transparent
1381 elements ignored for the purpose of phase changes in
1382 content model checking.
1383
1384 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1385
1386 * ContentChecker.pm (check_element): In-element state
1387 was not properly managed for transparent cases.
1388
1389 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1390
1391 * ContentChecker.pm (check_element): Support for |video|
1392 and |audio| as semi-transparent elements.
1393
1394 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1395
1396 * ContentChecker.pm ($HTMLSemiTransparentElements): New.
1397 (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
1398 Support for |html:object| as a semi-transparent element.
1399
1400 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1401
1402 * ContentChecker.pm (check_element): The way to traverse
1403 the tree is entirely revised to make it easier to track
1404 the state of ancestors/descendants. As a result of this
1405 revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1406 support for content model checking for HTML elements |figure|,
1407 |object|, |video|, and |audio| and checking for XML elements (and
1408 some XMLNS checkings) are dropped for now. They will be
1409 reimplemented in due course.
1410
1411 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1412
1413 * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1414 token (HTML5 revision 1225).
1415
1416 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1417
1418 * ContentChecker.pm ({unsupported_level}): New value.
1419
1420 * HTML.pm.src: Save whether |meta| |content| attribute
1421 contains character references or not.
1422
1423 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1424
1425 * ContentChecker.pm (_get_children): (Incomplete) attempt to
1426 implement significant content checking for contents
1427 with |del| elements.
1428
1429 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1430
1431 * ContentChecker.pm ($HTMLTransparentElements): More
1432 elements are added.
1433 (_get_children): HTML |object| elements are now semi-transparent.
1434
1435 * NanoDOM.pm (manakai_html, manakai_head): New methods.
1436
1437 2008-02-16 Wakaba <wakaba@suika.fam.cx>
1438
1439 * CacheManifest.pm: HTML5 revision 1211 implemented.
1440
1441 * CacheManifest.pod: Updated.
1442
1443 2008-02-10 Wakaba <wakaba@suika.fam.cx>
1444
1445 * ContentChecker.pm (check_document, check_element): Support
1446 for second argument ($onsubdoc).
1447 (_get_css_parser): Removed (now it is part of WDCC).
1448
1449 2008-02-09 Wakaba <wakaba@suika.fam.cx>
1450
1451 * ContentChecker.pm (_get_css_parser): New.
1452
1453 2007-11-25 Wakaba <wakaba@suika.fam.cx>
1454
1455 * ContentChecker.pm ($AnyChecker): Old way to add child elements
1456 for checking had been used.
1457
1458 2007-11-25 Wakaba <wakaba@suika.fam.cx>
1459
1460 * ContentChecker.pm (check_element): New todo item type |descendant|.
1461
1462 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1463
1464 * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1465 specced in RFC 2046 and RFC 4288.
1466 (application/atom+xml): Definition added.
1467
1468 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1469
1470 * URIChecker.pm: Make RFC 3986 should-level errors
1471 warnings (rather than SHOULD-level errors).
1472
1473 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1474
1475 * NanoDOM.pm (get_user_data, set_user_data): New methods.
1476
1477 * HTML.pm.src: A flag for character references in attribute
1478 values is added. Set |manakai_has_reference| user data
1479 to |charset| attribute.
1480
1481 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1482
1483 * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1484 attributes.
1485
1486 * ContentChecker.pm (check_document): Warn if charset requirements
1487 cannot be tested.
1488
1489 2007-11-19 Wakaba <wakaba@suika.fam.cx>
1490
1491 * HTML.pm.src (parse_byte_string): Detect charset
1492 by universalchardet if charset parameter is not specified.
1493
1494 * Makefile (Charset-all, Charset-clean): New rules.
1495
1496 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1497
1498 * ContentChecker.pm (check_document): Check the existence
1499 of character encoding declaration and selection of encoding
1500 for HTML document.
1501
1502 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1503
1504 * ContentType.pm (get_sniffed_type): Return also the official
1505 type in list context.
1506
1507 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1508
1509 * ContentType.pm: Sniffing with leading white space
1510 ignoring (HTML5 revisions 1013 and 1016).
1511
1512 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1513
1514 * ContentType.pm: HTML5 revision 1013 changes, except for leading
1515 white spaces, are implemented.
1516
1517 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1518
1519 * HTML.pm.src (parse_char_string): Set |inner_encoding|
1520 attribute if possible.
1521
1522 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1523
1524 * HTML.pm.src (parse_byte_string): New method.
1525 (parse_char_string): New alias for |parse_string|.
1526 (main phase): Invoking "change the encoding" algorithm if desired.
1527
1528 * HTML.pod: Updated.
1529
1530 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1531
1532 * HTML.pod (get_inner_html): Removed.
1533
1534 * Makefile (HTML-all, HTML-clean): New.
1535
1536 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1537
1538 * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1539
1540 2007-11-08 Wakaba <wakaba@suika.fam.cx>
1541
1542 * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1543
1544 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1545
1546 * Makefile: |CacheManifest.html| is added.
1547
1548 * CacheManifest.pod: New file.
1549
1550 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1551
1552 * CacheManifest.pm: New module.
1553
1554 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1555
1556 * HTML.pm.src: Support for application cache selection algorithm
1557 callback.
1558
1559 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1560
1561 * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1562
1563 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1564
1565 * Makefile (clean): New rule.
1566
1567 * NanoDOM.pm (public_id, system_id): New attributes.
1568
1569 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1570
1571 * Makefile (CSS-all, CSS-clean, clean): New rules.
1572
1573 2007-10-14 Wakaba <wakaba@suika.fam.cx>
1574
1575 * ContentChecker.pm (check_document): Support for
1576 new |is_xml_root| flag.
1577 (check_element): Support for new |pluses| state.
1578 (_add_pluses): New method.
1579 (_remove_minuses): Support for new |minus| item.
1580
1581 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1582
1583 * ContentChecker.pm: Raise specific error for invalid
1584 root element.
1585
1586 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1587
1588 * ContentChecker.pm: Set level values for later uses.
1589
1590 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1591
1592 * ContentChecker.pm: Support for language tag validation.
1593
1594 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1595
1596 * LangTag.pm (check_rfc3066_language_tag): New method.
1597
1598 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1599
1600 * LangTag.pm: New module.
1601
1602 2007-09-04 Wakaba <wakaba@suika.fam.cx>
1603
1604 * HTML.pm.src: Some error types were wrong.
1605
1606 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1607
1608 * CSS/: New directory.
1609
1610 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1611
1612 * ContentChecker.pm (_check_get_children): Support
1613 for |noscript| in |head|.
1614
1615 2007-08-12 Wakaba <wakaba@suika.fam.cx>
1616
1617 * URI/: New directory.
1618
1619 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1620
1621 * HTML.pm.src: Tokenizer's states are now represented in
1622 number.
1623
1624 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1625
1626 * HTML.pm.src: |or|s for insertion modes are replaced
1627 by |&|s.
1628
1629 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1630
1631 * HTML.pm.src: Token types are now represented in number.
1632
1633 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1634
1635 * ContentType.pm (SEE ALSO): Updated.
1636
1637 * HTML.pm.src: Insertion modes are now represented in number.
1638
1639 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1640
1641 * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1642 999) is implemented.
1643
1644 2007-08-08 Wakaba <wakaba@suika.fam.cx>
1645
1646 * ContentType.pm: Sniffing for |<script| (HTML5 revision
1647 983) is implemented.
1648
1649 2007-08-06 Wakaba <wakaba@suika.fam.cx>
1650
1651 * ContentChecker.pod: New documentation.
1652
1653 * Makefile: A rule for |ContentChecker.html| is added.
1654
1655 * ContentChecker.pm: A pod "LICENSE" section is added.
1656
1657 * NanoDOM.pm ($VERSION): New variable.
1658
1659 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1660
1661 * H2H.pm: |b|, |i|, and |sub| are added to the
1662 list of allowed HTML elements.
1663
1664 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1665
1666 * H2H.pm: |samp| is added to the list of allowed
1667 HTML elements.
1668
1669 * URIChecker.pm (check_iri): New.
1670 (check_iri_reference): Error type for IRI reference
1671 syntax error is changed.
1672
1673 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1674
1675 * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1676 is added.
1677 (check_document): Load appropriate module before validation.
1678
1679 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1680
1681 * ContentChecker/: New directory.
1682
1683 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1684
1685 * ContentChecker.pm: HTML |time| element is implemented.
1686
1687 * HTMLTable.pm: Comments are updated as HTML5 is revised.
1688
1689 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1690
1691 * ContentChecker.pm (check_document): Return value
1692 even if no document element is found.
1693
1694 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1695
1696 * HTML.pm.src: |$in_body| is no longer a function.
1697
1698 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1699
1700 * HTML.pm.src: The |$in_body| code has been moved down.
1701
1702 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1703
1704 * HTML.pm.src: The "trailing end" insertion mode
1705 is split into "after html body" and "after html frameset"
1706 insertion modes. Their codes are merged with "after body"
1707 and "after frameset" codes. |$previous_insertion_mode|
1708 has been removed. "after frameset" code is
1709 merged with "in frameset" code.
1710
1711 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1712
1713 * HTML.pm.src: The "before head" insertion mode is
1714 merged with the "in head" insertion mode.
1715
1716 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1717
1718 * HTML.pm.src: Cases in "in head" insertion mode are
1719 reorganized.
1720
1721 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1722
1723 * HTML.pm.src: Some cases in "in table" insertion mode
1724 are merged.
1725
1726 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1727
1728 * HTML.pm.src: The "in row" insertion mode is merged
1729 with "in table" insertion mode.
1730
1731 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1732
1733 * HTML.pm.src: The "in table" and "in table body" insertion
1734 modes are merged.
1735
1736 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1737
1738 * HTML.pm.src: There is no "in table head" or "in table foot"
1739 insertion mode!
1740
1741 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1742
1743 * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
1744 now directly invoke the handler.
1745
1746 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1747
1748 * HTML.pm.src: Code for "in cell" insertion mode
1749 is merged into the "in body" insertion mode code.
1750
1751 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1752
1753 * HTML.pm.src: Codes for "in body" and "in caption"
1754 insertion modes are merged.
1755
1756 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1757
1758 * HTML.pm.src: Two |!!!next-token|s were missing.
1759
1760 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1761
1762 * HTML.pm.src: Use numeric constant for |{content_mode}|
1763 instead of string constant for |{content_model_flag}|.
1764
1765 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1766
1767 * HTML.pm.src: Add the name of the attribute
1768 to the "duplicate attribute" error.
1769
1770 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1771
1772 * ContentChecker.pm: Return the |class| node list.
1773
1774 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1775
1776 * ContentChecker.pm: Return the |id| node list.
1777
1778 * HTML.pm.src: A typo is fixed.
1779
1780 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1781
1782 * ContentChecker.pm: Drop wrong |level => 'error'| specification
1783 from "in HTML:xml:lang" error. Character position
1784 is now the last part of the error type in the URI error
1785 description. Report "unsupported" status for language
1786 tags, media queries, script codes, and style sheets.
1787
1788 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1789
1790 * ContentChecker.pm: Report error if |xml:lang|
1791 in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
1792 in XML.
1793
1794 * NanoDOM.pm (Attr.owner_document): New attribute.
1795
1796 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1797
1798 * HTML.pm.src: The character immediately following
1799 a bare |hcro| was discarded. Fix handling of
1800 entity references in attribute values.
1801
1802 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1803
1804 * HTML.pm.src (main and trailing end phases): Token
1805 types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
1806 factored out. Error types |in html:#DOCTYPE|
1807 and |after html:#DOCTYPE| are merged
1808 into |DOCTYPE in the middle|. |</frameset>|
1809 in fragment parsing mode changed the
1810 insertion mode.
1811
1812 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1813
1814 * HTML.pm.src: |$phase| has been removed; The |trailing end|
1815 phase is now an insertion mode. Treatments for white
1816 space character tokens were incorrect for some
1817 insertion modes. An old |meta| case was not removed.
1818
1819 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1820
1821 * HTML.pm.src: |meta| charset declaration extraction
1822 implemented (but changing the encoding is not yet implemented :-).
1823
1824 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1825
1826 * Charset/: New directory.
1827
1828 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1829
1830 * H2H.pm: New Perl module (created from
1831 manakai's H2H.dis).
1832
1833 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1834
1835 * XMLSerializer.pm: New Perl module (created from
1836 manakai's SimpleLS.dis).
1837
1838 2007-07-07 Wakaba <wakaba@suika.fam.cx>
1839
1840 * HTML.pm.src: |<!---x-->| was not processed correctly.
1841
1842 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1843
1844 * HTML.pm.src: Report correct error message
1845 for |<body></div></body>|.
1846
1847 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1848
1849 * HTMLTable.pm: An error description was incorrect.
1850
1851 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1852
1853 * ContentChecker.pm: Return |{term}| list.
1854
1855 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1856
1857 * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
1858 nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
1859 and <title> outside of head).
1860
1861 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1862
1863 * IMTChecker.pm: Report warning for unregistered
1864 and private types/subtypes.
1865
1866 * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
1867 URIChecker.pm, HTMLTable.pm: Error messages are now
1868 consistent; they are all listed in
1869 <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
1870
1871 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1872
1873 * ContentChecker.pm: |<img ismap>| not in |<a></a>|
1874 is now reported as an error. |<datalist>| is implemented.
1875 Attribute checkers for |<command>| and |<menu>| are
1876 added. Support for |contextmenu| global attribute
1877 is added.
1878
1879 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1880
1881 * HTML.pm.src (_reset_insertion_mode): Interpretation
1882 of Step 3 has been changed.
1883
1884 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1885
1886 * HTML.pm.src: Late |<html>| parse error is implemented.
1887
1888 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1889
1890 * URIChecker.pm (check_iri_reference): A |decode| method name was
1891 incorrect.
1892
1893 * ContentChecker.pm: Support for the |footer| element.
1894 Check URI syntax for space-separated URI attributes.
1895 Support for the |tabindex| attribute. Support
1896 for |datetime| attribute.
1897
1898 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1899
1900 * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
1901 character references). HTML5 revision 1.146 (white space
1902 characters before root start tag). HTML5 revision
1903 1.148 (named character references in attribute values).
1904 HTML5 revision 1.152 (<plaintext>.innerHTML get).
1905
1906 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1907
1908 * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
1909 in <head>).
1910
1911 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1912
1913 * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>
1914 in body).
1915
1916 * ContentChecker.pm: HTML5 revision 938 (scoped="").
1917
1918 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1919
1920 * HTML.pm.src: HTML5 revision 923 (matching end tag in CDATA
1921 or RCDATA in fragment parsing mode). HTML5 revision
1922 924 (<!--> and <!--->). HTML5 revision 926 (hn in hn).
1923
1924 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1925
1926 * HTML.pm.src (get_inner_html): HTML5 revision 922 (inner_html
1927 for <pre> and <textarea>).
1928
1929 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1930
1931 * HTML.pm.src: HTML5 revision 920 (<isindex>).
1932
1933 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1934
1935 * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).
1936
1937 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1938
1939 * HTML.pm.src: HTML5 revision 916 (</body>).
1940 HTML5 revision 917 (conforming bare &).
1941
1942 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1943
1944 * NanoDOM.pm (manakai_is_html): Setting to false did not work.
1945
1946 * HTML.pm.src: HTML5 revision 914 (</ in CDATA, RCDATA).
1947 HTML5 revision 915 (<nobr>).
1948
1949 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1950
1951 * HTML.pm.src: HTML5 revisions 908, 909, 912, and 913 (quirks mode).
1952
1953 * NanoDOM.pm (manakai_is_html, manakai_compat_mode, compat_mode):
1954 New attributes.
1955
1956 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1957
1958 * HTML.pm.src: HTML5 revisions 900, 901, 902, and 911 (<
1959 in tags).
1960
1961 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1962
1963 * .cvsignore: |Entities.html| is added.
1964
1965 * HTML.pm.src: |$entity_char| is removed and
1966 requires |Whatpm::_NamedEntityList| instead.
1967 HTML5 revision 898 (refc), except that lack of refc
1968 is a parse error.
1969
1970 * mkentitylist.pl: New script.
1971
1972 * Makefile (all): |_NamedEntityList.pm| is added.
1973 (_NamedEntityList.pm, Entities.html): New rules.
1974
1975 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1976
1977 * HTML.pm.src: Parse errors immediately after U+000D
1978 were ignored and U+000D immediately following another
1979 U+000D was not converted to U+000A.
1980
1981 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1982
1983 * HTML.pm.src (set_inner_html): HTML5 revision 892 (adopt
1984 nodes before they are appended). The parser was not ready for
1985 the NULL parse error and the escape flag.
1986
1987 * NanoDOM.pm (adopt_node): New.
1988
1989 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1990
1991 * HTML.pm.src: HTML5 revision 886 (insane comment in
1992 CDATA and RCDATA). Note that the current implementation
1993 simply repeats what the spec says, which may not be
1994 the best way to do it.
1995
1996 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1997
1998 * HTML.pm.src: HTML5 revision 884 (</form> does not close
1999 the form element if a descendant element without an implied end tag
2000 is still open).
2001
2002 2007-06-23 Wakaba <wakaba@suika.fam.cx>
2003
2004 * ContentChecker.pm: HTML5 revision 881 (Make |id| attribute with
2005 space characters non-conforming).
2006
2007 2007-06-23 Wakaba <wakaba@suika.fam.cx>
2008
2009 * HTML.pm.src: An error message was incorrect.
2010 HTML5 revision 869 (C1 character references).
2011
2012 2007-06-23 Wakaba <wakaba@suika.fam.cx>
2013
2014 * HTML.pm.src: HTML5 revision 867 (a LF at the beginning of
2015 a |textarea| is removed).
2016
2017 2007-06-05 Wakaba <wakaba@suika.fam.cx>
2018
2019 * NanoDOM.pm (get_attribute_node_ns): New method.
2020
2021 * ContentChecker.pm: |script| |async| and |defer|
2022 no longer require |src|. |async| MUST NOT be
2023 specified if |defer| is specified (HTML5 revision 858).
2024
2025 2007-05-30 Wakaba <wakaba@suika.fam.cx>
2026
2027 * HTML.pm.src: |<form><form>| went into an infinite loop.
2028
2029 2007-05-27 Wakaba <wakaba@suika.fam.cx>
2030
2031 * ContentChecker.pm (html): Set |is_root| (allowed
2032 as a document element) flag on.
2033 (new): Removed.
2034 (check_document): New method.
2035
2036 2007-05-27 Wakaba <wakaba@suika.fam.cx>
2037
2038 * ContentChecker.pm (thead, tfoot): Checker specifications
2039 were incorrect.
2040
2041 2007-05-27 Wakaba <wakaba@suika.fam.cx>
2042
2043 * ContentChecker.pm ($HTMLURIAttrChecker): Include
2044 error position in the |type| option of the error.
2045
2046 * HTMLTable.pm (form_table): The |$onerror| parameter
2047 is now optional. Some bugs are fixed.
2048
2049 2007-05-27 Wakaba <wakaba@suika.fam.cx>
2050
2051 * HTMLTable.pm: New module.
2052
2053 * ContentChecker.pm (table): Invoke table model error checker.
2054
2055 * NanoDOM.pm (first_child, get_attribute_ns): New.
2056
2057 2007-05-26 Wakaba <wakaba@suika.fam.cx>
2058
2059 * ContentChecker.pm ($HTMLLinkTypesAttrChecker): New checker.
2060 (link/@rel, a/@rel, area/@rel): Use new checker.
2061
2062 * Makefile (_LinkTypeList.pm, RelExtensions.html): New rules.
2063
2064 * _LinkTypeList.pm: New file.
2065
2066 * mklinktypelist.pl: New file.
2067
2068 * .cvsignore: |RelExtensions.html| added.
2069
2070 * NanoDOM.pm (child_nodes): Returns an empty array
2071 for non-child-containing node types.
2072 (text_content): New attribute.
2073
2074 2007-05-26 Wakaba <wakaba@suika.fam.cx>
2075
2076 * IMTChecker.pm: New module.
2077
2078 * ContentChecker.pm ($HTMLIMTAttrChecker): Call IMTChecker
2079 to test parameter value validity.
2080
2081 * HTML.pm.src ($style_start_tag): Attributes were
2082 discarded.
2083
2084 2007-05-25 Wakaba <wakaba@suika.fam.cx>
2085
2086 * ContentChecker.pm ($HTMLURIAttrChecker): Implemented.
2087
2088 2007-05-25 Wakaba <wakaba@suika.fam.cx>
2089
2090 * URIChecker.pm: All recommendations from RFC 3986
2091 and RFC 3987 are listed (not all testable items are checked yet).
2092
2093 2007-05-25 Wakaba <wakaba@suika.fam.cx>
2094
2095 * URIChecker.pm: New module.
2096
2097 2007-05-20 Wakaba <wakaba@suika.fam.cx>
2098
2099 * ContentChecker.pm: Now most attributes are implemented
2100 or associated with some placeholder.
2101 ($ElementDefault): Warn about unknown attributes
2102 for unknown elements as "attribute not supported".
2103 ($HTMLLanguageTagAttrChecker, $HTMLMQAttrChecker): New placeholders.
2104 ($HTMLUsemapAttrChecker, $HTMLTargetAttrChecker): New checkers.
2105 (|a| attribute checker): Reimplemented.
2106
2107 2007-05-20 Wakaba <wakaba@suika.fam.cx>
2108
2109 * ContentChecker.pm ($HTMLEventHandlerAttrChecker): New placeholder.
2110 ($HTMLAttrChecker): Event handler content attributes
2111 are added.
2112 (link, embed): Required attribute is now checked.
2113 (embed): Unknown local attributes no longer trigger warnings.
2114
2115 2007-05-20 Wakaba <wakaba@suika.fam.cx>
2116
2117 * ContentChecker.pm ($HTMLSpaceURIsAttrChecker): New placeholder.
2118 ($HTMLIMTAttrChecker): New checker.
2119 (link@rel, link@href, link@type, style@type,
2120 a@href, a@ping, a@ping, a@type, embed@src, embed@type,
2121 object@data, object@type, source@src, source@type, area@alt,
2122 area@shape, area@coords, area@href,
2123 area@ping, area@rel, area@type, script@src,
2124 script@defer, script@async, script@type): Checkers added.
2125
2126 2007-05-20 Wakaba <wakaba@suika.fam.cx>
2127
2128 * ContentChecker.pm: Descendant checking was incorrect.
2129
2130 2007-05-19 Wakaba <wakaba@suika.fam.cx>
2131
2132 * ContentChecker.pm: Support |xml:*| and |xmlns:*|
2133 attributes. Report an error if |Element.prefix|
2134 is |xmlns|.
2135
2136 * NanoDOM.pm (prefix): New attribute.
2137
2138 2007-05-19 Wakaba <wakaba@suika.fam.cx>
2139
2140 * HTML.pm.src: In |main| phase, |in body| insertion
2141 mode, action for |<iframe>| was missing.
2142
2143 2007-05-19 Wakaba <wakaba@suika.fam.cx>
2144
2145 * ContentChecker.pm: Support for many of HTML5 elements.
2146 ($GetHTMLNonNegativeIntegerAttrChecker): New.
2147
2148 2007-05-19 Wakaba <wakaba@suika.fam.cx>
2149
2150 * ContentChecker.pm: Support for most elements up to |progress|.
2151 ($HTMLURIAttrChecker): Placeholder.
2152 ($HTMLIntegerAttrChecker, $GetHTMLFloatingPointNumberAttrChecker):
2153 New.
2154
2155 2007-05-19 Wakaba <wakaba@suika.fam.cx>
2156
2157 * ContentChecker.pm: Attribute checkers
2158 for global attributes, |html|, |base|, |style|, and |meta|.
2159
2160 * NanoDOM.pm (insert_before): Weaken reference
2161 to the parent node.
2162 (Attr::new): Set |owner_element| attribute.
2163 (namespace_uri, manakai_local_name): New attribute implementations.
2164 (owner_element): New attribute.
2165
2166 2007-05-19 Wakaba <wakaba@suika.fam.cx>
2167
2168 * ContentChecker.pm ($AttrChecker, $HTMLAttrChecker,
2169 $AnyChecker->{attr_checker}, $HTMLAttrsChecker,
2170 $Element->{$HTML_NS}->{''}): New.
2171 (check_element): Invoke attrs_checker for each element.
2172
2173 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2174
2175 * ContentChecker.pm: Don't use |manakai_element_type_match|.
2176
2177 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2178
2179 * ContentChecker.pm: Use hashes rather than lists for
2180 element type tests.
2181
2182 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2183
2184 * ContentChecker.pm: Don't generate a duplicate
2185 error when an element type is put in the "minus" list
2186 and the element type is not allowed explicitly in the particular
2187 element content model.
2188 (html:a checker): New checker.
2189 (html:details, html:datagrid): New checkers.
2190 (html:legend): New checker.
2191
2192 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2193
2194 * ContentChecker.pm (html:li checker): Implemented.
2195
2196 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2197
2198 * ContentChecker.pm ($HTMLInlineOrStriclyInlineChecker): New
2199 checker.
2200 (html:dd checker): New checker.
2201 (html:q, html:em, html:strong, html:small,
2202 html:m, html:dfn, html:code, html:samp, html:span): New checkers.
2203
2204 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2205
2206 * ContentChecker.pm ($AnyChecker): Renamed
2207 from |$ElementDefault->{checker}|.
2208 ($ElementDefault->{checker}): Throw an error that
2209 the element type is not supported by the checker.
2210 ($HTMLMetadataElement): |html:base| was missing.
2211 ($HTMLEmptyChecker): Don't throw an error
2212 for inter-element whitespace nodes.
2213 (html:html checker): Errors were not
2214 thrown even if |html:head| and/or |html:body|
2215 children were missing.
2216 (html:head checker): An error was not
2217 thrown if <meta charset> appeared after other
2218 elements.
2219
2220 2007-05-05 Wakaba <wakaba@suika.fam.cx>
2221
2222 * ContentChecker.pm: |footer|, |video|, |audio|, |script|,
2223 and |noscript| elements are implemented.
2224 (new): New method.
2225
2226 2007-05-04 Wakaba <wakaba@suika.fam.cx>
2227
2228 * ContentChecker.pm: New module.
2229
2230 2007-05-04 Wakaba <wakaba@suika.fam.cx>
2231
2232 * NanoDOM.pm (manakai_parent_element,
2233 document_element, manakai_local_name,
2234 manakai_element_type_match): New methods.
2235
2236 2007-05-03 Wakaba <wakaba@suika.fam.cx>
2237
2238 * HTML.pm.src: Replace decimal and hexadecimal numeric
2239 entities in the C1 range using the Windows-1252 mapping. Bare LF
2240 did not count as a new line for error reporting.
2241
2242 2007-05-02 Wakaba <wakaba@suika.fam.cx>
2243
2244 * NanoDOM.pm (DOMImplementation): New class.
2245 (append_child): Weaken the |parent_node| reference.
2246 (create_element_ns, Element new): Set the |owner_document|
2247 reference.
2248 (implementation): New attribute.
2249 (owner_document, local_name, namespace_uri): New attributes.
2250
2251 * HTML.pm.src (parse_string): Line and column numbers
2252 are now provided to error handler.
2253 (!!!parse-error): Short descriptions are added.
2254 (_construct_tree): Split into three methods; support
2255 for innerHTML mode.
2256 (set_inner_html): New method.
2257
2258 2007-05-01 Wakaba <wakaba@suika.fam.cx>
2259
2260 * NanoDOM.html: Documentation is added.
2261
2262 * HTML.pod, ContentType.html: Documentation is revised.
2263
2264 * .cvsignore: Pod2html temporary files are added.
2265
2266 * Makefile: Make |NanoDOM.html|.
2267
2268 2007-05-01 Wakaba <wakaba@suika.fam.cx>
2269
2270 * HTML.pm.src (parse_string): New method.
2271 (get_inner_html): Renamed from |inner_html|.
2272
2273 * Makefile: A rule for |HTML.html| is added.
2274
2275 * HTML.pod: New documentation.
2276
2277 2007-05-01 Wakaba <wakaba@suika.fam.cx>
2278
2279 * NanoDOM.pm (last_child, previous_sibling): New attributes.
2280 (clone_node): Attribute nodes were not completely copied.
2281
2282 * HTML.pm.src: Many bugs are fixed.
2283
2284 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2285
2286 * HTML.pm.src: Some typos are fixed.
2287
2288 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2289
2290 * mkhtmlparser.pl, Makefile: References to the |HTML-consume-entity.src|
2291 are removed.
2292
2293 * HTML.pm.src: Tokenizer's handling of named entities is rewritten.
2294
2295 * HTML-consume-entity.src: Removed.
2296
2297 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2298
2299 * HTML.pm.src: Tokenizer's handling of hexadecimal
2300 numeric entities is rewritten.
2301
2302 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2303
2304 * HTML.pm.src: Some tokenizer bugs are fixed.
2305
2306 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2307
2308 * HTML.pm.src: The tree construction stage is implemented.
2309
2310 * mkhtmlparser.pl: New macros are added.
2311
2312 2007-04-28 Wakaba <wakaba@suika.fam.cx>
2313
2314 * ContentType.pm: A note on a bug in the specification
2315 is removed since it has now been fixed.
2316
2317 * .cvsignore: New file.
2318
2319 2007-04-28 Wakaba <wakaba@suika.fam.cx>
2320
2321 * HTML.pm.src, HTML-consume-entity.src: New files.
2322
2323 * Makefile (HTML.pm): New rule.
2324
2325 * mkhtmlparser.pl: New script.
2326
2327 2007-04-25 Wakaba <wakaba@suika.fam.cx>
2328
2329 * Makefile: New file.
2330
2331 2007-04-24 Wakaba <wakaba@suika.fam.cx>
2332
2333 * ContentType.pm: An error in pod is fixed.
2334
2335 2007-04-24 Wakaba <wakaba@suika.fam.cx>
2336
2337 * ContentType.pm: Documentation is added.
2338
2339 2007-04-24 Wakaba <wakaba@suika.fam.cx>
2340
2341 * ContentType.pm: New Perl module.
2342
2343 * ChangeLog: New file.
2344
2345
