/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Contents of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.370 - (show annotations) (download)
Fri Dec 12 12:59:17 2008 UTC (15 years, 10 months ago) by wakaba
Branch: MAIN
Changes since 1.369: +6 -0 lines
++ whatpm/t/dom-conformance/ChangeLog	12 Dec 2008 12:59:07 -0000
	* html-form-input.dat: Added test data for <input disabled> and
	<input autofocus>.

	* html-forms-1.dat: Added test data for <fieldset disabled>,
	<output for>, and <{formcontrol} name>.  Added test data for
	<{formcontrol} autofocus="">'s uniqueness.

2008-12-12  Wakaba  <wakaba@suika.fam.cx>

++ whatpm/Whatpm/ChangeLog	12 Dec 2008 12:54:19 -0000
	* ContentChecker.pm: Introduced new flag |has_autofocus|.  Added
	support for new |id_type| value of |any|, which means any kind of
	element is allowed as a referenced element.

2008-12-12  Wakaba  <wakaba@suika.fam.cx>

++ whatpm/Whatpm/ContentChecker/ChangeLog	12 Dec 2008 12:56:56 -0000
	* HTML.pm: Added checking for the uniqueness of |autofocus=""|
	attribute.  Implemented |name=""| attribute for form controls.
	Added U+000C into character class of space characters (it should
	have been, but was not in some of regular expressions).  Updated
	definitions for |output| elements.

2008-12-12  Wakaba  <wakaba@suika.fam.cx>

1 2008-12-12 Wakaba <wakaba@suika.fam.cx>
2
3 * ContentChecker.pm: Introduced new flag |has_autofocus|. Added
4 support for new |id_type| value of |any|, which means any kind of
5 element is allowed as a referenced element.
6
7 2008-12-12 Wakaba <wakaba@suika.fam.cx>
8
9 * ContentChecker.pm: Introduced |check_attrs2| method for the ease
10 of defining codes for checking required attribute and as such.
11
12 2008-12-12 Wakaba <wakaba@suika.fam.cx>
13
14 * IMTChecker.pm: Added more definitions for subtypes.
15
16 2008-12-11 Wakaba <wakaba@suika.fam.cx>
17
18 * URIChecker.pm: Some of |pos_end| values were wrong.
19
20 2008-12-06 Wakaba <wakaba@suika.fam.cx>
21
22 * ContentChecker.pm (check_element): Added support for "no
23 referenced datalist" error.
24
25 2008-12-06 Wakaba <wakaba@suika.fam.cx>
26
27 * URIChecker.pm: Bug fix: It did not work unless
28 Message::DOM::DOMImplementation has been |require|d.
29
30 2008-12-06 Wakaba <wakaba@suika.fam.cx>
31
32 * NanoDOM.pm (document_uri): New attribute.
33
34 * ContentChecker.pm: Don't use methods not implemented by NanoDOM.
35
36 2008-11-07 Wakaba <wakaba@suika.fam.cx>
37
38 * NanoDOM.pm (text_content): Don't create a Text node if the new
39 value is empty.
40
41 2008-11-06 Wakaba <wakaba@suika.fam.cx>
42
43 * SWML/: New directory.
44
45 2008-10-20 Wakaba <wakaba@suika.fam.cx>
46
47 * NanoDOM.pm (specified, all_declarations_processed,
48 manakai_attribute_type): New attributes.
49
50 2008-10-19 Wakaba <wakaba@suika.fam.cx>
51
52 * NanoDOM.pm (Entity->new): Initialize ->child_nodes as an empty
53 array.
54
55 2008-10-19 Wakaba <wakaba@suika.fam.cx>
56
57 * NanoDOM.pm (notation_name): New attribute.
58
59 2008-10-18 Wakaba <wakaba@suika.fam.cx>
60
61 * NanoDOM.pm (public_id, system_id): New attributes.a
62
63 2008-10-18 Wakaba <wakaba@suika.fam.cx>
64
65 * NanoDOM.pm (text_content): Moved to Node from Element. Setter
66 implemented.
67 (allowed_tokens, default_type, declared_type): Implemented.
68
69 2008-10-17 Wakaba <wakaba@suika.fam.cx>
70
71 * NanoDOM.pm (node_name): New attribute.
72 (ELEMENT_TYPE_DEFINITION_NODE, ATTRIBUTE_DEFINITION_NODE): New
73 constants.
74 (create_element_type_definition_node, create_attribute_definition,
75 create_notation, create_general_entity,
76 get_element_type_definition_node,
77 set_element_type_definition_node, get_general_entity_node,
78 set_general_entity_node, get_notation_node, set_notation_node,
79 get_attribute_definition_node, set_attribute_definition_node): New
80 methods.
81 (element_types, entities, notations, attribute_definitions): New
82 attributes.
83 (DocumentType): Support for child nodes, entities, notations, and
84 element types.
85 (Entity, Notation, ElementTypeDefinition, AttributeDefinition):
86 New classes.
87
88 * Dumper.pm: Support for general entities, notations, element type
89 definitions, and attribute definitions.
90
91 2008-10-15 Wakaba <wakaba@suika.fam.cx>
92
93 * NanoDOM.pm (create_processing_instruction): New method.
94 (xml_version, xml_encoding, xml_standalone): New attributes.
95 (ProcessingInstruction): New class.
96
97 2008-10-14 Wakaba <wakaba@suika.fam.cx>
98
99 * HTML.pm.src: Handling of end tags in the foreign content
100 insertion mode was partially wrong, because of wrong bit
101 operations.
102
103 2008-10-14 Wakaba <wakaba@suika.fam.cx>
104
105 * NanoDOM.pm (dom_config): New attribute (do nothing), for
106 Whatpm::XML::Parser support.
107
108 2008-10-14 Wakaba <wakaba@suika.fam.cx>
109
110 * Makefile: New rule to make HTML/Tokenizer.pm is added.
111
112 * HTML.pm.src: Tokenizer part moved to another file.
113
114 2008-10-13 Wakaba <wakaba@suika.fam.cx>
115
116 * HTML.pm.src: Merge |DT_EL| and |DD_EL| as |DTDD_EL|.
117
118 2008-10-13 Wakaba <wakaba@suika.fam.cx>
119
120 * HTML.pm.src: Element category constants redefined.
121
122 2008-10-13 Wakaba <wakaba@suika.fam.cx>
123
124 * HTML.pm.src: Steps for CDATA/RCDATA elements in tree
125 construction stage synced with the spec (HTML5 revisions 2139 and
126 2302).
127
128 2008-10-07 Wakaba <wakaba@suika.fam.cx>
129
130 * ContentChecker.pm: New error level "html5_fact" added, which
131 should be tentatively used until all of requirements are properly
132 specced as RFC 2119 "MUST" in HTML5.
133
134 2008-10-05 Wakaba <wakaba@suika.fam.cx>
135
136 * ContentChecker.pod: Note on internal flags is added.
137
138 2008-10-05 Wakaba <wakaba@suika.fam.cx>
139
140 * HTML.pm.src: An AAA bug fixed.
141
142 2008-10-04 Wakaba <wakaba@suika.fam.cx>
143
144 * HTML.pm.src: If another node is inserted by the parser, don't
145 reuse existing Text node to append a character (HTML5 revision
146 2124).
147
148 2008-10-04 Wakaba <wakaba@suika.fam.cx>
149
150 * HTML.pm.src: Support for <option> and <optgroup> in body (HTML5
151 revisions 1731 and 2128).
152
153 2008-10-04 Wakaba <wakaba@suika.fam.cx>
154
155 * HTML.pm.src: Make <foreignObject> scoping (HTML5 revision 1837).
156 Support for end tags of camelCase SVG elements were broken. A
157 wrong error type text fixed.
158
159 2008-10-04 Wakaba <wakaba@suika.fam.cx>
160
161 * HTML.pm.src: Drop redundant code (HTML5 revision 1731).
162
163 2008-10-04 Wakaba <wakaba@suika.fam.cx>
164
165 * HTML.pm.src: Support for new definition of |param| and |source|
166 start tag parsing (HTML5 revision 1731).
167
168 2008-10-04 Wakaba <wakaba@suika.fam.cx>
169
170 * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
171
172 2008-10-04 Wakaba <wakaba@suika.fam.cx>
173
174 * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
175 revisions 1731 and 1831).
176
177 2008-10-04 Wakaba <wakaba@suika.fam.cx>
178
179 * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
180 revisions 1731 and 1778). Support for the </sarcasm> end tag (HTML5
181 revision 1731).
182
183 2008-10-04 Wakaba <wakaba@suika.fam.cx>
184
185 * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
186 revision 1731). End tags of |option| and |optgroup| elements are
187 now optional (HTML5 revision 1731).
188
189 2008-10-04 Wakaba <wakaba@suika.fam.cx>
190
191 * HTML.pm.src: New "special" elements added to the list (HTML5
192 revision 1778). "strile" -> "strike".
193
194 2008-10-02 Wakaba <wakaba@suika.fam.cx>
195
196 * ContentType.pm (get_sniffed_type): Support for the "better"
197 content sniffing (HTML5 revision 1927). In a case the official
198 type was not returned when the method is invoked in the list
199 context.
200
201 2008-09-22 Wakaba <wakaba@suika.fam.cx>
202
203 * HTML.pm.src: Character references for non-space C0 characters,
204 including U+000B VT, DEL character, noncharacter code points, are
205 now converted to the U+FFFD character (cf. HTML5 revision 2138).
206
207 2008-09-21 Wakaba <wakaba@suika.fam.cx>
208
209 * ContentChecker.pm: |form=""| check support added.
210
211 2008-09-21 Wakaba <wakaba@suika.fam.cx>
212
213 * ContentChecker.pm: |contextmenu| validness is now checked using
214 |id| and |id_type| properties, and |menu| property is removed.
215
216 2008-09-21 Wakaba <wakaba@suika.fam.cx>
217
218 * ContentChecker.pm: Prepare for |form| |name| attribute's
219 duplication checking.
220
221 2008-09-21 Wakaba <wakaba@suika.fam.cx>
222
223 * HTML.pm.src (parse_byte_stream): Support (or non-support) for
224 unsupported charset="" parameter value (HTML5 revision 2131).
225
226 2008-09-20 Wakaba <wakaba@suika.fam.cx>
227
228 * HTML.pm.src: Reminding places where U+000B is allowed as a space
229 character is fixed (cf. HTML5 revision 1738).
230
231 * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
232 space characters (HTML5 revision 1738).
233
234 2008-09-20 Wakaba <wakaba@suika.fam.cx>
235
236 * HTML.pm.src: The "anything else" case for the "after after body"
237 insertion mode was not updated to swtich to the "in body"
238 insertion mode. U+000B is no longer a space character for the
239 purpose of tree construction phase (HTML5 revision 1738).
240
241 2008-09-20 Wakaba <wakaba@suika.fam.cx>
242
243 * HTML.pm.src: U+000B is no longer a space character (HTML5
244 revision 1738).
245
246 2008-09-20 Wakaba <wakaba@suika.fam.cx>
247
248 * ContentType.pm: 0x0B is no longer a space character (HTML5
249 revision 1738).
250
251 * HTML.pm.src: U+000B is no longer a space character for the
252 algorithm for extracting an encoding from a Content-Type (HTML5
253 revision 1738).
254
255 2008-09-20 Wakaba <wakaba@suika.fam.cx>
256
257 * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
258
259 2008-09-18 Wakaba <wakaba@suika.fam.cx>
260
261 * LangTag.pm: Add checks for remaining requirements from RFC 4646.
262
263 * mklangreg.pl: Sort 'Prefix' values by their length, to ease
264 matching.
265
266 2008-09-18 Wakaba <wakaba@suika.fam.cx>
267
268 * LangTag.pm: Warn for private use language subtags. Error level
269 typos fixed. Support for Suppress-Script field.
270
271 * mklangreg.pl: Support for dumping of nested structure.
272
273 2008-09-18 Wakaba <wakaba@suika.fam.cx>
274
275 * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
276 recommended case as per RFC 4646.
277
278 2008-09-18 Wakaba <wakaba@suika.fam.cx>
279
280 * LangTag.pm (check_rfc4646_langtag): New method.
281
282 2008-09-18 Wakaba <wakaba@suika.fam.cx>
283
284 * mklangreg.pl: New script.
285
286 * Makefile: Updated for creation of the module for language subtag
287 registry.
288
289 2008-09-16 Wakaba <wakaba@suika.fam.cx>
290
291 * Makefile: WebIDL.html added.
292
293 * WebIDL.pod: New documentation.
294
295 2008-09-16 Wakaba <wakaba@suika.fam.cx>
296
297 * WebIDL.pm: Checker's error types are redefined.
298
299 2008-09-16 Wakaba <wakaba@suika.fam.cx>
300
301 * WebIDL.pm: Parser's error types are redefined. Some forward
302 compatible parsing bugs are fixed. Some unreachable codes are
303 commented out.
304
305 2008-09-16 Wakaba <wakaba@suika.fam.cx>
306
307 * WebIDL.pm: Support for the reminding extended attributes are
308 added. It does not satisfy the definition that a forward
309 interface declaration has an extended attribute. It seems that
310 unless explicitly allowed multiple extended attributes with the
311 same name is not allowed, though it is not explicitly mentioned in
312 the spec.
313
314 2008-09-16 Wakaba <wakaba@suika.fam.cx>
315
316 * WebIDL.pm: Unescapes extended attribute names and extended
317 attribute identifiers. Preserve whether an extended attribute has
318 an argument list of not. Support for extended attributes:
319 Constructor, ExceptionConsts, IndexGetter, IndexSetter,
320 NameGetter, NameSetter, and Null.
321 (has_argument_list): New attribute.
322 (idl_text): Stringifies argument lists, if any, even if it is
323 empty.
324
325 2008-09-15 Wakaba <wakaba@suika.fam.cx>
326
327 * HTML.pm.src: New state |PCDATA_STATE|. Use an empty string for
328 |{s_kwd}| in DATA_STATE as default.
329
330 2008-09-15 Wakaba <wakaba@suika.fam.cx>
331
332 * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
333 by |{s_kwd}| in DATA_STATE.
334
335 2008-09-15 Wakaba <wakaba@suika.fam.cx>
336
337 * HTML.pm.src: Shorten keys.
338
339 2008-09-15 Wakaba <wakaba@suika.fam.cx>
340
341 * HTML.pm.src: Remove checking for control character, surrogate
342 pair, or noncharacter code points and non-Unicode code
343 points (they should be handled by Whatpm::Charset::UnicodeChecker).
344 (parse_char_stream): Support for the |$get_wrapper| argument and
345 character stream error handlers.
346
347 2008-09-15 Wakaba <wakaba@suika.fam.cx>
348
349 * ContentChecker.pm: Don't call |loda_ns_module|
350 for null-namespace elements/attributes.
351
352 * HTML.pm.src: Fact out $disallowed_control_chars
353 as a hash.
354
355 2008-09-14 Wakaba <wakaba@suika.fam.cx>
356
357 * HTML.pm.src: Regexp typo fixed. |{prev_char}|
358 and |{next_char}| initializations are moved to initialization
359 method. |{read_until}| now supports buffering. Sync |set_inner_html|
360 with |parse_char_stream|.
361
362 2008-09-14 Wakaba <wakaba@suika.fam.cx>
363
364 * HTML.pm.src (parse_char_stream): Make |set_next_char|
365 invoke |manakai_read_until|, not only |read|, where
366 possible, to decrease the number of |read| method calls.
367
368 * mkhtmlparser.pl: Related changes to the aforementioned
369 modification.
370
371 2008-09-14 Wakaba <wakaba@suika.fam.cx>
372
373 * HTML.pm.src: Use |read| instead of |getc|. |set_inner_html|
374 would report character error from now.
375
376 2008-09-14 Wakaba <wakaba@suika.fam.cx>
377
378 * HTML.pm.src: White-space-leaded non-white-space character
379 tokens in "before head insertion mode" was not
380 correctly handled.
381 (set_inner_html): Reimplemented using CharString decodehandle
382 class. Support for $get_wrapper argument. Support
383 for |{read_until}| feature.
384
385 2008-09-14 Wakaba <wakaba@suika.fam.cx>
386
387 * HTML.pm.src: Make a "bare ero" error for unknown
388 entities point the "&" character.
389
390 2008-09-14 Wakaba <wakaba@suika.fam.cx>
391
392 * HTML.pm.src: It turns out that U+FFFD don't have to
393 be added to the list of excluded characters.
394
395 2008-09-14 Wakaba <wakaba@suika.fam.cx>
396
397 * HTML.pm.src ($char_onerror): Have character decoder's |line|
398 and |column| a higher priority than the one set by the
399 tokenizer's input handler.
400 ($self->{read_until}): Exclude U+FFFD (but this might
401 not be necessary, since now we do line/column fixup in
402 the character decode handle).
403
404 2008-09-14 Wakaba <wakaba@suika.fam.cx>
405
406 * HTML.pm.src: Use |{read_until}| where possible.
407
408 2008-09-14 Wakaba <wakaba@suika.fam.cx>
409
410 * HTML.pm.src: Change |{getc_until}| to |{read_until}|
411 and |manakai_getc_until| to |manakai_read_until| to
412 reduce the number of string copies.
413
414 2008-09-14 Wakaba <wakaba@suika.fam.cx>
415
416 * HTML.pm.src (parse_char_string): Use newly created
417 |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
418 standard feature to |open| a string as a filehandle,
419 since Perl's string filehandle seems not supporting |ungetc|
420 method correctly.
421 (parse_char_stream): Define |{getc_until}| method.
422 (DATA_STATE): Experimental support for |getc_until| feature.
423
424 2008-09-13 Wakaba <wakaba@suika.fam.cx>
425
426 * HTML.pm.src: Check points added to newly added branches.
427
428 2008-09-13 Wakaba <wakaba@suika.fam.cx>
429
430 * HTML.pm.src: Remove |{char}|, which is no longer used.
431 Remove |{entity_in_attr}| and |{last_attribute_value_state}|
432 and replaced by |{prev_state}|.
433
434 * mkhtmlparser.pl: Remove |{char}| feature.
435 Remove |!!!back-next-input-character;| macro.
436
437 2008-09-13 Wakaba <wakaba@suika.fam.cx>
438
439 * HTML.pm.src: Finally we get rid of all the inner loops. Remove
440 entity related tokenizer states in favor of new states
441 implementing the consume character reference algorithm.
442
443 2008-09-13 Wakaba <wakaba@suika.fam.cx>
444
445 * HTML.pm.src: "Consume a character reference" algorithm is
446 now implemented as a tokenizer's state, rather than
447 a method, with minimum changes (more changes will
448 be made, in due course). "Bogus comment state"'s inner
449 loop gets removed.
450
451 2008-09-13 Wakaba <wakaba@suika.fam.cx>
452
453 * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
454 into their own tokenizer states.
455
456 2008-09-13 Wakaba <wakaba@suika.fam.cx>
457
458 * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
459 is split into three states.
460
461 2008-09-13 Wakaba <wakaba@suika.fam.cx>
462
463 * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
464 itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
465 no longer does the tokenizer have to push back next input
466 characters in those states.
467
468 2008-09-13 Wakaba <wakaba@suika.fam.cx>
469
470 * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
471 into four states so that no longer does the tokenizer have to push
472 back next input characters in that state.
473
474 2008-09-11 Wakaba <wakaba@suika.fam.cx>
475
476 * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
477 which can be used to insert some wrapper between the character
478 stream handle and the tokenizer. (It is currently not supported
479 for |set_inner_html| for |Element|s).
480
481 2008-09-10 Wakaba <wakaba@suika.fam.cx>
482
483 * HTML.pm.src: Ignore punctuations in charset names.
484
485 2008-09-10 Wakaba <wakaba@suika.fam.cx>
486
487 * ContentChecker.pm: Support for charset-layer error levels.
488
489 * HTML.pm.src: Don't specify |text| argument for the
490 |chardecode:fallback| error, since it is not the encoding
491 being used alternatively.
492
493 2008-09-06 Wakaba <wakaba@suika.fam.cx>
494
495 * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
496
497 2008-08-31 Wakaba <wakaba@suika.fam.cx>
498
499 * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
500
501 2008-08-31 Wakaba <wakaba@suika.fam.cx>
502
503 * HTML.pm.src: Bug fix and sync with the spec with regard
504 to after after frameset insertion mode processing (HTML5
505 revision 1909). Note that the implementation was wrong
506 per the old spec before the r1909 changes.
507
508 2008-08-30 Wakaba <wakaba@suika.fam.cx>
509
510 * HTMLTable.pm: scope=auto algorithm fix synced with the
511 spec (HTML5 revision 2093).
512 ($process_row): Algorithm step numbers synced with the
513 spec (HTML5 revision 2092).
514
515 2008-08-30 Wakaba <wakaba@suika.fam.cx>
516
517 * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
518 revision 2094).
519
520 2008-08-30 Wakaba <wakaba@suika.fam.cx>
521
522 * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
523
524 2008-08-30 Wakaba <wakaba@suika.fam.cx>
525
526 * HTML.pm.src: '"' and "'" at the end of attribute
527 name (after another attribute) now raise parse error (HTML5
528 revision 2123). Empty unquoted attribute values are no
529 longer allowed (HTML5 revision 2122).
530
531 2008-08-30 Wakaba <wakaba@suika.fam.cx>
532
533 * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
534 revision 2130).
535
536 2008-08-30 Wakaba <wakaba@suika.fam.cx>
537
538 * ContentChecker.pm: |xml:lang| attribute value must be same
539 as |lang| attribute value for HTML elements (HTML5 revision 2062
540 and so on).
541
542 2008-08-30 Wakaba <wakaba@suika.fam.cx>
543
544 * ContentChecker.pm: Error level definition for |xml_id_error|
545 was missing.
546
547 * URIChecker.pm: The end of the URL should be marked as the
548 error location for an empty path error. The position
549 between the userinfo and the port components should be
550 marked as the error location for an empty host error.
551
552 2008-08-30 Wakaba <wakaba@suika.fam.cx>
553
554 * URIChecker.pm: Set parameters representing where in the
555 value the error occurs for errors. Report unknown
556 address format error in warning level, since address
557 formats are rarely added. Path segments starting with "/.."
558 were misinterpreted as a dot-segment.
559
560 2008-08-30 Wakaba <wakaba@suika.fam.cx>
561
562 * URIChecker.pm (check_iri_reference): Requires
563 |Message::DOM::DOMImplementation|.
564
565 2008-08-29 Wakaba <wakaba@suika.fam.cx>
566
567 * IMTChecker.pm: Updated for the new error reporting architecture.
568
569 * ContentChecker.pm: Error levels for IMTs are added.
570
571 2008-08-17 Wakaba <wakaba@suika.fam.cx>
572
573 * H2H.pm (_shift_token): Support for unquoted HTML attribute
574 values.
575
576 2008-08-16 Wakaba <wakaba@suika.fam.cx>
577
578 * CacheManifest.pm: Support for new style of error
579 reports.
580
581 * HTML.pm.src: Set line=1, column=1 to the document node.
582
583 2008-08-16 Wakaba <wakaba@suika.fam.cx>
584
585 * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
586 and URL checkers. Support for more error levels for bogus
587 langauge tag and URL "standards".
588
589 * LangTag.pm, URIChecker.pm: Support for new style error
590 level reporting.
591
592 2008-08-15 Wakaba <wakaba@suika.fam.cx>
593
594 * ContentChecker.pm: Support for RDF/XML error levels.
595
596 * HTMLTable.pm, RDFXML.pm: Support for new style of error level
597 specifying. Error types are revised.
598
599 2008-08-15 Wakaba <wakaba@suika.fam.cx>
600
601 * ContentChecker.pm: All error reporting method calls are
602 renewed.
603
604 2008-08-15 Wakaba <wakaba@suika.fam.cx>
605
606 * HTML.pm.src: All error type names and "text" parameters
607 are revised. Use new style for "level" specification.
608
609 * mkhtmlparser.pl: Use new style for "level" specification.
610
611 2008-08-03 Wakaba <wakaba@suika.fam.cx>
612
613 * WebIDL.pm (parse_char_string): Simplified error
614 reporting process for broken ignored valuetype definition.
615 (Valuetype idl_text): Support for special "DOMString" name.
616
617 2008-08-03 Wakaba <wakaba@suika.fam.cx>
618
619 * WebIDL.pm ($get_scoped_name): Append "::::" if the last
620 terminal of the ScopedName is "DOMString", such that whether
621 the last part of the scoped name is "DOMString" or "_DOMString"
622 later. It is necessary to determine whether a |typedef|
623 definition should be ignored or not.
624 (parse_char_string): Unescape the identifier of
625 exception members.
626 ($resolve): Return undef for builtin types and sequence<T>
627 types (we might not have to do this, however...).
628 (check): Support checking for Exceptions, Valuetypes,
629 and Typedefs.
630 ($serialize_type): Support for "DOMString::::" syntax.
631 (Typedef idl_text): Output Type as "DOMString" if it
632 is really "DOMString" (i.e. its internal representation
633 is "::DOMString::").
634
635 2008-08-03 Wakaba <wakaba@suika.fam.cx>
636
637 * WebIDL.pm ($resolve): New code, based on resolve code
638 for constant types in the |check| method.
639 (check): Support for checking of attributes, operations, and
640 arguments.
641 (Attribute/Operation idl_text): Exception names in getraises,
642 setraises, and raises clauses is serizlied by |$serialize_type|
643 code.
644
645 2008-08-02 Wakaba <wakaba@suika.fam.cx>
646
647 * WebIDL.pm ($integer): Order of selections are changed to match
648 hexadecimal numbers (the original pattern, taken from the spec,
649 was not work for hexadecimal numbers, because the "0" prefix
650 matches to the [0-7]* part (as an empty string) and therefore
651 it does not match with remaining "x..." part of a "0x..." integer
652 literal.
653 ($get_type): It now returns a string, not an array reference,
654 for regular types and |sequence| types (i.e. it in any case
655 returns a string).
656 ($get_next_token): The second item in the array that represents
657 a integer or float token is now a Perl number value, not the
658 original string representation of the number.
659 (check): Support for const value consistency checking.
660 No extended attribute is defined for constants.
661 (Node subclasses): Use simple strings rather than array references
662 for default data type values.
663 ($serialize_type): Type values are now simple strings.
664 (value): If the new attribute value is a false value, then
665 a FALSE value is set to the attribute.
666
667 2008-08-02 Wakaba <wakaba@suika.fam.cx>
668
669 * WebIDL.pm ($get_scoped_name): Now scoped names are stored
670 in its stringified format ("scoped name" as defined in the
671 spec). Note that future version of this module should not use
672 array references for type values and the |type_text| attribute
673 should be made obsolete.
674 (parse_char_string): Unescape attribute names.
675 (check): Support for checking of whether inherited interfaces
676 are actually defined or not. Support for checking of whether
677 interface member identifiers are duplicated or not.
678 ($serialize_type): Scoped names are returned as is. A future
679 version of this code should escape identifiers other than "DOMString",
680 otherwise the idl_text would be non-conforming.
681
682 2008-08-02 Wakaba <wakaba@suika.fam.cx>
683
684 * WebIDL.pm (parse_char_string): Set line/column numbers
685 to generated nodes. Unescape identifiers. Extended attributes
686 for Definition's were ignored.
687 (append_child): Set |parent_node| attribute.
688 (parent_node): New attribute.
689 (check): Support interface/exception members. Support
690 extended attributes. Support definition identifier uniqueness
691 constraint.
692 (qualified_name): New attribute.
693 (Interface/Exception idl_text): Extended attributes were
694 not prepended to the returned text.
695
696 2008-08-02 Wakaba <wakaba@suika.fam.cx>
697
698 * WebIDL.pm (parse_char_string): Set line/column numbers
699 to interface object experimentally. s/shift/pop/g, shift
700 would make things wrong. Support for interface forward
701 declarations was missing. Broken interface declarations
702 with no block were not ignored entirely.
703 (Whatpm::WebIDL::Node): New abstract class. This class
704 makes things easier.
705 (child_nodes): New attribute. Unlike DOM's attribute with
706 same name, this attribute returns a dead list of nodes for
707 simplicity.
708 (get_user_data, set_user_data): New methods.
709 (Module idl_text): A SPACE character should be inserted
710 before the |{| character.
711 (Interface idl_text): Support for interface forward declarations.
712 (is_forward_declaration): New attribute.
713
714 2008-07-19 Wakaba <wakaba@suika.fam.cx>
715
716 * WebIDL.pm (type_text): Better serializer.
717
718 2008-07-19 Wakaba <wakaba@suika.fam.cx>
719
720 * WebIDL.pm: Revise forward-compatible parsing so that
721 it now can handle broken extended attributes and as such.
722
723 2008-07-19 Wakaba <wakaba@suika.fam.cx>
724
725 * WebIDL.pm: Real support for extended attributes.
726 Support for extended attributes with arguments.
727
728 2008-07-19 Wakaba <wakaba@suika.fam.cx>
729
730 * WebIDL.pm: Support for |exception| syntax.
731 (Interface->idl_text): Tentative support for inheritances.
732
733 2008-07-19 Wakaba <wakaba@suika.fam.cx>
734
735 * WebIDL.pm: Hierarchical scoped name support was broken.
736 Support for raises, setraises, and getraises syntaxes.
737
738 2008-07-18 Wakaba <wakaba@suika.fam.cx>
739
740 * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
741 proper support for types, extended attributes, and exceptions yet).
742 WebIDL parser, version 1 (no support for exceptions yet,
743 no proper support for extended attributes yet).
744
745 2008-07-09 Wakaba <wakaba@suika.fam.cx>
746
747 * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
748
749 2008-06-29 Wakaba <wakaba@suika.fam.cx>
750
751 * WebIDL.pm: Support for valuetype and const.
752
753 2008-06-29 Wakaba <wakaba@suika.fam.cx>
754
755 * WebIDL.pm: New module.
756
757 2008-06-15 Wakaba <wakaba@suika.fam.cx>
758
759 * Makefile (Entities.html): URI changed.
760
761 2008-06-08 Wakaba <wakaba@suika.fam.cx>
762
763 * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
764
765 2008-06-01 Wakaba <wakaba@suika.fam.cx>
766
767 * HTML.pm.src (_get_next_token): A parse error was missing.
768
769 2008-06-01 Wakaba <wakaba@suika.fam.cx>
770
771 * mklinktypelist.pl: rel=contact is no longer part of the HTML5
772 spec (commented out). (HTML5 revision 1711).
773
774 2008-05-25 Wakaba <wakaba@suika.fam.cx>
775
776 * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
777
778 * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
779 as UTF-16 (HTML5 revision 1701).
780
781 2008-05-25 Wakaba <wakaba@suika.fam.cx>
782
783 * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
784 1692).
785
786 2008-05-25 Wakaba <wakaba@suika.fam.cx>
787
788 * HTML.pm.src: The secondary insertion mode used when switching
789 to foreign content is the "in body" insertion mode (HTML5 revision
790 1696).
791
792 2008-05-25 Wakaba <wakaba@suika.fam.cx>
793
794 * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
795 revision 1697).
796
797 2008-05-24 Wakaba <wakaba@suika.fam.cx>
798
799 * HTML.pm.src: Support for end-of-file token in foreign content
800 insertion mode (HTML5 revision 1693). Update SVG camelCase
801 attribute list (HTML5 revision 1700). <textarea> closes
802 </select> (HTML5 revision 1699). More start tags close in
803 foreign content insertion mode (HTML5 revision 1698).
804
805 2008-05-24 Wakaba <wakaba@suika.fam.cx>
806
807 * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
808
809 2008-05-24 Wakaba <wakaba@suika.fam.cx>
810
811 * HTML.pm.src: More robust charset parameter detection (HTML5
812 revision 1674).
813
814 2008-05-24 Wakaba <wakaba@suika.fam.cx>
815
816 * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
817 revision 1676).
818
819 2008-05-24 Wakaba <wakaba@suika.fam.cx>
820
821 * HTML.pm.src: Ignore language part of public identifiers for
822 quriks mode detection (HTML5 revision 1679).
823
824 2008-05-24 Wakaba <wakaba@suika.fam.cx>
825
826 * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
827 revision 1685).
828
829 2008-05-24 Wakaba <wakaba@suika.fam.cx>
830
831 * HTML.pm.src: Support for EOF in new states for tags (HTML5
832 revision 1684).
833
834 2008-05-24 Wakaba <wakaba@suika.fam.cx>
835
836 * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
837 work (HTML5 revision 1690).
838
839 2008-05-24 Wakaba <wakaba@suika.fam.cx>
840
841 * HTML.pm.src (_tree_construction_main): Change handling of
842 end tags in head insertion modes (HTML5 revision 1686).
843 (parse_char_string): Bug fix for non-utf8 character string handlings.
844 (parse_char_stream): |ungetc| does not work well for this context.
845
846 2008-05-18 Wakaba <wakaba@suika.fam.cx>
847
848 * HTML.pm.src (parse_byte_string): Redefined to invoke
849 |parse_byte_stream|.
850 (parse_byte_stream): New method.
851
852 2008-05-18 Wakaba <wakaba@suika.fam.cx>
853
854 * HTML.pm.src (parse_byte_string): Fix the column number reported
855 by encoding layer error reporter.
856
857 2008-05-17 Wakaba <wakaba@suika.fam.cx>
858
859 * HTML.pm.src (parse_byte_string): Use streaming decoder
860 rather than converting the whole byte string and then parsing.
861 Propagate errors in character encoding layer.
862 (get_next_token): Precise error reporting for |bare stago| error.
863
864 2008-05-17 Wakaba <wakaba@suika.fam.cx>
865
866 * HTML.pm.src (parse_char_stream): New method.
867 (parse_char_string): This method is now defined as an invocation
868 of the |parse_char_stream| method.
869
870 2008-05-17 Wakaba <wakaba@suika.fam.cx>
871
872 * HTML.pm.src (parse_byte_string): Report various status
873 of the sniffing as info-level errors. Support for new
874 decoding framework in parser resestting.
875 (new): Various default error levels were not set.
876
877 2008-05-17 Wakaba <wakaba@suika.fam.cx>
878
879 * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
880 algorithm, except for the actual sniffing, is implemented
881 with new framework with Message::Charset::Info.
882
883 2008-05-16 Wakaba <wakaba@suika.fam.cx>
884
885 * CacheManifest.pm (_parse): Drop fragment identifiers from
886 URIs in fallback section (HTML5 revision 1596).
887
888 2008-05-10 Wakaba <wakaba@suika.fam.cx>
889
890 * Makefile (Entities.html): URI has changed.
891
892 2008-05-10 Wakaba <wakaba@suika.fam.cx>
893
894 * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
895 1553).
896
897 2008-05-06 Wakaba <wakaba@suika.fam.cx>
898
899 * ContentChecker.pm: Noted that those returned in |table| are
900 no longer table elements, but table objects returned
901 by Whatpm::HTMLTable.
902
903 * HTMLTable.pm (form_table): Return table element node
904 as |$table->{element}|.
905 (assign_header): Support for the |headers=""| attribute.
906
907 2008-05-06 Wakaba <wakaba@suika.fam.cx>
908
909 * HTMLTable.pm (assign_header): New function; first version
910 with no support for headers="".
911 (form_table): Include table width and height to the returned
912 table object for covenience. Indexing in column assignement was wrong.
913 Set whether a data cell is empty or not for convenience.
914
915 2008-05-05 Wakaba <wakaba@suika.fam.cx>
916
917 * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
918
919 2008-05-05 Wakaba <wakaba@suika.fam.cx>
920
921 * HTMLTable.pm: How table model errors are detected is
922 changed (HTML5 revision 1387).
923
924 2008-05-05 Wakaba <wakaba@suika.fam.cx>
925
926 * HTMLTable.pm: The algorithm now moves |tfoot| elements
927 to the end of the table (HTML5 revision 1380).
928
929 2008-05-05 Wakaba <wakaba@suika.fam.cx>
930
931 * HTMLTable.pm: The algorithm is now 0-based indexing, instead
932 of 1-based (HTML5 revision 1376).
933
934 2008-05-05 Wakaba <wakaba@suika.fam.cx>
935
936 * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
937 and "Content-Encoding" no longer prevent sniffing (HTML5
938 revision 1288).
939
940 2008-05-05 Wakaba <wakaba@suika.fam.cx>
941
942 * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
943 1282).
944
945 2008-05-03 Wakaba <wakaba@suika.fam.cx>
946
947 * ContentChecker.pm: Support for global attributes.
948 Status of XML specs are added.
949
950 2008-05-03 Wakaba <wakaba@suika.fam.cx>
951
952 * ContentChecker.pm (check_element): Support for |template=""|
953 and |ref=""| attribute (referent element type checking).
954
955 2008-04-29 Wakaba <wakaba@suika.fam.cx>
956
957 * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
958 1500) is implemented (except for IDNA part and URI-scheme-specific
959 knowledge). Line number counting was wrong for LF-only
960 documents.
961
962 2008-04-13 Wakaba <wakaba@suika.fam.cx>
963
964 * HTML.pm.src: Raise an parse error for any disallowed
965 character (HTML5 revision 1263).
966
967 2008-04-13 Wakaba <wakaba@suika.fam.cx>
968
969 * mkentitylist.pl: Support for new HTML5 entity table format (the
970 definition for |AElig;| was missing).
971
972 2008-04-13 Wakaba <wakaba@suika.fam.cx>
973
974 * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
975 name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
976
977 2008-04-13 Wakaba <wakaba@suika.fam.cx>
978
979 * HTML.pm.src: List of element names that close foreign content
980 insertion mode is added (HTML5 revisions 1412 and 1418).
981
982 2008-04-13 Wakaba <wakaba@suika.fam.cx>
983
984 * HTML.pm.src: Support for |mglyph| and |malignmark|
985 elements (HTML5 revision 1410).
986
987 2008-04-13 Wakaba <wakaba@suika.fam.cx>
988
989 * HTML.pm.src: Support for new long MathML entities (HTML5
990 revision 1406).
991
992 2008-04-13 Wakaba <wakaba@suika.fam.cx>
993
994 * HTML.pm.src: CDATA section support for MathML and SVG
995 elements (HTML5 revision 1404 and 1420).
996
997 2008-04-12 Wakaba <wakaba@suika.fam.cx>
998
999 * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
1000 elements (HTML5 revision 1404). Unused !!!macro definitions
1001 are removed.
1002
1003 2008-04-12 Wakaba <wakaba@suika.fam.cx>
1004
1005 * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
1006 are raised is changed (HTML5 revision 1404).
1007
1008 2008-04-06 Wakaba <wakaba@suika.fam.cx>
1009
1010 * HTML.pm.src: Category sets that are no longer used are removed.
1011
1012 2008-04-06 Wakaba <wakaba@suika.fam.cx>
1013
1014 * HTML.pm.src: The ->[1] property of stack entries are now
1015 replaced by constants representing element category.
1016
1017 2008-04-06 Wakaba <wakaba@suika.fam.cx>
1018
1019 * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
1020 for error reporting. (This is a preparation for using constant
1021 value for ->[1].)
1022
1023 2008-03-22 Wakaba <wakaba@suika.fam.cx>
1024
1025 * RDFXML.pm: Typo fixed.
1026
1027 2008-03-22 Wakaba <wakaba@suika.fam.cx>
1028
1029 * ContentChecker.pm: |fact_level| is now treated
1030 as same as |must_level|, i.e. level = |m|.
1031 (check_element): Make list of URIs in the DOM.
1032
1033 2008-03-21 Wakaba <wakaba@suika.fam.cx>
1034
1035 * RDFXML.pm: Language accessor implemented.
1036 Local (null-namespace) attribute support.
1037
1038 2008-03-21 Wakaba <wakaba@suika.fam.cx>
1039
1040 * RDFXML.pm: Factored out ID checking code.
1041
1042 2008-03-21 Wakaba <wakaba@suika.fam.cx>
1043
1044 * RDFXML.pm: TODO items noted. Validation of ID
1045 and URI attributes is implemented. Warn if unknown
1046 value is used in rdf:parseType="" attribute.
1047
1048 * URIChecker.pm (check_rdf_uri_reference): New function.
1049
1050 2008-03-21 Wakaba <wakaba@suika.fam.cx>
1051
1052 * RDFXML.pm: bnodeid implemented. Relative references
1053 are now resolved.
1054
1055 2008-03-21 Wakaba <wakaba@suika.fam.cx>
1056
1057 * ContentChecker.pm: RDF reification implemented.
1058
1059 * RDFXML.pm: undef vs false bug fixed.
1060 Reification implemented.
1061
1062 2008-03-21 Wakaba <wakaba@suika.fam.cx>
1063
1064 * RDFXML.pm: s/id/ID/ for attribute name.
1065 The |node| arguments are added for |ontriple| calls.
1066 Too many "attribute not allowed" errors were raised.
1067
1068 * ContentChecker.pm: Initial experimental support for rdf:RDF
1069 element.
1070
1071 2008-03-21 Wakaba <wakaba@suika.fam.cx>
1072
1073 * RDFXML.pm: New module.
1074
1075 2008-03-20 Wakaba <wakaba@suika.fam.cx>
1076
1077 * HTML.pm.src (set_inner_html): Line/column number
1078 code was old one yet.
1079
1080 2008-03-20 Wakaba <wakaba@suika.fam.cx>
1081
1082 * HTML.pm.src: Better line/column reporting for "duplicate attribute"
1083 errors. Line/column markings for DOCTYPE, comment, and
1084 character tokens are reintroduced; otherwise,
1085 error location for "not HTML5" error and errors
1086 for implied elements are not attached.
1087
1088 2008-03-20 Wakaba <wakaba@suika.fam.cx>
1089
1090 * HTML.pm.src: Set line/column numbers to attributes.
1091
1092 * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
1093 (value): Setter implemented.
1094
1095 * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
1096
1097 2008-03-20 Wakaba <wakaba@suika.fam.cx>
1098
1099 * HTML.pm.src: Unused line/column markings are removed.
1100
1101 2008-03-20 Wakaba <wakaba@suika.fam.cx>
1102
1103 * HTML.pm.src (_get_next_token): Remove |first_start_tag|
1104 flag, which is no longer used.
1105
1106 2008-03-17 Wakaba <wakaba@suika.fam.cx>
1107
1108 * HTML.pm.src: Set line/column information to element nodes.
1109
1110 * mkhtmlparser.pl (!!!create-element, !!!insert-element,
1111 and !!!insert-element-t): Set line/column information to
1112 element nodes.
1113
1114 2008-03-17 Wakaba <wakaba@suika.fam.cx>
1115
1116 * HTML.pm.src (_get_next_token): The first "<" character
1117 in "<?", "<>", or "</>" should be the error point.
1118
1119 2008-03-16 Wakaba <wakaba@suika.fam.cx>
1120
1121 * HTML.pm.src: Some more fixes on error position reporting.
1122
1123 2008-03-16 Wakaba <wakaba@suika.fam.cx>
1124
1125 * HTML.pm.src: Token-level precious error reporting.
1126
1127 2008-03-16 Wakaba <wakaba@suika.fam.cx>
1128
1129 * HTML.pm.src: Preparation for more precious error point
1130 reporting.
1131
1132 2008-03-11 Wakaba <wakaba@suika.fam.cx>
1133
1134 * HTML.pm.src: Error type revised.
1135
1136 2008-03-11 Wakaba <wakaba@suika.fam.cx>
1137
1138 * HTML.pm.src: Similar codes are merged together, again.
1139
1140 2008-03-11 Wakaba <wakaba@suika.fam.cx>
1141
1142 * HTML.pm.src: Similar codes are merged together.
1143
1144 2008-03-10 Wakaba <wakaba@suika.fam.cx>
1145
1146 * mkhtmlparser.pl: Set "level" parameter to parse errors.
1147
1148 * HTML.pm.src: Code refined.
1149
1150 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1151
1152 * HTML.pm.src: |</body>| treatement has been changed (HTML5
1153 revision 1348). Note that I really don't know this makes
1154 any difference in the black-box behavior of the parser.
1155
1156 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1157
1158 * HTML.pm.src: New end-of-file token implementation (HTML5
1159 revision 1348).
1160
1161 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1162
1163 * HTML.pm.src: |applet| support (HTML5 revision 1347).
1164
1165 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1166
1167 * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
1168
1169 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1170
1171 * HTML.pm.src: Support for |<input>| in the "in select" insertion
1172 mode and support for the "in select in table" insertion mode (HTML5
1173 revision 1342).
1174
1175 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1176
1177 * HTML.pm.src: No foster parenting for <script> and <script>
1178 in non-tainted <table>s (HTML5 revision 1336).
1179
1180 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1181
1182 * HTML.pm.src: Ignore white space characters between <html>
1183 and <head> (HTML5 revision 1332).
1184
1185 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1186
1187 * HTML.pm.src: Treat <input type=hidden> as if it were a
1188 white space (HTML5 revision 1331).
1189
1190 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1191
1192 * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
1193 element (HTML5 revision 1330).
1194
1195 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1196
1197 * HTML.pm.src: <title> is always appended to the current
1198 element (HTML5 revision 1328).
1199
1200 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1201
1202 * HTML.pm.src: White space in tainted tables are moved
1203 into foster parents (HTML5 revision 1326).
1204
1205 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1206
1207 * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
1208 revision 1321).
1209
1210 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1211
1212 * HTML.pm.src: |</h/n/>| case code rearranged to align with
1213 the spec (HTML5 revision 1320). Note that we finally complete
1214 all of HTML5 revision 1320 changes.
1215
1216 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1217
1218 * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
1219 tags (HTML5 revision 1320).
1220
1221 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1222
1223 * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
1224
1225 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1226
1227 * HTML.pm.src: A "generate implied end tags" code (t409.1)
1228 could not be reached so that it is now removed (HTML5 revision
1229 1320).
1230
1231 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1232
1233 * HTML.pm.src: Code for the case of |</div>| and so on are revised
1234 to align with new spec text (HTML5 revision 1320).
1235
1236 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1237
1238 * HTML.pm.src: Remove strange |if| condition; however, it should
1239 have had no harm in theory.
1240
1241 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1242
1243 * HTML.pm.src (_tree_construction_main): '</p>' in body
1244 case is split from other end tags for the preparation
1245 of implementing HTML revision 1320.
1246
1247 2008-03-07 Wakaba <wakaba@suika.fam.cx>
1248
1249 * HTML.pm.src: Simplified "generate implied end tag" (HTML5
1250 revision 1320).
1251
1252 2008-03-07 Wakaba <wakaba@suika.fam.cx>
1253
1254 * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
1255 and "div" (HTML5 revision 1318). Add comments to where
1256 |form| pointer association codes should be inserted (HTML5 revision
1257 1319).
1258
1259 2008-03-06 Wakaba <wakaba@suika.fam.cx>
1260
1261 * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
1262
1263 2008-03-05 Wakaba <wakaba@suika.fam.cx>
1264
1265 * HTML.pm.src: Since the case t268 should never be reached (no
1266 other token type, there are), it is replaced by a |die| statement.
1267
1268 2008-03-05 Wakaba <wakaba@suika.fam.cx>
1269
1270 * HTML.pm.src: Typo fixed.
1271
1272 2008-03-04 Wakaba <wakaba@suika.fam.cx>
1273
1274 * HTML.pm.src (_tree_construction_initial): Some limited quirks
1275 doctypes were not uppercased for comparison.
1276
1277 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1278
1279 * HTML.pm.src (tree construction and set_inner_html): Checkpoints
1280 are added.
1281
1282 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1283
1284 * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
1285 are set. Cases that are unlikely reached are noted as so.
1286
1287 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1288
1289 * HTML.pm.src: Checkpoints for debugging are added.
1290
1291 * mkhtmlparser.pl: Support for |!!!cp| syntax.
1292
1293 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1294
1295 * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
1296 for simplicity.
1297
1298 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1299
1300 * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
1301 revision 1307).
1302
1303 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1304
1305 * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
1306 is set are changed (HTML5 revision 1305).
1307
1308 2008-03-02 Wakaba <wakaba@suika.fam.cx>
1309
1310 * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
1311 1292). Entities are not parsed in comment-like part in RCDATA
1312 elements (HTML5 revision 1294). Allow bare & at the end
1313 of attribute value literals (HTML5 revision 1296). More
1314 quirks mode doctypes (HTML5 revision 1302). Requires spaces
1315 between attributes and ban attribute names or unquoted
1316 attribute values containing single or double quotes (HTML5
1317 revision 1303).
1318
1319 2008-03-02 Wakaba <wakaba@suika.fam.cx>
1320
1321 * ContentChecker.pm: Typo fixed. Don't raise "character encoding"
1322 and related errors unless it is an HTML document (though the spec
1323 is unclear on whether it is applied to XHTML document).
1324
1325 * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
1326 are added.
1327
1328 2008-03-02 Wakaba <wakaba@suika.fam.cx>
1329
1330 * HTML.pm.src: s/local_name/manakai_local_name/g.
1331
1332 2008-03-01 Wakaba <wakaba@suika.fam.cx>
1333
1334 * _NamedEntityList.pm: Updated (HTML5 revision 1286).
1335
1336 * HTML.pm.src: |charset| in |content| attribute is
1337 case-insensitive (HTML5 revision 1270).
1338
1339 2008-02-26 Wakaba <wakaba@suika.fam.cx>
1340
1341 * ContentChecker.pm: New status constants are added.
1342 ($ElementDefault): |status| added.
1343 (check_element): Err for non-standard or deprecated elements.
1344 (_attr_status_info): For non-standard or deprecated attributes.
1345
1346 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1347
1348 * ContentChecker.pm (_attr_status_info): New internal method.
1349
1350 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1351
1352 * ContentChecker.pm (check_element): Element standardized
1353 status information is now dispatched.
1354
1355 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1356
1357 * ContentChecker.pm (check_element): Fix |del|-and-significant
1358 problem by adding some more arguments.
1359
1360 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1361
1362 * ContentChecker.pm (check_element): Use context of
1363 container-for-the-purpose-of-content-model element (not transparent
1364 element) for |check_child_element| calling and significant
1365 text flag marking. This reintroduces |<del>|-and-significant
1366 problem again.
1367
1368 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1369
1370 * ContentChecker.pm (check_element): Make semi-transparent
1371 elements ignored for the purpose of phase changes in
1372 content model checking.
1373
1374 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1375
1376 * ContentChecker.pm (check_element): In-element state
1377 was not properly managed for transparent cases.
1378
1379 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1380
1381 * ContentChecker.pm (check_element): Support for |video|
1382 and |audio| as semi-transparent elements.
1383
1384 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1385
1386 * ContentChecker.pm ($HTMLSemiTransparentElements): New.
1387 (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
1388 Support for |html:object| as a semi-transparent element.
1389
1390 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1391
1392 * ContentChecker.pm (check_element): The way to traverse
1393 the tree is entirely revised to make it easier to track
1394 the state of ancestors/descendants. As a result of this
1395 revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1396 support for content model checking for HTML elements |figure|,
1397 |object|, |video|, and |audio| and checking for XML elements (and
1398 some XMLNS checkings) are dropped for now. They will be
1399 reimplemented in due cource.
1400
1401 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1402
1403 * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1404 token (HTML5 revision 1225).
1405
1406 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1407
1408 * ContentChecker.pm ({unsupported_level}): New value.
1409
1410 * HTML.pm.src: Save whether |meta| |content| attribute
1411 contains character references or not.
1412
1413 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1414
1415 * ContentChecker.pm (_get_children): (Incompleted) attempt to
1416 imlement significant content checking for contents
1417 with |del| elements.
1418
1419 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1420
1421 * ContenteChecker.pm ($HTMLTransparentElements): More
1422 elements are added.
1423 (_get_children): HTML |object| elements are now semi-transparent.
1424
1425 * NanoDOM.pm (manakai_html, manakai_head): New methods.
1426
1427 2008-02-16 Wakaba <wakaba@suika.fam.cx>
1428
1429 * CacheManifest.pm: HTML5 revision 1211 implemented.
1430
1431 * CacheManifest.pod: Updated.
1432
1433 2008-02-10 Wakaba <wakaba@suika.fam.cx>
1434
1435 * ContentChecker.pm (check_document, check_element): Support
1436 for second argument ($onsubdoc).
1437 (_get_css_parser): Removed (now it is part of WDCC).
1438
1439 2008-02-09 Wakaba <wakaba@suika.fam.cx>
1440
1441 * ContentChecker.pm (_get_css_parser): New.
1442
1443 2007-11-25 Wakaba <wakaba@suika.fam.cx>
1444
1445 * ContentChecker.pm ($AnyChecker): Old way to add child elements
1446 for checking had been used.
1447
1448 2007-11-25 Wakaba <wakaba@suika.fam.cx>
1449
1450 * ContentChecker.pm (check_element): New todo item type |descendant|.
1451
1452 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1453
1454 * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1455 specced in RFC 2046 and RFC 4288.
1456 (application/atom+xml): Definition added.
1457
1458 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1459
1460 * URIChecker.pm: Make RFC 3986 should-level errors
1461 warnings (rather than SHOULD-level errors).
1462
1463 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1464
1465 * NanoDOM.pm (get_user_data, set_user_data): New methods.
1466
1467 * HTML.pm.src: A flag for character references in attribute
1468 values are added. Set |manakai_has_reference| user data
1469 to |charset| attribute.
1470
1471 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1472
1473 * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1474 attributes.
1475
1476 * ContentChecker.pm (check_document): Warn if charset requirements
1477 cannot be tested.
1478
1479 2007-11-19 Wakaba <wakaba@suika.fam.cx>
1480
1481 * HTML.pm.src (parse_byte_string): Detect charset
1482 by universalchardet if charset parameter is not specified.
1483
1484 * Makefile (Charset-all, Charset-clean): New rules.
1485
1486 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1487
1488 * ContentChecker.pm (check_document): Check the existence
1489 of character encoding declaration and selection of encoding
1490 for HTML document.
1491
1492 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1493
1494 * ContentType.pm (get_sniffed_type): Return also the official
1495 type in list context.
1496
1497 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1498
1499 * ContentType.pm: Sniffing with leading white space
1500 ignoring (HTML5 revisions 1013 and 1016).
1501
1502 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1503
1504 * ContentType.pm: HTML5 revision 1013 changes, except for leading
1505 white spaces, are implemented.
1506
1507 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1508
1509 * HTML.pm.src (parse_char_string): Set |inner_encoding|
1510 attribute if possible.
1511
1512 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1513
1514 * HTML.pm.src (parse_byte_string): New method.
1515 (parse_char_string): New alias for |parse_string|.
1516 (main phase): Invoking "change the encoding" algorithm if desired.
1517
1518 * HTML.pod: Updated.
1519
1520 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1521
1522 * HTML.pod (get_inner_html): Removed.
1523
1524 * Makefile (HTML-all, HTML-clean): New.
1525
1526 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1527
1528 * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1529
1530 2007-11-08 Wakaba <wakaba@suika.fam.cx>
1531
1532 * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1533
1534 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1535
1536 * Makefile: |CacheManifest.html| is added.
1537
1538 * CacheManifest.pod: New file.
1539
1540 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1541
1542 * CacheManifest.pm: New module.
1543
1544 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1545
1546 * HTML.pm.src: Support for application cache selection algorithm
1547 callback.
1548
1549 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1550
1551 * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1552
1553 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1554
1555 * Makefile (clean): New rule.
1556
1557 * NanoDOM.pm (public_id, system_id): New attributes.
1558
1559 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1560
1561 * Makefile (CSS-all, CSS-clean, clean): New rules.
1562
1563 2007-10-14 Wakaba <wakaba@suika.fam.cx>
1564
1565 * ContentChecker.pm (check_document): Support for
1566 new |is_xml_root| flag.
1567 (check_element): Support for new |pluses| state.
1568 (_add_pluses): New method.
1569 (_remove_minuses): Support for new |minus| item.
1570
1571 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1572
1573 * ContentChecker.pm: Raise specific error for invalid
1574 root element.
1575
1576 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1577
1578 * ContentChecker.pm: Set level values for later uses.
1579
1580 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1581
1582 * ContentChecker.pm: Support for language tag validation.
1583
1584 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1585
1586 * LangTag.pm (check_rfc3066_language_tag): New method.
1587
1588 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1589
1590 * LangTag.pm: New module.
1591
1592 2007-09-04 Wakaba <wakaba@suika.fam.cx>
1593
1594 * HTML.pm.src: Some error types were wrong.
1595
1596 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1597
1598 * CSS/: New directory.
1599
1600 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1601
1602 * ContentChecker.pm (_check_get_children): Support
1603 for |noscript| in |head|.
1604
1605 2007-08-12 Wakaba <wakaba@suika.fam.cx>
1606
1607 * URI/: New directory.
1608
1609 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1610
1611 * HTML.pm.src: Tokenizer's states are now represented in
1612 number.
1613
1614 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1615
1616 * HTML.pm.src: |or|s for insertion modes are replaced
1617 by |&|s.
1618
1619 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1620
1621 * HTML.pm.src: Token types are now represented in number.
1622
1623 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1624
1625 * ContentType.pm (SEE ALSO): Updated.
1626
1627 * HTML.pm.src: Insertion modes are now represented in number.
1628
1629 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1630
1631 * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1632 999) is implemented.
1633
1634 2007-08-08 Wakaba <wakaba@suika.fam.cx>
1635
1636 * ContentType.pm: Sniffing for |<script| (HTML5 revision
1637 983) is implemented.
1638
1639 2007-08-06 Wakaba <wakaba@suika.fam.cx>
1640
1641 * ContentChecker.pod: New documentation.
1642
1643 * Makefile: A rule for |ContentChecker.html| is added.
1644
1645 * ContentChecker.pm: A pod "LICENSE" section is added.
1646
1647 * NanoDOM.pm ($VERSION): New variable.
1648
1649 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1650
1651 * H2H.pm: |b|, |i|, and |sub| are added to the
1652 list of allowed HTML elements.
1653
1654 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1655
1656 * H2H.pm: |samp| is added to the list of allowed
1657 HTML elements.
1658
1659 * URIChecker.pm (check_iri): New.
1660 (check_iri_reference): Error type for IRI reference
1661 syntax error is changed.
1662
1663 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1664
1665 * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1666 is added.
1667 (check_document): Load appropriate module before validation.
1668
1669 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1670
1671 * ContentChecker/: New directory.
1672
1673 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1674
1675 * ContentChecker.pm: HTML |time| element is implemented.
1676
1677 * HTMLTable.pm: Comments are updated as HTML5 is revised.
1678
1679 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1680
1681 * ContentChecker.pm (check_document): Return value
1682 even if no document element is found.
1683
1684 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1685
1686 * HTML.pm.src: |$in_body| is no longer a function.
1687
1688 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1689
1690 * HTML.pm.src: The |$in_body| code has been moved down.
1691
1692 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1693
1694 * HTML.pm.src: The "trailing end" insertion mode
1695 is split into "after html body" and "after html frameset"
1696 insertion modes. Their codes are merged with "after body"
1697 and "after frameset" codes. |$previous_insertion_mode|
1698 has been removed. "after frameset" code is
1699 merged with "in frameset" code.
1700
1701 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1702
1703 * HTML.pm.src: The "before head" insertion mode is
1704 merged with the "in head" insertion mode.
1705
1706 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1707
1708 * HTML.pm.src: Cases in "in head" insertion mode are
1709 reorganized.
1710
1711 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1712
1713 * HTML.pm.src: Some cases in "in table" insertion mode
1714 are merged.
1715
1716 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1717
1718 * HTML.pm.src: The "in row" insertion mode is merged
1719 with "in table" insertion mode.
1720
1721 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1722
1723 * HTML.pm.src: The "in table" and "in table body" insertion
1724 modes are merged.
1725
1726 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1727
1728 * HTML.pm.src: There is no "in table head" or "in table foot"
1729 insertion mode!
1730
1731 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1732
1733 * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
1734 now directly invoke the handler.
1735
1736 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1737
1738 * HTML.pm.src: Codes for "in cell" insertion mode
1739 is merged to the "in body" insertion mode code.
1740
1741 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1742
1743 * HTML.pm.src: Codes for "in body" and "in caption"
1744 insertion modes are merged.
1745
1746 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1747
1748 * HTML.pm.src: Two |!!!next-token|s were missing.
1749
1750 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1751
1752 * HTML.pm.src: Use numeric constant for |{content_mode}|
1753 instead of string constant for |{content_model_flag}|.
1754
1755 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1756
1757 * HTML.pm.src: Add the name of the attribute
1758 to the "duplicate attribute" error.
1759
1760 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1761
1762 * ContentChecker.pm: Return the |class| node list.
1763
1764 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1765
1766 * ContentChecker.pm: Return the |id| node list.
1767
1768 * HTML.pm.src: A typo is fixed.
1769
1770 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1771
1772 * ContentChecker.pm: Drop wrong |level => 'error'| specification
1773 from "in HTML:xml:lang" error. Character position
1774 is now the last part of the error type in the URI error
1775 description. Report "unsupported" status for language
1776 tags, media queries, script codes, and style sheets.
1777
1778 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1779
1780 * ContentChecker.pm: Report error if |xml:lang|
1781 in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
1782 in XML.
1783
1784 * NanoDOM.pm (Attr.owner_document): New attribute.
1785
1786 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1787
1788 * HTML.pm.src: The character immediately following
1789 a bare |hcro| was discarded. Fix handling of
1790 entity references in attribute values.
1791
1792 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1793
1794 * HTML.pm.src (main and trailin end phases): Token
1795 types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
1796 factored out. Error types |in html:#DOCTYPE|
1797 and |after html:#DOCTYPE| are merged
1798 into |DOCTYPE in the middle|. |</frameset>|
1799 in fragment parsing mode changed the
1800 insertion mode.
1801
1802 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1803
1804 * HTML.pm.src: |$phase| has been removed; The |trailing end|
1805 phase is now an insertion mode. Treatments for white
1806 space character tokens were incorrect for some
1807 insertion modes. An old |meta| case was not removed.
1808
1809 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1810
1811 * HTML.pm.src: |meta| charset declaration extraction
1812 implemented (but changing the encoding is not yet:-).
1813
1814 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1815
1816 * Charset/: New directory.
1817
1818 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1819
1820 * H2H.pm: New Perl module (created from
1821 manakai's H2H.dis).
1822
1823 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1824
1825 * XMLSerializer.pm: New Perl module (created from
1826 manakai's SimpleLS.dis).
1827
1828 2007-07-07 Wakaba <wakaba@suika.fam.cx>
1829
1830 * HTML.pm.src: |<!---x-->| was not processed correctly.
1831
1832 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1833
1834 * HTML.pm.src: Report correct error message
1835 for |<body></div></body>|.
1836
1837 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1838
1839 * HTMLTable.pm: An error description was incorrect.
1840
1841 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1842
1843 * ContentChecker.pm: Return |{term}| list.
1844
1845 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1846
1847 * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
1848 nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
1849 and <title> outside of head).
1850
1851 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1852
1853 * IMTChecker.pm: Report warning for unregistered
1854 and private types/subtypes.
1855
1856 * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
1857 URIChecker.pm, HTMLTable.pm: Error messages are now
1858 consistent; they are all listed in
1859 <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
1860
1861 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1862
1863 * ContentChecker.pm: |<img ismap>| not in |<a></a>|
1864 is now erred. |<datalist>| is implemented.
1865 Attribute checker for |<command>| and |<menu>| are
1866 added. Support for |contextmenu| global attribute
1867 is added.
1868
1869 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1870
1871 * HTML.pm.src (_reset_insertion_mode): Interpretation
1872 of Step 3 has been changed.
1873
1874 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1875
1876 * HTML.pm.src: Late |<html>| parse error is implemented.
1877
1878 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1879
1880 * URIChecker.pm (check_iri_reference): A |decode| method name was
1881 incorrect.
1882
1883 * ContentChecker.pm: Support for the |footer| element.
1884 Check URI syntax for space-separated URI attributes.
1885 Support for the |tabindex| attribute. Support
1886 for |datetime| attribute.
1887
1888 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1889
1890 * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
1891 character references). HTML5 revision 1.146 (white space
1892 characters before root start tag). HTML5 revision
1893 1.148 (named character references in attribute values).
1894 HTML5 revision 1.152 (<plaintext>.innerHTML get).
1895
1896 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1897
1898 * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
1899 in <head>).
1900
1901 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1902
1903 * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>
1904 in body).
1905
1906 * ContentChecker.pm: HTML5 revision 938 (scoped="").
1907
1908 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1909
1910 * HTML.pm.src: HTML5 revision 923 (matching end tag in CDATA
1911 or RCDATA in fragment parsing mode). HTML5 revision
1912 924 (<!--> and <!--->). HTML5 revision 926 (hn in hn).
1913
1914 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1915
1916 * HTML.pm.src (get_inner_html): HTML5 revision 922 (inner_html
1917 for <pre> and <textarea>).
1918
1919 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1920
1921 * HTML.pm.src: HTML5 revision 920 (<isindex>).
1922
1923 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1924
1925 * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).
1926
1927 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1928
1929 * HTML.pm.src: HTML5 revision 916 (</body>).
1930 HTML5 revision 917 (conforming bare &).
1931
1932 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1933
1934 * NanoDOM.pm (manakai_is_html): Setting to false did not work.
1935
1936 * HTML.pm.src: HTML5 revision 914 (</ in CDATA, RCDATA).
1937 HTML5 revision 915 (<nobr>).
1938
1939 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1940
1941 * HTML.pm.src: HTML5 revisions 908, 909, 912, and 913 (quirks mode).
1942
1943 * NanoDOM.pm (manakai_is_html, manakai_compat_mode, compat_mode):
1944 New attributes.
1945
1946 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1947
1948 * HTML.pm.src: HTML5 revisions 900, 901, 902, and 911 (<
1949 in tags).
1950
1951 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1952
1953 * .cvsignore: |Entities.html| is added.
1954
1955 * HTML.pm.src: |$entity_char| is removed and
1956 requires |Whatpm::_NamedEntityList| instead.
1957 HTML5 revision 898 (refc), except that lack of refc
1958 is parse error.
1959
1960 * mkentitylist.pl: New script.
1961
1962 * Makefile (all): |_NamedEntityList.pm| is added.
1963 (_NamedEntityList.pm, Entities.html): New rules.
1964
1965 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1966
1967 * HTML.pm.src: Parse errors immediately after U+000D
1968 were ignored and U+000D immediately following another
1969 U+000D was not converted to U+000A.
1970
1971 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1972
1973 * HTML.pm.src (set_inner_html): HTML5 revision 892 (adopt
1974 nodes before appended). Parser was not ready for NULL
1975 parse error and escape flag.
1976
1977 * NanoDOM.pm (adopt_node): New.
1978
1979 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1980
1981 * HTML.pm.src: HTML5 revision 886 (insane comment in
1982 CDATA and RCDATA). Note that current implementation
1983 is simply repeating what the spec says and it is maybe not
1984 a best way to do it.
1985
1986 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1987
1988 * HTML.pm.src: HTML5 revision 884 (</form> don't close
1989 the form element if a descendant element without implied end tag has
1990 still been open).
1991
1992 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1993
1994 * ContentChecker.pm: HTML5 revision 881 (Make |id| attribute with
1995 space characters non-conforming).
1996
1997 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1998
1999 * HTML.pm.src: An error message was incorrect.
2000 HTML5 revision 869 (C1 character references).
2001
2002 2007-06-23 Wakaba <wakaba@suika.fam.cx>
2003
2004 * HTML.pm.src: HTML5 revision 867 (a LF at the beginning of
2005 a |textarea| is removed).
2006
2007 2007-06-05 Wakaba <wakaba@suika.fam.cx>
2008
2009 * NanoDOM.pm (get_attribute_node_ns): New method.
2010
2011 * ContentChecker.pm: |script| |async| and |defer|
2012 no longer require |src|. |async| MUST NOT be
2013 specified if |defer|. (HTML5 revision 858).
2014
2015 2007-05-30 Wakaba <wakaba@suika.fam.cx>
2016
2017 * HTML.pm.src: |<form><form>| went to inifinite loop.
2018
2019 2007-05-27 Wakaba <wakaba@suika.fam.cx>
2020
2021 * ContentChecker.pm (html): Set |is_root| (allowed
2022 as a document element) flag on.
2023 (new): Removed.
2024 (check_document): New method.
2025
2026 2007-05-27 Wakaba <wakaba@suika.fam.cx>
2027
2028 * ContentChecker.pm (thead, tfoot): Checker specifications
2029 were incorrect.
2030
2031 2007-05-27 Wakaba <wakaba@suika.fam.cx>
2032
2033 * ContentChecker.pm ($HTMLURIAttrChecker): Include
2034 error position in the |type| option of the error.
2035
2036 * HTMLTable.pm (form_table): The |$onerror| parameter
2037 is now optional. Some bugs are fixed.
2038
2039 2007-05-27 Wakaba <wakaba@suika.fam.cx>
2040
2041 * HTMLTable.pm: New module.
2042
2043 * ContentChecker.pm (table): Invoke table model error checker.
2044
2045 * NanoDOM.pm (first_child, get_attribute_ns): New.
2046
2047 2007-05-26 Wakaba <wakaba@suika.fam.cx>
2048
2049 * ContentChecker.pm ($HTMLLinkTypesAttrChecker): New checker.
2050 (link/@rel, a/@rel, area/@rel): Use new checker.
2051
2052 * Makefile (_LinkTypeList.pm, RelExtensions.html): New rules.
2053
2054 * _LinkTypeList.pm: New file.
2055
2056 * mklinktypelist.pl: New file.
2057
2058 * .cvsignore: |RelExtensions.html| added.
2059
2060 * NanoDOM.pm (child_nodes): Returns an empty array
2061 for non-child-containing node types.
2062 (text_content): New attribute.
2063
2064 2007-05-26 Wakaba <wakaba@suika.fam.cx>
2065
2066 * IMTChecker.pm: New module.
2067
2068 * ContentChecker.pm ($HTMLIMTAttrChecker): Call IMTChecker
2069 to test parameter value validity.
2070
2071 * HTML.pm.src ($style_start_tag): Attributes were
2072 discarded.
2073
2074 2007-05-25 Wakaba <wakaba@suika.fam.cx>
2075
2076 * ContentChecker.pm ($HTMLURIAttrChecker): Implemented.
2077
2078 2007-05-25 Wakaba <wakaba@suika.fam.cx>
2079
2080 * URIChecker.pm: All recommendations from RFC 3986
2081 and RFC 3987 are listed (not all testable items are checked yet).
2082
2083 2007-05-25 Wakaba <wakaba@suika.fam.cx>
2084
2085 * URIChecker.pm: New module.
2086
2087 2007-05-20 Wakaba <wakaba@suika.fam.cx>
2088
2089 * ContentChecker.pm: Now most attributes are implemented
2090 or associated to some placeholder.
2091 ($ElementDefault): Warn unknown attributes
2092 for unknown elements as "attribute not supported".
2093 ($HTMLLanguageTagAttrChecker, $HTMLMQAttrChecker): New placeholders.
2094 ($HTMLUsemapAttrChecker, $HTMLTargetAttrChecker): New checkers.
2095 (|a| attribute checker): Reimplemented.
2096
2097 2007-05-20 Wakaba <wakaba@suika.fam.cx>
2098
2099 * ContentChecker.pm ($HTMLEventHandlerAttrChecker): New placeholder.
2100 ($HTMLAttrChecker): Event handler content attributes
2101 are added.
2102 (link, embed): Required attribute is now checked.
2103 (embed): Unknown local attributes are no longer warned.
2104
2105 2007-05-20 Wakaba <wakaba@suika.fam.cx>
2106
2107 * ContentChecker.pm ($HTMLSpaceURIsAttrChecker): New placeholder.
2108 ($HTMLIMTAttrChecker): New checker.
2109 (link@rel, link@href, link@type, style@type,
2110 a@href, a@ping, a@ping, a@type, embed@src, embed@type,
2111 object@data, object@type, source@src, source@type, area@alt,
2112 area@shape, area@coords, area@href,
2113 area@ping, area@rel, area@type, script@src,
2114 script@defer, script@async, script@type): Checkers added.
2115
2116 2007-05-20 Wakaba <wakaba@suika.fam.cx>
2117
2118 * ContentChecker.pm: Descendant checking was incorrect.
2119
2120 2007-05-19 Wakaba <wakaba@suika.fam.cx>
2121
2122 * ContentChecker.pm: Support |xml:*| and |xmlns:*|
2123 attributes. Report an error if |Element.prefix|
2124 is |xmlns|.
2125
2126 * NanoDOM.pm (prefix): New attribute.
2127
2128 2007-05-19 Wakaba <wakaba@suika.fam.cx>
2129
2130 * HTML.pm.src: In |main| phase, |in body| insertion
2131 mode, action for |<iframe>| was missing.
2132
2133 2007-05-19 Wakaba <wakaba@suika.fam.cx>
2134
2135 * ContentChecker.pm: Support for many of HTML5 elements.
2136 ($GetHTMLNonNegativeIntegerAttrChecker): New.
2137
2138 2007-05-19 Wakaba <wakaba@suika.fam.cx>
2139
2140 * ContentChecker.pm: Support for most elements up to |progress|.
2141 ($HTMLURIAttrChecker): Placeholder.
2142 ($HTMLIntegerAttrChecker, $GetHTMLFloatingPointNumberAttrChecker):
2143 New.
2144
2145 2007-05-19 Wakaba <wakaba@suika.fam.cx>
2146
2147 * ContentChecker.pm: Attribute checkers
2148 for global attributes, |html|, |base|, |style|, and |meta|.
2149
2150 * NanoDOM.pm (insert_before): Weaken reference
2151 to the parent node.
2152 (Attr::new): Set |owner_element| attribute.
2153 (namespace_uri, manakai_local_name): New attribute implementations.
2154 (owner_element): New attribute.
2155
2156 2007-05-19 Wakaba <wakaba@suika.fam.cx>
2157
2158 * ContentChecker.pm ($AttrChecker, $HTMLAttrChecker,
2159 $AnyChecker->{attr_checker}, $HTMLAttrsChecker,
2160 $Element->{$HTML_NS}->{''}): New.
2161 (check_element): Invoke attrs_checker for each element.
2162
2163 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2164
2165 * ContentChecker.pm: Don't use |manakai_element_type_match|.
2166
2167 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2168
2169 * ContentChecker.pm: Use hashs rather than lists for
2170 element type testings.
2171
2172 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2173
2174 * ContentChecker.pm: Don't generate duplicate
2175 error when an element type is put in the "minus" list
2176 and the element type is not allowed explicitly in the particular
2177 element content model.
2178 (html:a checker): New checker.
2179 (html:details, html:datagrid): New checkers.
2180 (html:legend): New checker.
2181
2182 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2183
2184 * ContentChecker.pm (html:li checker): Implemented.
2185
2186 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2187
2188 * ContentChecker.pm ($HTMLInlineOrStriclyInlineChecker): New
2189 checker.
2190 (html:dd checker): New checker.
2191 (html:q, html:em, html:strong, html:small,
2192 html:m, html:dfn, html:code, html:samp, html:span): New checkers.
2193
2194 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2195
2196 * ContentChecker.pm ($AnyChecker): Renamed
2197 from |$ElementDefault->{checker}|.
2198 ($ElementDefault->{checker}): Throw an error that
2199 the element type is not supported by the checker.
2200 ($HTMLMetadataElement): |html:base| was missing.
2201 ($HTMLEmptyChecker): Don't throw an error
2202 for inter-element whitespace nodes.
2203 (html:html checker): Errors were not
2204 thrown even if |html:head| and/or |html:body|
2205 children were missing.
2206 (html:head checker): An error was not
2207 thrown if <meta charset> appered after other
2208 elements.
2209
2210 2007-05-05 Wakaba <wakaba@suika.fam.cx>
2211
2212 * ContentChecker.pm: |footer|, |video|, |audio|, |script|,
2213 and |noscript| elements are implemented.
2214 (new): New method.
2215
2216 2007-05-04 Wakaba <wakaba@suika.fam.cx>
2217
2218 * ContentChecker.pm: New module.
2219
2220 2007-05-04 Wakaba <wakaba@suika.fam.cx>
2221
2222 * NanoDOM.pm (manakai_parent_element,
2223 document_element, manakai_local_name,
2224 manakai_element_type_match): New method.
2225
2226 2007-05-03 Wakaba <wakaba@suika.fam.cx>
2227
2228 * HTML.pm.src: Replace decimal and hexadecimal numeric
2229 entities in C1 range using Windows-1252 mapping. Bare LF
2230 did not count as new line for error reporting.
2231
2232 2007-05-02 Wakaba <wakaba@suika.fam.cx>
2233
2234 * NanoDOM.pm (DOMImplementation): New class.
2235 (append_child): Weaken the |parent_node| reference.
2236 (create_element_ns, Element new): Set the |owner_document|
2237 reference.
2238 (implementation): New attribute.
2239 (owner_document, local_name, namespace_uri): New attributes.
2240
2241 * HTML.pm.src (parse_string): Line and column numbers
2242 are now provided to error handler.
2243 (!!!parse-error): Short descriptions are added.
2244 (_construct_tree): Split into three methods; support
2245 for innerHTML mode.
2246 (set_inner_html): New method.
2247
2248 2007-05-01 Wakaba <wakaba@suika.fam.cx>
2249
2250 * NanoDOM.html: Documentation is added.
2251
2252 * HTML.pod, ContentType.html: Documentation is revised.
2253
2254 * .cvsignore: Pod2html temporary files are added.
2255
2256 * Makefile: Make |NanoDOM.html|.
2257
2258 2007-05-01 Wakaba <wakaba@suika.fam.cx>
2259
2260 * HTML.pm.src (parse_string): New method.
2261 (get_inner_html): Renamed from |inner_html|.
2262
2263 * Makefile: A rule for |HTML.html| is added.
2264
2265 * HTML.pod: New documentation.
2266
2267 2007-05-01 Wakaba <wakaba@suika.fam.cx>
2268
2269 * NanoDOM.pm (last_child, previous_sibling): New attributes.
2270 (clone_node): Attribute nodes were not completely copied.
2271
2272 * HTML.pm.src: Many bugs are fixed.
2273
2274 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2275
2276 * HTML.pm.src: Some typos are fixed.
2277
2278 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2279
2280 * mkhtmlparser.pl, Makefile: References to the |HTML-consume-entity.src|
2281 are removed.
2282
2283 * HTML.pm.src: Tokenizer's handling on named entities are rewritten.
2284
2285 * HTML-consume-entity.src: Removed.
2286
2287 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2288
2289 * HTML.pm.src: Tokenizer's handling on hexadecimal
2290 numeric entities are rewritten.
2291
2292 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2293
2294 * HTML.pm.src: Some tokenizer bugs are fixed.
2295
2296 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2297
2298 * HTML.pm.src: The tree construction stage is implemented.
2299
2300 * mkhtmlparser.pl: New macros are added.
2301
2302 2007-04-28 Wakaba <wakaba@suika.fam.cx>
2303
2304 * ContentType.pm: A note on bug in the specification
2305 is removed since it's been now fixed.
2306
2307 * .cvsignore: New file.
2308
2309 2007-04-28 Wakaba <wakaba@suika.fam.cx>
2310
2311 * HTML.pm.src, HTML-consume-entity.src: New files.
2312
2313 * Makefile (HTML.pm): New rule.
2314
2315 * mkhtmlparser.pl: New script.
2316
2317 2007-04-25 Wakaba <wakaba@suika.fam.cx>
2318
2319 * Makefile: New file.
2320
2321 2007-04-24 Wakaba <wakaba@suika.fam.cx>
2322
2323 * ContentType.pm: An error in pod is fixed.
2324
2325 2007-04-24 Wakaba <wakaba@suika.fam.cx>
2326
2327 * ContentType.pm: Documentation is added.
2328
2329 2007-04-24 Wakaba <wakaba@suika.fam.cx>
2330
2331 * ContentType.pm: New Perl module.
2332
2333 * ChangeLog: New file.
2334
2335

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24