/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Contents of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.328 - (show annotations) (download)
Sat Sep 20 10:20:47 2008 UTC (16 years, 1 month ago) by wakaba
Branch: MAIN
Changes since 1.327: +7 -0 lines
++ whatpm/t/ChangeLog	20 Sep 2008 10:20:39 -0000
	* tokenizer-test-2.dat: Test result updated (cf. HTML5 revision
	1738).

	* tree-test-1.dat: Test results updated.  New test data for the
	treatement of U+000B are added (HTML5 revision 1738).

2008-09-20  Wakaba  <wakaba@suika.fam.cx>

++ whatpm/Whatpm/ChangeLog	20 Sep 2008 10:19:37 -0000
	* HTML.pm.src: The "anything else" case for the "after after body"
	insertion mode was not updated to swtich to the "in body"
	insertion mode.  U+000B is no longer a space character for the
	purpose of tree construction phase (HTML5 revision 1738).

2008-09-20  Wakaba  <wakaba@suika.fam.cx>

1 2008-09-20 Wakaba <wakaba@suika.fam.cx>
2
3 * HTML.pm.src: The "anything else" case for the "after after body"
4 insertion mode was not updated to swtich to the "in body"
5 insertion mode. U+000B is no longer a space character for the
6 purpose of tree construction phase (HTML5 revision 1738).
7
8 2008-09-20 Wakaba <wakaba@suika.fam.cx>
9
10 * HTML.pm.src: U+000B is no longer a space character (HTML5
11 revision 1738).
12
13 2008-09-20 Wakaba <wakaba@suika.fam.cx>
14
15 * ContentType.pm: 0x0B is no longer a space character (HTML5
16 revision 1738).
17
18 * HTML.pm.src: U+000B is no longer a space character for the
19 algorithm for extracting an encoding from a Content-Type (HTML5
20 revision 1738).
21
22 2008-09-20 Wakaba <wakaba@suika.fam.cx>
23
24 * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
25
26 2008-09-18 Wakaba <wakaba@suika.fam.cx>
27
28 * LangTag.pm: Add checks for remaining requirements from RFC 4646.
29
30 * mklangreg.pl: Sort 'Prefix' values by their length, to ease
31 matching.
32
33 2008-09-18 Wakaba <wakaba@suika.fam.cx>
34
35 * LangTag.pm: Warn for private use language subtags. Error level
36 typos fixed. Support for Suppress-Script field.
37
38 * mklangreg.pl: Support for dumping of nested structure.
39
40 2008-09-18 Wakaba <wakaba@suika.fam.cx>
41
42 * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
43 recommended case as per RFC 4646.
44
45 2008-09-18 Wakaba <wakaba@suika.fam.cx>
46
47 * LangTag.pm (check_rfc4646_langtag): New method.
48
49 2008-09-18 Wakaba <wakaba@suika.fam.cx>
50
51 * mklangreg.pl: New script.
52
53 * Makefile: Updated for creation of the module for language subtag
54 registry.
55
56 2008-09-16 Wakaba <wakaba@suika.fam.cx>
57
58 * Makefile: WebIDL.html added.
59
60 * WebIDL.pod: New documentation.
61
62 2008-09-16 Wakaba <wakaba@suika.fam.cx>
63
64 * WebIDL.pm: Checker's error types are redefined.
65
66 2008-09-16 Wakaba <wakaba@suika.fam.cx>
67
68 * WebIDL.pm: Parser's error types are redefined. Some forward
69 compatible parsing bugs are fixed. Some unreachable codes are
70 commented out.
71
72 2008-09-16 Wakaba <wakaba@suika.fam.cx>
73
74 * WebIDL.pm: Support for the reminding extended attributes are
75 added. It does not satisfy the definition that a forward
76 interface declaration has an extended attribute. It seems that
77 unless explicitly allowed multiple extended attributes with the
78 same name is not allowed, though it is not explicitly mentioned in
79 the spec.
80
81 2008-09-16 Wakaba <wakaba@suika.fam.cx>
82
83 * WebIDL.pm: Unescapes extended attribute names and extended
84 attribute identifiers. Preserve whether an extended attribute has
85 an argument list of not. Support for extended attributes:
86 Constructor, ExceptionConsts, IndexGetter, IndexSetter,
87 NameGetter, NameSetter, and Null.
88 (has_argument_list): New attribute.
89 (idl_text): Stringifies argument lists, if any, even if it is
90 empty.
91
92 2008-09-15 Wakaba <wakaba@suika.fam.cx>
93
94 * HTML.pm.src: New state |PCDATA_STATE|. Use an empty string for
95 |{s_kwd}| in DATA_STATE as default.
96
97 2008-09-15 Wakaba <wakaba@suika.fam.cx>
98
99 * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
100 by |{s_kwd}| in DATA_STATE.
101
102 2008-09-15 Wakaba <wakaba@suika.fam.cx>
103
104 * HTML.pm.src: Shorten keys.
105
106 2008-09-15 Wakaba <wakaba@suika.fam.cx>
107
108 * HTML.pm.src: Remove checking for control character, surrogate
109 pair, or noncharacter code points and non-Unicode code
110 points (they should be handled by Whatpm::Charset::UnicodeChecker).
111 (parse_char_stream): Support for the |$get_wrapper| argument and
112 character stream error handlers.
113
114 2008-09-15 Wakaba <wakaba@suika.fam.cx>
115
116 * ContentChecker.pm: Don't call |loda_ns_module|
117 for null-namespace elements/attributes.
118
119 * HTML.pm.src: Fact out $disallowed_control_chars
120 as a hash.
121
122 2008-09-14 Wakaba <wakaba@suika.fam.cx>
123
124 * HTML.pm.src: Regexp typo fixed. |{prev_char}|
125 and |{next_char}| initializations are moved to initialization
126 method. |{read_until}| now supports buffering. Sync |set_inner_html|
127 with |parse_char_stream|.
128
129 2008-09-14 Wakaba <wakaba@suika.fam.cx>
130
131 * HTML.pm.src (parse_char_stream): Make |set_next_char|
132 invoke |manakai_read_until|, not only |read|, where
133 possible, to decrease the number of |read| method calls.
134
135 * mkhtmlparser.pl: Related changes to the aforementioned
136 modification.
137
138 2008-09-14 Wakaba <wakaba@suika.fam.cx>
139
140 * HTML.pm.src: Use |read| instead of |getc|. |set_inner_html|
141 would report character error from now.
142
143 2008-09-14 Wakaba <wakaba@suika.fam.cx>
144
145 * HTML.pm.src: White-space-leaded non-white-space character
146 tokens in "before head insertion mode" was not
147 correctly handled.
148 (set_inner_html): Reimplemented using CharString decodehandle
149 class. Support for $get_wrapper argument. Support
150 for |{read_until}| feature.
151
152 2008-09-14 Wakaba <wakaba@suika.fam.cx>
153
154 * HTML.pm.src: Make a "bare ero" error for unknown
155 entities point the "&" character.
156
157 2008-09-14 Wakaba <wakaba@suika.fam.cx>
158
159 * HTML.pm.src: It turns out that U+FFFD don't have to
160 be added to the list of excluded characters.
161
162 2008-09-14 Wakaba <wakaba@suika.fam.cx>
163
164 * HTML.pm.src ($char_onerror): Have character decoder's |line|
165 and |column| a higher priority than the one set by the
166 tokenizer's input handler.
167 ($self->{read_until}): Exclude U+FFFD (but this might
168 not be necessary, since now we do line/column fixup in
169 the character decode handle).
170
171 2008-09-14 Wakaba <wakaba@suika.fam.cx>
172
173 * HTML.pm.src: Use |{read_until}| where possible.
174
175 2008-09-14 Wakaba <wakaba@suika.fam.cx>
176
177 * HTML.pm.src: Change |{getc_until}| to |{read_until}|
178 and |manakai_getc_until| to |manakai_read_until| to
179 reduce the number of string copies.
180
181 2008-09-14 Wakaba <wakaba@suika.fam.cx>
182
183 * HTML.pm.src (parse_char_string): Use newly created
184 |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
185 standard feature to |open| a string as a filehandle,
186 since Perl's string filehandle seems not supporting |ungetc|
187 method correctly.
188 (parse_char_stream): Define |{getc_until}| method.
189 (DATA_STATE): Experimental support for |getc_until| feature.
190
191 2008-09-13 Wakaba <wakaba@suika.fam.cx>
192
193 * HTML.pm.src: Check points added to newly added branches.
194
195 2008-09-13 Wakaba <wakaba@suika.fam.cx>
196
197 * HTML.pm.src: Remove |{char}|, which is no longer used.
198 Remove |{entity_in_attr}| and |{last_attribute_value_state}|
199 and replaced by |{prev_state}|.
200
201 * mkhtmlparser.pl: Remove |{char}| feature.
202 Remove |!!!back-next-input-character;| macro.
203
204 2008-09-13 Wakaba <wakaba@suika.fam.cx>
205
206 * HTML.pm.src: Finally we get rid of all the inner loops. Remove
207 entity related tokenizer states in favor of new states
208 implementing the consume character reference algorithm.
209
210 2008-09-13 Wakaba <wakaba@suika.fam.cx>
211
212 * HTML.pm.src: "Consume a character reference" algorithm is
213 now implemented as a tokenizer's state, rather than
214 a method, with minimum changes (more changes will
215 be made, in due course). "Bogus comment state"'s inner
216 loop gets removed.
217
218 2008-09-13 Wakaba <wakaba@suika.fam.cx>
219
220 * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
221 into their own tokenizer states.
222
223 2008-09-13 Wakaba <wakaba@suika.fam.cx>
224
225 * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
226 is split into three states.
227
228 2008-09-13 Wakaba <wakaba@suika.fam.cx>
229
230 * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
231 itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
232 no longer does the tokenizer have to push back next input
233 characters in those states.
234
235 2008-09-13 Wakaba <wakaba@suika.fam.cx>
236
237 * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
238 into four states so that no longer does the tokenizer have to push
239 back next input characters in that state.
240
241 2008-09-11 Wakaba <wakaba@suika.fam.cx>
242
243 * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
244 which can be used to insert some wrapper between the character
245 stream handle and the tokenizer. (It is currently not supported
246 for |set_inner_html| for |Element|s).
247
248 2008-09-10 Wakaba <wakaba@suika.fam.cx>
249
250 * HTML.pm.src: Ignore punctuations in charset names.
251
252 2008-09-10 Wakaba <wakaba@suika.fam.cx>
253
254 * ContentChecker.pm: Support for charset-layer error levels.
255
256 * HTML.pm.src: Don't specify |text| argument for the
257 |chardecode:fallback| error, since it is not the encoding
258 being used alternatively.
259
260 2008-09-06 Wakaba <wakaba@suika.fam.cx>
261
262 * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
263
264 2008-08-31 Wakaba <wakaba@suika.fam.cx>
265
266 * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
267
268 2008-08-31 Wakaba <wakaba@suika.fam.cx>
269
270 * HTML.pm.src: Bug fix and sync with the spec with regard
271 to after after frameset insertion mode processing (HTML5
272 revision 1909). Note that the implementation was wrong
273 per the old spec before the r1909 changes.
274
275 2008-08-30 Wakaba <wakaba@suika.fam.cx>
276
277 * HTMLTable.pm: scope=auto algorithm fix synced with the
278 spec (HTML5 revision 2093).
279 ($process_row): Algorithm step numbers synced with the
280 spec (HTML5 revision 2092).
281
282 2008-08-30 Wakaba <wakaba@suika.fam.cx>
283
284 * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
285 revision 2094).
286
287 2008-08-30 Wakaba <wakaba@suika.fam.cx>
288
289 * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
290
291 2008-08-30 Wakaba <wakaba@suika.fam.cx>
292
293 * HTML.pm.src: '"' and "'" at the end of attribute
294 name (after another attribute) now raise parse error (HTML5
295 revision 2123). Empty unquoted attribute values are no
296 longer allowed (HTML5 revision 2122).
297
298 2008-08-30 Wakaba <wakaba@suika.fam.cx>
299
300 * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
301 revision 2130).
302
303 2008-08-30 Wakaba <wakaba@suika.fam.cx>
304
305 * ContentChecker.pm: |xml:lang| attribute value must be same
306 as |lang| attribute value for HTML elements (HTML5 revision 2062
307 and so on).
308
309 2008-08-30 Wakaba <wakaba@suika.fam.cx>
310
311 * ContentChecker.pm: Error level definition for |xml_id_error|
312 was missing.
313
314 * URIChecker.pm: The end of the URL should be marked as the
315 error location for an empty path error. The position
316 between the userinfo and the port components should be
317 marked as the error location for an empty host error.
318
319 2008-08-30 Wakaba <wakaba@suika.fam.cx>
320
321 * URIChecker.pm: Set parameters representing where in the
322 value the error occurs for errors. Report unknown
323 address format error in warning level, since address
324 formats are rarely added. Path segments starting with "/.."
325 were misinterpreted as a dot-segment.
326
327 2008-08-30 Wakaba <wakaba@suika.fam.cx>
328
329 * URIChecker.pm (check_iri_reference): Requires
330 |Message::DOM::DOMImplementation|.
331
332 2008-08-29 Wakaba <wakaba@suika.fam.cx>
333
334 * IMTChecker.pm: Updated for the new error reporting architecture.
335
336 * ContentChecker.pm: Error levels for IMTs are added.
337
338 2008-08-17 Wakaba <wakaba@suika.fam.cx>
339
340 * H2H.pm (_shift_token): Support for unquoted HTML attribute
341 values.
342
343 2008-08-16 Wakaba <wakaba@suika.fam.cx>
344
345 * CacheManifest.pm: Support for new style of error
346 reports.
347
348 * HTML.pm.src: Set line=1, column=1 to the document node.
349
350 2008-08-16 Wakaba <wakaba@suika.fam.cx>
351
352 * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
353 and URL checkers. Support for more error levels for bogus
354 langauge tag and URL "standards".
355
356 * LangTag.pm, URIChecker.pm: Support for new style error
357 level reporting.
358
359 2008-08-15 Wakaba <wakaba@suika.fam.cx>
360
361 * ContentChecker.pm: Support for RDF/XML error levels.
362
363 * HTMLTable.pm, RDFXML.pm: Support for new style of error level
364 specifying. Error types are revised.
365
366 2008-08-15 Wakaba <wakaba@suika.fam.cx>
367
368 * ContentChecker.pm: All error reporting method calls are
369 renewed.
370
371 2008-08-15 Wakaba <wakaba@suika.fam.cx>
372
373 * HTML.pm.src: All error type names and "text" parameters
374 are revised. Use new style for "level" specification.
375
376 * mkhtmlparser.pl: Use new style for "level" specification.
377
378 2008-08-03 Wakaba <wakaba@suika.fam.cx>
379
380 * WebIDL.pm (parse_char_string): Simplified error
381 reporting process for broken ignored valuetype definition.
382 (Valuetype idl_text): Support for special "DOMString" name.
383
384 2008-08-03 Wakaba <wakaba@suika.fam.cx>
385
386 * WebIDL.pm ($get_scoped_name): Append "::::" if the last
387 terminal of the ScopedName is "DOMString", such that whether
388 the last part of the scoped name is "DOMString" or "_DOMString"
389 later. It is necessary to determine whether a |typedef|
390 definition should be ignored or not.
391 (parse_char_string): Unescape the identifier of
392 exception members.
393 ($resolve): Return undef for builtin types and sequence<T>
394 types (we might not have to do this, however...).
395 (check): Support checking for Exceptions, Valuetypes,
396 and Typedefs.
397 ($serialize_type): Support for "DOMString::::" syntax.
398 (Typedef idl_text): Output Type as "DOMString" if it
399 is really "DOMString" (i.e. its internal representation
400 is "::DOMString::").
401
402 2008-08-03 Wakaba <wakaba@suika.fam.cx>
403
404 * WebIDL.pm ($resolve): New code, based on resolve code
405 for constant types in the |check| method.
406 (check): Support for checking of attributes, operations, and
407 arguments.
408 (Attribute/Operation idl_text): Exception names in getraises,
409 setraises, and raises clauses is serizlied by |$serialize_type|
410 code.
411
412 2008-08-02 Wakaba <wakaba@suika.fam.cx>
413
414 * WebIDL.pm ($integer): Order of selections are changed to match
415 hexadecimal numbers (the original pattern, taken from the spec,
416 was not work for hexadecimal numbers, because the "0" prefix
417 matches to the [0-7]* part (as an empty string) and therefore
418 it does not match with remaining "x..." part of a "0x..." integer
419 literal.
420 ($get_type): It now returns a string, not an array reference,
421 for regular types and |sequence| types (i.e. it in any case
422 returns a string).
423 ($get_next_token): The second item in the array that represents
424 a integer or float token is now a Perl number value, not the
425 original string representation of the number.
426 (check): Support for const value consistency checking.
427 No extended attribute is defined for constants.
428 (Node subclasses): Use simple strings rather than array references
429 for default data type values.
430 ($serialize_type): Type values are now simple strings.
431 (value): If the new attribute value is a false value, then
432 a FALSE value is set to the attribute.
433
434 2008-08-02 Wakaba <wakaba@suika.fam.cx>
435
436 * WebIDL.pm ($get_scoped_name): Now scoped names are stored
437 in its stringified format ("scoped name" as defined in the
438 spec). Note that future version of this module should not use
439 array references for type values and the |type_text| attribute
440 should be made obsolete.
441 (parse_char_string): Unescape attribute names.
442 (check): Support for checking of whether inherited interfaces
443 are actually defined or not. Support for checking of whether
444 interface member identifiers are duplicated or not.
445 ($serialize_type): Scoped names are returned as is. A future
446 version of this code should escape identifiers other than "DOMString",
447 otherwise the idl_text would be non-conforming.
448
449 2008-08-02 Wakaba <wakaba@suika.fam.cx>
450
451 * WebIDL.pm (parse_char_string): Set line/column numbers
452 to generated nodes. Unescape identifiers. Extended attributes
453 for Definition's were ignored.
454 (append_child): Set |parent_node| attribute.
455 (parent_node): New attribute.
456 (check): Support interface/exception members. Support
457 extended attributes. Support definition identifier uniqueness
458 constraint.
459 (qualified_name): New attribute.
460 (Interface/Exception idl_text): Extended attributes were
461 not prepended to the returned text.
462
463 2008-08-02 Wakaba <wakaba@suika.fam.cx>
464
465 * WebIDL.pm (parse_char_string): Set line/column numbers
466 to interface object experimentally. s/shift/pop/g, shift
467 would make things wrong. Support for interface forward
468 declarations was missing. Broken interface declarations
469 with no block were not ignored entirely.
470 (Whatpm::WebIDL::Node): New abstract class. This class
471 makes things easier.
472 (child_nodes): New attribute. Unlike DOM's attribute with
473 same name, this attribute returns a dead list of nodes for
474 simplicity.
475 (get_user_data, set_user_data): New methods.
476 (Module idl_text): A SPACE character should be inserted
477 before the |{| character.
478 (Interface idl_text): Support for interface forward declarations.
479 (is_forward_declaration): New attribute.
480
481 2008-07-19 Wakaba <wakaba@suika.fam.cx>
482
483 * WebIDL.pm (type_text): Better serializer.
484
485 2008-07-19 Wakaba <wakaba@suika.fam.cx>
486
487 * WebIDL.pm: Revise forward-compatible parsing so that
488 it now can handle broken extended attributes and as such.
489
490 2008-07-19 Wakaba <wakaba@suika.fam.cx>
491
492 * WebIDL.pm: Real support for extended attributes.
493 Support for extended attributes with arguments.
494
495 2008-07-19 Wakaba <wakaba@suika.fam.cx>
496
497 * WebIDL.pm: Support for |exception| syntax.
498 (Interface->idl_text): Tentative support for inheritances.
499
500 2008-07-19 Wakaba <wakaba@suika.fam.cx>
501
502 * WebIDL.pm: Hierarchical scoped name support was broken.
503 Support for raises, setraises, and getraises syntaxes.
504
505 2008-07-18 Wakaba <wakaba@suika.fam.cx>
506
507 * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
508 proper support for types, extended attributes, and exceptions yet).
509 WebIDL parser, version 1 (no support for exceptions yet,
510 no proper support for extended attributes yet).
511
512 2008-07-09 Wakaba <wakaba@suika.fam.cx>
513
514 * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
515
516 2008-06-29 Wakaba <wakaba@suika.fam.cx>
517
518 * WebIDL.pm: Support for valuetype and const.
519
520 2008-06-29 Wakaba <wakaba@suika.fam.cx>
521
522 * WebIDL.pm: New module.
523
524 2008-06-15 Wakaba <wakaba@suika.fam.cx>
525
526 * Makefile (Entities.html): URI changed.
527
528 2008-06-08 Wakaba <wakaba@suika.fam.cx>
529
530 * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
531
532 2008-06-01 Wakaba <wakaba@suika.fam.cx>
533
534 * HTML.pm.src (_get_next_token): A parse error was missing.
535
536 2008-06-01 Wakaba <wakaba@suika.fam.cx>
537
538 * mklinktypelist.pl: rel=contact is no longer part of the HTML5
539 spec (commented out). (HTML5 revision 1711).
540
541 2008-05-25 Wakaba <wakaba@suika.fam.cx>
542
543 * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
544
545 * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
546 as UTF-16 (HTML5 revision 1701).
547
548 2008-05-25 Wakaba <wakaba@suika.fam.cx>
549
550 * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
551 1692).
552
553 2008-05-25 Wakaba <wakaba@suika.fam.cx>
554
555 * HTML.pm.src: The secondary insertion mode used when switching
556 to foreign content is the "in body" insertion mode (HTML5 revision
557 1696).
558
559 2008-05-25 Wakaba <wakaba@suika.fam.cx>
560
561 * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
562 revision 1697).
563
564 2008-05-24 Wakaba <wakaba@suika.fam.cx>
565
566 * HTML.pm.src: Support for end-of-file token in foreign content
567 insertion mode (HTML5 revision 1693). Update SVG camelCase
568 attribute list (HTML5 revision 1700). <textarea> closes
569 </select> (HTML5 revision 1699). More start tags close in
570 foreign content insertion mode (HTML5 revision 1698).
571
572 2008-05-24 Wakaba <wakaba@suika.fam.cx>
573
574 * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
575
576 2008-05-24 Wakaba <wakaba@suika.fam.cx>
577
578 * HTML.pm.src: More robust charset parameter detection (HTML5
579 revision 1674).
580
581 2008-05-24 Wakaba <wakaba@suika.fam.cx>
582
583 * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
584 revision 1676).
585
586 2008-05-24 Wakaba <wakaba@suika.fam.cx>
587
588 * HTML.pm.src: Ignore language part of public identifiers for
589 quriks mode detection (HTML5 revision 1679).
590
591 2008-05-24 Wakaba <wakaba@suika.fam.cx>
592
593 * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
594 revision 1685).
595
596 2008-05-24 Wakaba <wakaba@suika.fam.cx>
597
598 * HTML.pm.src: Support for EOF in new states for tags (HTML5
599 revision 1684).
600
601 2008-05-24 Wakaba <wakaba@suika.fam.cx>
602
603 * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
604 work (HTML5 revision 1690).
605
606 2008-05-24 Wakaba <wakaba@suika.fam.cx>
607
608 * HTML.pm.src (_tree_construction_main): Change handling of
609 end tags in head insertion modes (HTML5 revision 1686).
610 (parse_char_string): Bug fix for non-utf8 character string handlings.
611 (parse_char_stream): |ungetc| does not work well for this context.
612
613 2008-05-18 Wakaba <wakaba@suika.fam.cx>
614
615 * HTML.pm.src (parse_byte_string): Redefined to invoke
616 |parse_byte_stream|.
617 (parse_byte_stream): New method.
618
619 2008-05-18 Wakaba <wakaba@suika.fam.cx>
620
621 * HTML.pm.src (parse_byte_string): Fix the column number reported
622 by encoding layer error reporter.
623
624 2008-05-17 Wakaba <wakaba@suika.fam.cx>
625
626 * HTML.pm.src (parse_byte_string): Use streaming decoder
627 rather than converting the whole byte string and then parsing.
628 Propagate errors in character encoding layer.
629 (get_next_token): Precise error reporting for |bare stago| error.
630
631 2008-05-17 Wakaba <wakaba@suika.fam.cx>
632
633 * HTML.pm.src (parse_char_stream): New method.
634 (parse_char_string): This method is now defined as an invocation
635 of the |parse_char_stream| method.
636
637 2008-05-17 Wakaba <wakaba@suika.fam.cx>
638
639 * HTML.pm.src (parse_byte_string): Report various status
640 of the sniffing as info-level errors. Support for new
641 decoding framework in parser resestting.
642 (new): Various default error levels were not set.
643
644 2008-05-17 Wakaba <wakaba@suika.fam.cx>
645
646 * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
647 algorithm, except for the actual sniffing, is implemented
648 with new framework with Message::Charset::Info.
649
650 2008-05-16 Wakaba <wakaba@suika.fam.cx>
651
652 * CacheManifest.pm (_parse): Drop fragment identifiers from
653 URIs in fallback section (HTML5 revision 1596).
654
655 2008-05-10 Wakaba <wakaba@suika.fam.cx>
656
657 * Makefile (Entities.html): URI has changed.
658
659 2008-05-10 Wakaba <wakaba@suika.fam.cx>
660
661 * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
662 1553).
663
664 2008-05-06 Wakaba <wakaba@suika.fam.cx>
665
666 * ContentChecker.pm: Noted that those returned in |table| are
667 no longer table elements, but table objects returned
668 by Whatpm::HTMLTable.
669
670 * HTMLTable.pm (form_table): Return table element node
671 as |$table->{element}|.
672 (assign_header): Support for the |headers=""| attribute.
673
674 2008-05-06 Wakaba <wakaba@suika.fam.cx>
675
676 * HTMLTable.pm (assign_header): New function; first version
677 with no support for headers="".
678 (form_table): Include table width and height to the returned
679 table object for covenience. Indexing in column assignement was wrong.
680 Set whether a data cell is empty or not for convenience.
681
682 2008-05-05 Wakaba <wakaba@suika.fam.cx>
683
684 * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
685
686 2008-05-05 Wakaba <wakaba@suika.fam.cx>
687
688 * HTMLTable.pm: How table model errors are detected is
689 changed (HTML5 revision 1387).
690
691 2008-05-05 Wakaba <wakaba@suika.fam.cx>
692
693 * HTMLTable.pm: The algorithm now moves |tfoot| elements
694 to the end of the table (HTML5 revision 1380).
695
696 2008-05-05 Wakaba <wakaba@suika.fam.cx>
697
698 * HTMLTable.pm: The algorithm is now 0-based indexing, instead
699 of 1-based (HTML5 revision 1376).
700
701 2008-05-05 Wakaba <wakaba@suika.fam.cx>
702
703 * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
704 and "Content-Encoding" no longer prevent sniffing (HTML5
705 revision 1288).
706
707 2008-05-05 Wakaba <wakaba@suika.fam.cx>
708
709 * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
710 1282).
711
712 2008-05-03 Wakaba <wakaba@suika.fam.cx>
713
714 * ContentChecker.pm: Support for global attributes.
715 Status of XML specs are added.
716
717 2008-05-03 Wakaba <wakaba@suika.fam.cx>
718
719 * ContentChecker.pm (check_element): Support for |template=""|
720 and |ref=""| attribute (referent element type checking).
721
722 2008-04-29 Wakaba <wakaba@suika.fam.cx>
723
724 * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
725 1500) is implemented (except for IDNA part and URI-scheme-specific
726 knowledge). Line number counting was wrong for LF-only
727 documents.
728
729 2008-04-13 Wakaba <wakaba@suika.fam.cx>
730
731 * HTML.pm.src: Raise an parse error for any disallowed
732 character (HTML5 revision 1263).
733
734 2008-04-13 Wakaba <wakaba@suika.fam.cx>
735
736 * mkentitylist.pl: Support for new HTML5 entity table format (the
737 definition for |AElig;| was missing).
738
739 2008-04-13 Wakaba <wakaba@suika.fam.cx>
740
741 * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
742 name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
743
744 2008-04-13 Wakaba <wakaba@suika.fam.cx>
745
746 * HTML.pm.src: List of element names that close foreign content
747 insertion mode is added (HTML5 revisions 1412 and 1418).
748
749 2008-04-13 Wakaba <wakaba@suika.fam.cx>
750
751 * HTML.pm.src: Support for |mglyph| and |malignmark|
752 elements (HTML5 revision 1410).
753
754 2008-04-13 Wakaba <wakaba@suika.fam.cx>
755
756 * HTML.pm.src: Support for new long MathML entities (HTML5
757 revision 1406).
758
759 2008-04-13 Wakaba <wakaba@suika.fam.cx>
760
761 * HTML.pm.src: CDATA section support for MathML and SVG
762 elements (HTML5 revision 1404 and 1420).
763
764 2008-04-12 Wakaba <wakaba@suika.fam.cx>
765
766 * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
767 elements (HTML5 revision 1404). Unused !!!macro definitions
768 are removed.
769
770 2008-04-12 Wakaba <wakaba@suika.fam.cx>
771
772 * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
773 are raised is changed (HTML5 revision 1404).
774
775 2008-04-06 Wakaba <wakaba@suika.fam.cx>
776
777 * HTML.pm.src: Category sets that are no longer used are removed.
778
779 2008-04-06 Wakaba <wakaba@suika.fam.cx>
780
781 * HTML.pm.src: The ->[1] property of stack entries are now
782 replaced by constants representing element category.
783
784 2008-04-06 Wakaba <wakaba@suika.fam.cx>
785
786 * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
787 for error reporting. (This is a preparation for using constant
788 value for ->[1].)
789
790 2008-03-22 Wakaba <wakaba@suika.fam.cx>
791
792 * RDFXML.pm: Typo fixed.
793
794 2008-03-22 Wakaba <wakaba@suika.fam.cx>
795
796 * ContentChecker.pm: |fact_level| is now treated
797 as same as |must_level|, i.e. level = |m|.
798 (check_element): Make list of URIs in the DOM.
799
800 2008-03-21 Wakaba <wakaba@suika.fam.cx>
801
802 * RDFXML.pm: Language accessor implemented.
803 Local (null-namespace) attribute support.
804
805 2008-03-21 Wakaba <wakaba@suika.fam.cx>
806
807 * RDFXML.pm: Factored out ID checking code.
808
809 2008-03-21 Wakaba <wakaba@suika.fam.cx>
810
811 * RDFXML.pm: TODO items noted. Validation of ID
812 and URI attributes is implemented. Warn if unknown
813 value is used in rdf:parseType="" attribute.
814
815 * URIChecker.pm (check_rdf_uri_reference): New function.
816
817 2008-03-21 Wakaba <wakaba@suika.fam.cx>
818
819 * RDFXML.pm: bnodeid implemented. Relative references
820 are now resolved.
821
822 2008-03-21 Wakaba <wakaba@suika.fam.cx>
823
824 * ContentChecker.pm: RDF reification implemented.
825
826 * RDFXML.pm: undef vs false bug fixed.
827 Reification implemented.
828
829 2008-03-21 Wakaba <wakaba@suika.fam.cx>
830
831 * RDFXML.pm: s/id/ID/ for attribute name.
832 The |node| arguments are added for |ontriple| calls.
833 Too many "attribute not allowed" errors were raised.
834
835 * ContentChecker.pm: Initial experimental support for rdf:RDF
836 element.
837
838 2008-03-21 Wakaba <wakaba@suika.fam.cx>
839
840 * RDFXML.pm: New module.
841
842 2008-03-20 Wakaba <wakaba@suika.fam.cx>
843
844 * HTML.pm.src (set_inner_html): Line/column number
845 code was old one yet.
846
847 2008-03-20 Wakaba <wakaba@suika.fam.cx>
848
849 * HTML.pm.src: Better line/column reporting for "duplicate attribute"
850 errors. Line/column markings for DOCTYPE, comment, and
851 character tokens are reintroduced; otherwise,
852 error location for "not HTML5" error and errors
853 for implied elements are not attached.
854
855 2008-03-20 Wakaba <wakaba@suika.fam.cx>
856
857 * HTML.pm.src: Set line/column numbers to attributes.
858
859 * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
860 (value): Setter implemented.
861
862 * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
863
864 2008-03-20 Wakaba <wakaba@suika.fam.cx>
865
866 * HTML.pm.src: Unused line/column markings are removed.
867
868 2008-03-20 Wakaba <wakaba@suika.fam.cx>
869
870 * HTML.pm.src (_get_next_token): Remove |first_start_tag|
871 flag, which is no longer used.
872
873 2008-03-17 Wakaba <wakaba@suika.fam.cx>
874
875 * HTML.pm.src: Set line/column information to element nodes.
876
877 * mkhtmlparser.pl (!!!create-element, !!!insert-element,
878 and !!!insert-element-t): Set line/column information to
879 element nodes.
880
881 2008-03-17 Wakaba <wakaba@suika.fam.cx>
882
883 * HTML.pm.src (_get_next_token): The first "<" character
884 in "<?", "<>", or "</>" should be the error point.
885
886 2008-03-16 Wakaba <wakaba@suika.fam.cx>
887
888 * HTML.pm.src: Some more fixes on error position reporting.
889
890 2008-03-16 Wakaba <wakaba@suika.fam.cx>
891
892 * HTML.pm.src: Token-level precious error reporting.
893
894 2008-03-16 Wakaba <wakaba@suika.fam.cx>
895
896 * HTML.pm.src: Preparation for more precious error point
897 reporting.
898
899 2008-03-11 Wakaba <wakaba@suika.fam.cx>
900
901 * HTML.pm.src: Error type revised.
902
903 2008-03-11 Wakaba <wakaba@suika.fam.cx>
904
905 * HTML.pm.src: Similar codes are merged together, again.
906
907 2008-03-11 Wakaba <wakaba@suika.fam.cx>
908
909 * HTML.pm.src: Similar codes are merged together.
910
911 2008-03-10 Wakaba <wakaba@suika.fam.cx>
912
913 * mkhtmlparser.pl: Set "level" parameter to parse errors.
914
915 * HTML.pm.src: Code refined.
916
917 2008-03-09 Wakaba <wakaba@suika.fam.cx>
918
919 * HTML.pm.src: |</body>| treatement has been changed (HTML5
920 revision 1348). Note that I really don't know this makes
921 any difference in the black-box behavior of the parser.
922
923 2008-03-09 Wakaba <wakaba@suika.fam.cx>
924
925 * HTML.pm.src: New end-of-file token implementation (HTML5
926 revision 1348).
927
928 2008-03-09 Wakaba <wakaba@suika.fam.cx>
929
930 * HTML.pm.src: |applet| support (HTML5 revision 1347).
931
932 2008-03-09 Wakaba <wakaba@suika.fam.cx>
933
934 * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
935
936 2008-03-09 Wakaba <wakaba@suika.fam.cx>
937
938 * HTML.pm.src: Support for |<input>| in the "in select" insertion
939 mode and support for the "in select in table" insertion mode (HTML5
940 revision 1342).
941
942 2008-03-09 Wakaba <wakaba@suika.fam.cx>
943
944 * HTML.pm.src: No foster parenting for <script> and <script>
945 in non-tainted <table>s (HTML5 revision 1336).
946
947 2008-03-09 Wakaba <wakaba@suika.fam.cx>
948
949 * HTML.pm.src: Ignore white space characters between <html>
950 and <head> (HTML5 revision 1332).
951
952 2008-03-09 Wakaba <wakaba@suika.fam.cx>
953
954 * HTML.pm.src: Treat <input type=hidden> as if it were a
955 white space (HTML5 revision 1331).
956
957 2008-03-08 Wakaba <wakaba@suika.fam.cx>
958
959 * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
960 element (HTML5 revision 1330).
961
962 2008-03-08 Wakaba <wakaba@suika.fam.cx>
963
964 * HTML.pm.src: <title> is always appended to the current
965 element (HTML5 revision 1328).
966
967 2008-03-08 Wakaba <wakaba@suika.fam.cx>
968
969 * HTML.pm.src: White space in tainted tables are moved
970 into foster parents (HTML5 revision 1326).
971
972 2008-03-08 Wakaba <wakaba@suika.fam.cx>
973
974 * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
975 revision 1321).
976
977 2008-03-08 Wakaba <wakaba@suika.fam.cx>
978
979 * HTML.pm.src: |</h/n/>| case code rearranged to align with
980 the spec (HTML5 revision 1320). Note that we finally complete
981 all of HTML5 revision 1320 changes.
982
983 2008-03-08 Wakaba <wakaba@suika.fam.cx>
984
985 * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
986 tags (HTML5 revision 1320).
987
988 2008-03-08 Wakaba <wakaba@suika.fam.cx>
989
990 * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
991
992 2008-03-08 Wakaba <wakaba@suika.fam.cx>
993
994 * HTML.pm.src: A "generate implied end tags" code (t409.1)
995 could not be reached so that it is now removed (HTML5 revision
996 1320).
997
998 2008-03-08 Wakaba <wakaba@suika.fam.cx>
999
1000 * HTML.pm.src: Code for the case of |</div>| and so on are revised
1001 to align with new spec text (HTML5 revision 1320).
1002
1003 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1004
1005 * HTML.pm.src: Remove strange |if| condition; however, it should
1006 have had no harm in theory.
1007
1008 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1009
1010 * HTML.pm.src (_tree_construction_main): '</p>' in body
1011 case is split from other end tags for the preparation
1012 of implementing HTML revision 1320.
1013
1014 2008-03-07 Wakaba <wakaba@suika.fam.cx>
1015
1016 * HTML.pm.src: Simplified "generate implied end tag" (HTML5
1017 revision 1320).
1018
1019 2008-03-07 Wakaba <wakaba@suika.fam.cx>
1020
1021 * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
1022 and "div" (HTML5 revision 1318). Add comments to where
1023 |form| pointer association codes should be inserted (HTML5 revision
1024 1319).
1025
1026 2008-03-06 Wakaba <wakaba@suika.fam.cx>
1027
1028 * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
1029
1030 2008-03-05 Wakaba <wakaba@suika.fam.cx>
1031
1032 * HTML.pm.src: Since the case t268 should never be reached (no
1033 other token type, there are), it is replaced by a |die| statement.
1034
1035 2008-03-05 Wakaba <wakaba@suika.fam.cx>
1036
1037 * HTML.pm.src: Typo fixed.
1038
1039 2008-03-04 Wakaba <wakaba@suika.fam.cx>
1040
1041 * HTML.pm.src (_tree_construction_initial): Some limited quirks
1042 doctypes were not uppercased for comparison.
1043
1044 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1045
1046 * HTML.pm.src (tree construction and set_inner_html): Checkpoints
1047 are added.
1048
1049 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1050
1051 * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
1052 are set. Cases that are unlikely reached are noted as so.
1053
1054 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1055
1056 * HTML.pm.src: Checkpoints for debugging are added.
1057
1058 * mkhtmlparser.pl: Support for |!!!cp| syntax.
1059
1060 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1061
1062 * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
1063 for simplicity.
1064
1065 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1066
1067 * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
1068 revision 1307).
1069
1070 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1071
1072 * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
1073 is set are changed (HTML5 revision 1305).
1074
1075 2008-03-02 Wakaba <wakaba@suika.fam.cx>
1076
1077 * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
1078 1292). Entities are not parsed in comment-like part in RCDATA
1079 elements (HTML5 revision 1294). Allow bare & at the end
1080 of attribute value literals (HTML5 revision 1296). More
1081 quirks mode doctypes (HTML5 revision 1302). Requires spaces
1082 between attributes and ban attribute names or unquoted
1083 attribute values containing single or double quotes (HTML5
1084 revision 1303).
1085
1086 2008-03-02 Wakaba <wakaba@suika.fam.cx>
1087
1088 * ContentChecker.pm: Typo fixed. Don't raise "character encoding"
1089 and related errors unless it is an HTML document (though the spec
1090 is unclear on whether it is applied to XHTML document).
1091
1092 * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
1093 are added.
1094
1095 2008-03-02 Wakaba <wakaba@suika.fam.cx>
1096
1097 * HTML.pm.src: s/local_name/manakai_local_name/g.
1098
1099 2008-03-01 Wakaba <wakaba@suika.fam.cx>
1100
1101 * _NamedEntityList.pm: Updated (HTML5 revision 1286).
1102
1103 * HTML.pm.src: |charset| in |content| attribute is
1104 case-insensitive (HTML5 revision 1270).
1105
1106 2008-02-26 Wakaba <wakaba@suika.fam.cx>
1107
1108 * ContentChecker.pm: New status constants are added.
1109 ($ElementDefault): |status| added.
1110 (check_element): Err for non-standard or deprecated elements.
1111 (_attr_status_info): For non-standard or deprecated attributes.
1112
1113 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1114
1115 * ContentChecker.pm (_attr_status_info): New internal method.
1116
1117 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1118
1119 * ContentChecker.pm (check_element): Element standardized
1120 status information is now dispatched.
1121
1122 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1123
1124 * ContentChecker.pm (check_element): Fix |del|-and-significant
1125 problem by adding some more arguments.
1126
1127 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1128
1129 * ContentChecker.pm (check_element): Use context of
1130 container-for-the-purpose-of-content-model element (not transparent
1131 element) for |check_child_element| calling and significant
1132 text flag marking. This reintroduces |<del>|-and-significant
1133 problem again.
1134
1135 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1136
1137 * ContentChecker.pm (check_element): Make semi-transparent
1138 elements ignored for the purpose of phase changes in
1139 content model checking.
1140
1141 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1142
1143 * ContentChecker.pm (check_element): In-element state
1144 was not properly managed for transparent cases.
1145
1146 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1147
1148 * ContentChecker.pm (check_element): Support for |video|
1149 and |audio| as semi-transparent elements.
1150
1151 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1152
1153 * ContentChecker.pm ($HTMLSemiTransparentElements): New.
1154 (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
1155 Support for |html:object| as a semi-transparent element.
1156
1157 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1158
1159 * ContentChecker.pm (check_element): The way to traverse
1160 the tree is entirely revised to make it easier to track
1161 the state of ancestors/descendants. As a result of this
1162 revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1163 support for content model checking for HTML elements |figure|,
1164 |object|, |video|, and |audio| and checking for XML elements (and
1165 some XMLNS checkings) are dropped for now. They will be
1166 reimplemented in due cource.
1167
1168 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1169
1170 * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1171 token (HTML5 revision 1225).
1172
1173 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1174
1175 * ContentChecker.pm ({unsupported_level}): New value.
1176
1177 * HTML.pm.src: Save whether |meta| |content| attribute
1178 contains character references or not.
1179
1180 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1181
1182 * ContentChecker.pm (_get_children): (Incompleted) attempt to
1183 imlement significant content checking for contents
1184 with |del| elements.
1185
1186 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1187
1188 * ContenteChecker.pm ($HTMLTransparentElements): More
1189 elements are added.
1190 (_get_children): HTML |object| elements are now semi-transparent.
1191
1192 * NanoDOM.pm (manakai_html, manakai_head): New methods.
1193
1194 2008-02-16 Wakaba <wakaba@suika.fam.cx>
1195
1196 * CacheManifest.pm: HTML5 revision 1211 implemented.
1197
1198 * CacheManifest.pod: Updated.
1199
1200 2008-02-10 Wakaba <wakaba@suika.fam.cx>
1201
1202 * ContentChecker.pm (check_document, check_element): Support
1203 for second argument ($onsubdoc).
1204 (_get_css_parser): Removed (now it is part of WDCC).
1205
1206 2008-02-09 Wakaba <wakaba@suika.fam.cx>
1207
1208 * ContentChecker.pm (_get_css_parser): New.
1209
1210 2007-11-25 Wakaba <wakaba@suika.fam.cx>
1211
1212 * ContentChecker.pm ($AnyChecker): Old way to add child elements
1213 for checking had been used.
1214
1215 2007-11-25 Wakaba <wakaba@suika.fam.cx>
1216
1217 * ContentChecker.pm (check_element): New todo item type |descendant|.
1218
1219 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1220
1221 * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1222 specced in RFC 2046 and RFC 4288.
1223 (application/atom+xml): Definition added.
1224
1225 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1226
1227 * URIChecker.pm: Make RFC 3986 should-level errors
1228 warnings (rather than SHOULD-level errors).
1229
1230 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1231
1232 * NanoDOM.pm (get_user_data, set_user_data): New methods.
1233
1234 * HTML.pm.src: A flag for character references in attribute
1235 values are added. Set |manakai_has_reference| user data
1236 to |charset| attribute.
1237
1238 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1239
1240 * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1241 attributes.
1242
1243 * ContentChecker.pm (check_document): Warn if charset requirements
1244 cannot be tested.
1245
1246 2007-11-19 Wakaba <wakaba@suika.fam.cx>
1247
1248 * HTML.pm.src (parse_byte_string): Detect charset
1249 by universalchardet if charset parameter is not specified.
1250
1251 * Makefile (Charset-all, Charset-clean): New rules.
1252
1253 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1254
1255 * ContentChecker.pm (check_document): Check the existence
1256 of character encoding declaration and selection of encoding
1257 for HTML document.
1258
1259 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1260
1261 * ContentType.pm (get_sniffed_type): Return also the official
1262 type in list context.
1263
1264 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1265
1266 * ContentType.pm: Sniffing with leading white space
1267 ignoring (HTML5 revisions 1013 and 1016).
1268
1269 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1270
1271 * ContentType.pm: HTML5 revision 1013 changes, except for leading
1272 white spaces, are implemented.
1273
1274 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1275
1276 * HTML.pm.src (parse_char_string): Set |inner_encoding|
1277 attribute if possible.
1278
1279 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1280
1281 * HTML.pm.src (parse_byte_string): New method.
1282 (parse_char_string): New alias for |parse_string|.
1283 (main phase): Invoking "change the encoding" algorithm if desired.
1284
1285 * HTML.pod: Updated.
1286
1287 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1288
1289 * HTML.pod (get_inner_html): Removed.
1290
1291 * Makefile (HTML-all, HTML-clean): New.
1292
1293 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1294
1295 * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1296
1297 2007-11-08 Wakaba <wakaba@suika.fam.cx>
1298
1299 * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1300
1301 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1302
1303 * Makefile: |CacheManifest.html| is added.
1304
1305 * CacheManifest.pod: New file.
1306
1307 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1308
1309 * CacheManifest.pm: New module.
1310
1311 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1312
1313 * HTML.pm.src: Support for application cache selection algorithm
1314 callback.
1315
1316 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1317
1318 * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1319
1320 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1321
1322 * Makefile (clean): New rule.
1323
1324 * NanoDOM.pm (public_id, system_id): New attributes.
1325
1326 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1327
1328 * Makefile (CSS-all, CSS-clean, clean): New rules.
1329
1330 2007-10-14 Wakaba <wakaba@suika.fam.cx>
1331
1332 * ContentChecker.pm (check_document): Support for
1333 new |is_xml_root| flag.
1334 (check_element): Support for new |pluses| state.
1335 (_add_pluses): New method.
1336 (_remove_minuses): Support for new |minus| item.
1337
1338 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1339
1340 * ContentChecker.pm: Raise specific error for invalid
1341 root element.
1342
1343 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1344
1345 * ContentChecker.pm: Set level values for later uses.
1346
1347 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1348
1349 * ContentChecker.pm: Support for language tag validation.
1350
1351 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1352
1353 * LangTag.pm (check_rfc3066_language_tag): New method.
1354
1355 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1356
1357 * LangTag.pm: New module.
1358
1359 2007-09-04 Wakaba <wakaba@suika.fam.cx>
1360
1361 * HTML.pm.src: Some error types were wrong.
1362
1363 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1364
1365 * CSS/: New directory.
1366
1367 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1368
1369 * ContentChecker.pm (_check_get_children): Support
1370 for |noscript| in |head|.
1371
1372 2007-08-12 Wakaba <wakaba@suika.fam.cx>
1373
1374 * URI/: New directory.
1375
1376 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1377
1378 * HTML.pm.src: Tokenizer's states are now represented in
1379 number.
1380
1381 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1382
1383 * HTML.pm.src: |or|s for insertion modes are replaced
1384 by |&|s.
1385
1386 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1387
1388 * HTML.pm.src: Token types are now represented in number.
1389
1390 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1391
1392 * ContentType.pm (SEE ALSO): Updated.
1393
1394 * HTML.pm.src: Insertion modes are now represented in number.
1395
1396 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1397
1398 * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1399 999) is implemented.
1400
1401 2007-08-08 Wakaba <wakaba@suika.fam.cx>
1402
1403 * ContentType.pm: Sniffing for |<script| (HTML5 revision
1404 983) is implemented.
1405
1406 2007-08-06 Wakaba <wakaba@suika.fam.cx>
1407
1408 * ContentChecker.pod: New documentation.
1409
1410 * Makefile: A rule for |ContentChecker.html| is added.
1411
1412 * ContentChecker.pm: A pod "LICENSE" section is added.
1413
1414 * NanoDOM.pm ($VERSION): New variable.
1415
1416 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1417
1418 * H2H.pm: |b|, |i|, and |sub| are added to the
1419 list of allowed HTML elements.
1420
1421 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1422
1423 * H2H.pm: |samp| is added to the list of allowed
1424 HTML elements.
1425
1426 * URIChecker.pm (check_iri): New.
1427 (check_iri_reference): Error type for IRI reference
1428 syntax error is changed.
1429
1430 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1431
1432 * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1433 is added.
1434 (check_document): Load appropriate module before validation.
1435
1436 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1437
1438 * ContentChecker/: New directory.
1439
1440 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1441
1442 * ContentChecker.pm: HTML |time| element is implemented.
1443
1444 * HTMLTable.pm: Comments are updated as HTML5 is revised.
1445
1446 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1447
1448 * ContentChecker.pm (check_document): Return value
1449 even if no document element is found.
1450
1451 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1452
1453 * HTML.pm.src: |$in_body| is no longer a function.
1454
1455 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1456
1457 * HTML.pm.src: The |$in_body| code has been moved down.
1458
1459 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1460
1461 * HTML.pm.src: The "trailing end" insertion mode
1462 is split into "after html body" and "after html frameset"
1463 insertion modes. Their codes are merged with "after body"
1464 and "after frameset" codes. |$previous_insertion_mode|
1465 has been removed. "after frameset" code is
1466 merged with "in frameset" code.
1467
1468 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1469
1470 * HTML.pm.src: The "before head" insertion mode is
1471 merged with the "in head" insertion mode.
1472
1473 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1474
1475 * HTML.pm.src: Cases in "in head" insertion mode are
1476 reorganized.
1477
1478 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1479
1480 * HTML.pm.src: Some cases in "in table" insertion mode
1481 are merged.
1482
1483 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1484
1485 * HTML.pm.src: The "in row" insertion mode is merged
1486 with "in table" insertion mode.
1487
1488 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1489
1490 * HTML.pm.src: The "in table" and "in table body" insertion
1491 modes are merged.
1492
1493 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1494
1495 * HTML.pm.src: There is no "in table head" or "in table foot"
1496 insertion mode!
1497
1498 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1499
1500 * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
1501 now directly invoke the handler.
1502
1503 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1504
1505 * HTML.pm.src: Codes for "in cell" insertion mode
1506 is merged to the "in body" insertion mode code.
1507
1508 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1509
1510 * HTML.pm.src: Codes for "in body" and "in caption"
1511 insertion modes are merged.
1512
1513 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1514
1515 * HTML.pm.src: Two |!!!next-token|s were missing.
1516
1517 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1518
1519 * HTML.pm.src: Use numeric constant for |{content_mode}|
1520 instead of string constant for |{content_model_flag}|.
1521
1522 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1523
1524 * HTML.pm.src: Add the name of the attribute
1525 to the "duplicate attribute" error.
1526
1527 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1528
1529 * ContentChecker.pm: Return the |class| node list.
1530
1531 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1532
1533 * ContentChecker.pm: Return the |id| node list.
1534
1535 * HTML.pm.src: A typo is fixed.
1536
1537 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1538
1539 * ContentChecker.pm: Drop wrong |level => 'error'| specification
1540 from "in HTML:xml:lang" error. Character position
1541 is now the last part of the error type in the URI error
1542 description. Report "unsupported" status for language
1543 tags, media queries, script codes, and style sheets.
1544
1545 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1546
1547 * ContentChecker.pm: Report error if |xml:lang|
1548 in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
1549 in XML.
1550
1551 * NanoDOM.pm (Attr.owner_document): New attribute.
1552
1553 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1554
1555 * HTML.pm.src: The character immediately following
1556 a bare |hcro| was discarded. Fix handling of
1557 entity references in attribute values.
1558
1559 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1560
1561 * HTML.pm.src (main and trailin end phases): Token
1562 types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
1563 factored out. Error types |in html:#DOCTYPE|
1564 and |after html:#DOCTYPE| are merged
1565 into |DOCTYPE in the middle|. |</frameset>|
1566 in fragment parsing mode changed the
1567 insertion mode.
1568
1569 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1570
1571 * HTML.pm.src: |$phase| has been removed; The |trailing end|
1572 phase is now an insertion mode. Treatments for white
1573 space character tokens were incorrect for some
1574 insertion modes. An old |meta| case was not removed.
1575
1576 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1577
1578 * HTML.pm.src: |meta| charset declaration extraction
1579 implemented (but changing the encoding is not yet:-).
1580
1581 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1582
1583 * Charset/: New directory.
1584
1585 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1586
1587 * H2H.pm: New Perl module (created from
1588 manakai's H2H.dis).
1589
1590 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1591
1592 * XMLSerializer.pm: New Perl module (created from
1593 manakai's SimpleLS.dis).
1594
1595 2007-07-07 Wakaba <wakaba@suika.fam.cx>
1596
1597 * HTML.pm.src: |<!---x-->| was not processed correctly.
1598
1599 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1600
1601 * HTML.pm.src: Report correct error message
1602 for |<body></div></body>|.
1603
1604 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1605
1606 * HTMLTable.pm: An error description was incorrect.
1607
1608 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1609
1610 * ContentChecker.pm: Return |{term}| list.
1611
1612 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1613
1614 * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
1615 nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
1616 and <title> outside of head).
1617
1618 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1619
1620 * IMTChecker.pm: Report warning for unregistered
1621 and private types/subtypes.
1622
1623 * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
1624 URIChecker.pm, HTMLTable.pm: Error messages are now
1625 consistent; they are all listed in
1626 <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
1627
1628 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1629
1630 * ContentChecker.pm: |<img ismap>| not in |<a></a>|
1631 is now erred. |<datalist>| is implemented.
1632 Attribute checker for |<command>| and |<menu>| are
1633 added. Support for |contextmenu| global attribute
1634 is added.
1635
1636 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1637
1638 * HTML.pm.src (_reset_insertion_mode): Interpretation
1639 of Step 3 has been changed.
1640
1641 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1642
1643 * HTML.pm.src: Late |<html>| parse error is implemented.
1644
1645 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1646
1647 * URIChecker.pm (check_iri_reference): A |decode| method name was
1648 incorrect.
1649
1650 * ContentChecker.pm: Support for the |footer| element.
1651 Check URI syntax for space-separated URI attributes.
1652 Support for the |tabindex| attribute. Support
1653 for |datetime| attribute.
1654
1655 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1656
1657 * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
1658 character references). HTML5 revision 1.146 (white space
1659 characters before root start tag). HTML5 revision
1660 1.148 (named character references in attribute values).
1661 HTML5 revision 1.152 (<plaintext>.innerHTML get).
1662
1663 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1664
1665 * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
1666 in <head>).
1667
1668 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1669
1670 * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>
1671 in body).
1672
1673 * ContentChecker.pm: HTML5 revision 938 (scoped="").
1674
1675 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1676
1677 * HTML.pm.src: HTML5 revision 923 (matching end tag in CDATA
1678 or RCDATA in fragment parsing mode). HTML5 revision
1679 924 (<!--> and <!--->). HTML5 revision 926 (hn in hn).
1680
1681 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1682
1683 * HTML.pm.src (get_inner_html): HTML5 revision 922 (inner_html
1684 for <pre> and <textarea>).
1685
1686 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1687
1688 * HTML.pm.src: HTML5 revision 920 (<isindex>).
1689
1690 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1691
1692 * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).
1693
1694 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1695
1696 * HTML.pm.src: HTML5 revision 916 (</body>).
1697 HTML5 revision 917 (conforming bare &).
1698
1699 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1700
1701 * NanoDOM.pm (manakai_is_html): Setting to false did not work.
1702
1703 * HTML.pm.src: HTML5 revision 914 (</ in CDATA, RCDATA).
1704 HTML5 revision 915 (<nobr>).
1705
1706 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1707
1708 * HTML.pm.src: HTML5 revisions 908, 909, 912, and 913 (quirks mode).
1709
1710 * NanoDOM.pm (manakai_is_html, manakai_compat_mode, compat_mode):
1711 New attributes.
1712
1713 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1714
1715 * HTML.pm.src: HTML5 revisions 900, 901, 902, and 911 (<
1716 in tags).
1717
1718 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1719
1720 * .cvsignore: |Entities.html| is added.
1721
1722 * HTML.pm.src: |$entity_char| is removed and
1723 requires |Whatpm::_NamedEntityList| instead.
1724 HTML5 revision 898 (refc), except that lack of refc
1725 is parse error.
1726
1727 * mkentitylist.pl: New script.
1728
1729 * Makefile (all): |_NamedEntityList.pm| is added.
1730 (_NamedEntityList.pm, Entities.html): New rules.
1731
1732 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1733
1734 * HTML.pm.src: Parse errors immediately after U+000D
1735 were ignored and U+000D immediately following another
1736 U+000D was not converted to U+000A.
1737
1738 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1739
1740 * HTML.pm.src (set_inner_html): HTML5 revision 892 (adopt
1741 nodes before appended). Parser was not ready for NULL
1742 parse error and escape flag.
1743
1744 * NanoDOM.pm (adopt_node): New.
1745
1746 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1747
1748 * HTML.pm.src: HTML5 revision 886 (insane comment in
1749 CDATA and RCDATA). Note that current implementation
1750 is simply repeating what the spec says and it is maybe not
1751 a best way to do it.
1752
1753 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1754
1755 * HTML.pm.src: HTML5 revision 884 (</form> don't close
1756 the form element if a descendant element without implied end tag has
1757 still been open).
1758
1759 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1760
1761 * ContentChecker.pm: HTML5 revision 881 (Make |id| attribute with
1762 space characters non-conforming).
1763
1764 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1765
1766 * HTML.pm.src: An error message was incorrect.
1767 HTML5 revision 869 (C1 character references).
1768
1769 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1770
1771 * HTML.pm.src: HTML5 revision 867 (a LF at the beginning of
1772 a |textarea| is removed).
1773
1774 2007-06-05 Wakaba <wakaba@suika.fam.cx>
1775
1776 * NanoDOM.pm (get_attribute_node_ns): New method.
1777
1778 * ContentChecker.pm: |script| |async| and |defer|
1779 no longer require |src|. |async| MUST NOT be
1780 specified if |defer|. (HTML5 revision 858).
1781
1782 2007-05-30 Wakaba <wakaba@suika.fam.cx>
1783
1784 * HTML.pm.src: |<form><form>| went to inifinite loop.
1785
1786 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1787
1788 * ContentChecker.pm (html): Set |is_root| (allowed
1789 as a document element) flag on.
1790 (new): Removed.
1791 (check_document): New method.
1792
1793 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1794
1795 * ContentChecker.pm (thead, tfoot): Checker specifications
1796 were incorrect.
1797
1798 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1799
1800 * ContentChecker.pm ($HTMLURIAttrChecker): Include
1801 error position in the |type| option of the error.
1802
1803 * HTMLTable.pm (form_table): The |$onerror| parameter
1804 is now optional. Some bugs are fixed.
1805
1806 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1807
1808 * HTMLTable.pm: New module.
1809
1810 * ContentChecker.pm (table): Invoke table model error checker.
1811
1812 * NanoDOM.pm (first_child, get_attribute_ns): New.
1813
1814 2007-05-26 Wakaba <wakaba@suika.fam.cx>
1815
1816 * ContentChecker.pm ($HTMLLinkTypesAttrChecker): New checker.
1817 (link/@rel, a/@rel, area/@rel): Use new checker.
1818
1819 * Makefile (_LinkTypeList.pm, RelExtensions.html): New rules.
1820
1821 * _LinkTypeList.pm: New file.
1822
1823 * mklinktypelist.pl: New file.
1824
1825 * .cvsignore: |RelExtensions.html| added.
1826
1827 * NanoDOM.pm (child_nodes): Returns an empty array
1828 for non-child-containing node types.
1829 (text_content): New attribute.
1830
1831 2007-05-26 Wakaba <wakaba@suika.fam.cx>
1832
1833 * IMTChecker.pm: New module.
1834
1835 * ContentChecker.pm ($HTMLIMTAttrChecker): Call IMTChecker
1836 to test parameter value validity.
1837
1838 * HTML.pm.src ($style_start_tag): Attributes were
1839 discarded.
1840
1841 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1842
1843 * ContentChecker.pm ($HTMLURIAttrChecker): Implemented.
1844
1845 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1846
1847 * URIChecker.pm: All recommendations from RFC 3986
1848 and RFC 3987 are listed (not all testable items are checked yet).
1849
1850 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1851
1852 * URIChecker.pm: New module.
1853
1854 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1855
1856 * ContentChecker.pm: Now most attributes are implemented
1857 or associated to some placeholder.
1858 ($ElementDefault): Warn unknown attributes
1859 for unknown elements as "attribute not supported".
1860 ($HTMLLanguageTagAttrChecker, $HTMLMQAttrChecker): New placeholders.
1861 ($HTMLUsemapAttrChecker, $HTMLTargetAttrChecker): New checkers.
1862 (|a| attribute checker): Reimplemented.
1863
1864 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1865
1866 * ContentChecker.pm ($HTMLEventHandlerAttrChecker): New placeholder.
1867 ($HTMLAttrChecker): Event handler content attributes
1868 are added.
1869 (link, embed): Required attribute is now checked.
1870 (embed): Unknown local attributes are no longer warned.
1871
1872 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1873
1874 * ContentChecker.pm ($HTMLSpaceURIsAttrChecker): New placeholder.
1875 ($HTMLIMTAttrChecker): New checker.
1876 (link@rel, link@href, link@type, style@type,
1877 a@href, a@ping, a@ping, a@type, embed@src, embed@type,
1878 object@data, object@type, source@src, source@type, area@alt,
1879 area@shape, area@coords, area@href,
1880 area@ping, area@rel, area@type, script@src,
1881 script@defer, script@async, script@type): Checkers added.
1882
1883 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1884
1885 * ContentChecker.pm: Descendant checking was incorrect.
1886
1887 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1888
1889 * ContentChecker.pm: Support |xml:*| and |xmlns:*|
1890 attributes. Report an error if |Element.prefix|
1891 is |xmlns|.
1892
1893 * NanoDOM.pm (prefix): New attribute.
1894
1895 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1896
1897 * HTML.pm.src: In |main| phase, |in body| insertion
1898 mode, action for |<iframe>| was missing.
1899
1900 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1901
1902 * ContentChecker.pm: Support for many of HTML5 elements.
1903 ($GetHTMLNonNegativeIntegerAttrChecker): New.
1904
1905 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1906
1907 * ContentChecker.pm: Support for most elements up to |progress|.
1908 ($HTMLURIAttrChecker): Placeholder.
1909 ($HTMLIntegerAttrChecker, $GetHTMLFloatingPointNumberAttrChecker):
1910 New.
1911
1912 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1913
1914 * ContentChecker.pm: Attribute checkers
1915 for global attributes, |html|, |base|, |style|, and |meta|.
1916
1917 * NanoDOM.pm (insert_before): Weaken reference
1918 to the parent node.
1919 (Attr::new): Set |owner_element| attribute.
1920 (namespace_uri, manakai_local_name): New attribute implementations.
1921 (owner_element): New attribute.
1922
1923 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1924
1925 * ContentChecker.pm ($AttrChecker, $HTMLAttrChecker,
1926 $AnyChecker->{attr_checker}, $HTMLAttrsChecker,
1927 $Element->{$HTML_NS}->{''}): New.
1928 (check_element): Invoke attrs_checker for each element.
1929
1930 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1931
1932 * ContentChecker.pm: Don't use |manakai_element_type_match|.
1933
1934 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1935
1936 * ContentChecker.pm: Use hashs rather than lists for
1937 element type testings.
1938
1939 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1940
1941 * ContentChecker.pm: Don't generate duplicate
1942 error when an element type is put in the "minus" list
1943 and the element type is not allowed explicitly in the particular
1944 element content model.
1945 (html:a checker): New checker.
1946 (html:details, html:datagrid): New checkers.
1947 (html:legend): New checker.
1948
1949 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1950
1951 * ContentChecker.pm (html:li checker): Implemented.
1952
1953 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1954
1955 * ContentChecker.pm ($HTMLInlineOrStriclyInlineChecker): New
1956 checker.
1957 (html:dd checker): New checker.
1958 (html:q, html:em, html:strong, html:small,
1959 html:m, html:dfn, html:code, html:samp, html:span): New checkers.
1960
1961 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1962
1963 * ContentChecker.pm ($AnyChecker): Renamed
1964 from |$ElementDefault->{checker}|.
1965 ($ElementDefault->{checker}): Throw an error that
1966 the element type is not supported by the checker.
1967 ($HTMLMetadataElement): |html:base| was missing.
1968 ($HTMLEmptyChecker): Don't throw an error
1969 for inter-element whitespace nodes.
1970 (html:html checker): Errors were not
1971 thrown even if |html:head| and/or |html:body|
1972 children were missing.
1973 (html:head checker): An error was not
1974 thrown if <meta charset> appered after other
1975 elements.
1976
1977 2007-05-05 Wakaba <wakaba@suika.fam.cx>
1978
1979 * ContentChecker.pm: |footer|, |video|, |audio|, |script|,
1980 and |noscript| elements are implemented.
1981 (new): New method.
1982
1983 2007-05-04 Wakaba <wakaba@suika.fam.cx>
1984
1985 * ContentChecker.pm: New module.
1986
1987 2007-05-04 Wakaba <wakaba@suika.fam.cx>
1988
1989 * NanoDOM.pm (manakai_parent_element,
1990 document_element, manakai_local_name,
1991 manakai_element_type_match): New method.
1992
1993 2007-05-03 Wakaba <wakaba@suika.fam.cx>
1994
1995 * HTML.pm.src: Replace decimal and hexadecimal numeric
1996 entities in C1 range using Windows-1252 mapping. Bare LF
1997 did not count as new line for error reporting.
1998
1999 2007-05-02 Wakaba <wakaba@suika.fam.cx>
2000
2001 * NanoDOM.pm (DOMImplementation): New class.
2002 (append_child): Weaken the |parent_node| reference.
2003 (create_element_ns, Element new): Set the |owner_document|
2004 reference.
2005 (implementation): New attribute.
2006 (owner_document, local_name, namespace_uri): New attributes.
2007
2008 * HTML.pm.src (parse_string): Line and column numbers
2009 are now provided to error handler.
2010 (!!!parse-error): Short descriptions are added.
2011 (_construct_tree): Split into three methods; support
2012 for innerHTML mode.
2013 (set_inner_html): New method.
2014
2015 2007-05-01 Wakaba <wakaba@suika.fam.cx>
2016
2017 * NanoDOM.html: Documentation is added.
2018
2019 * HTML.pod, ContentType.html: Documentation is revised.
2020
2021 * .cvsignore: Pod2html temporary files are added.
2022
2023 * Makefile: Make |NanoDOM.html|.
2024
2025 2007-05-01 Wakaba <wakaba@suika.fam.cx>
2026
2027 * HTML.pm.src (parse_string): New method.
2028 (get_inner_html): Renamed from |inner_html|.
2029
2030 * Makefile: A rule for |HTML.html| is added.
2031
2032 * HTML.pod: New documentation.
2033
2034 2007-05-01 Wakaba <wakaba@suika.fam.cx>
2035
2036 * NanoDOM.pm (last_child, previous_sibling): New attributes.
2037 (clone_node): Attribute nodes were not completely copied.
2038
2039 * HTML.pm.src: Many bugs are fixed.
2040
2041 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2042
2043 * HTML.pm.src: Some typos are fixed.
2044
2045 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2046
2047 * mkhtmlparser.pl, Makefile: References to the |HTML-consume-entity.src|
2048 are removed.
2049
2050 * HTML.pm.src: Tokenizer's handling on named entities are rewritten.
2051
2052 * HTML-consume-entity.src: Removed.
2053
2054 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2055
2056 * HTML.pm.src: Tokenizer's handling on hexadecimal
2057 numeric entities are rewritten.
2058
2059 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2060
2061 * HTML.pm.src: Some tokenizer bugs are fixed.
2062
2063 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2064
2065 * HTML.pm.src: The tree construction stage is implemented.
2066
2067 * mkhtmlparser.pl: New macros are added.
2068
2069 2007-04-28 Wakaba <wakaba@suika.fam.cx>
2070
2071 * ContentType.pm: A note on bug in the specification
2072 is removed since it's been now fixed.
2073
2074 * .cvsignore: New file.
2075
2076 2007-04-28 Wakaba <wakaba@suika.fam.cx>
2077
2078 * HTML.pm.src, HTML-consume-entity.src: New files.
2079
2080 * Makefile (HTML.pm): New rule.
2081
2082 * mkhtmlparser.pl: New script.
2083
2084 2007-04-25 Wakaba <wakaba@suika.fam.cx>
2085
2086 * Makefile: New file.
2087
2088 2007-04-24 Wakaba <wakaba@suika.fam.cx>
2089
2090 * ContentType.pm: An error in pod is fixed.
2091
2092 2007-04-24 Wakaba <wakaba@suika.fam.cx>
2093
2094 * ContentType.pm: Documentation is added.
2095
2096 2007-04-24 Wakaba <wakaba@suika.fam.cx>
2097
2098 * ContentType.pm: New Perl module.
2099
2100 * ChangeLog: New file.
2101
2102

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24