/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Contents of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.335 - (show annotations) (download)
Thu Oct 2 10:59:04 2008 UTC (16 years, 1 month ago) by wakaba
Branch: MAIN
Changes since 1.334: +7 -0 lines
++ whatpm/t/ChangeLog	2 Oct 2008 10:58:49 -0000
2008-10-02  Wakaba  <wakaba@suika.fam.cx>

	* ContentType.t: More test cases on BOM are added.  Some test
	results updated according to HTML5 revision 1927.  Tests on the
	returned values in the list context are added.

++ whatpm/Whatpm/ChangeLog	2 Oct 2008 10:57:47 -0000
2008-10-02  Wakaba  <wakaba@suika.fam.cx>

	* ContentType.pm (get_sniffed_type): Support for the "better"
	content sniffing (HTML5 revision 1927).  In a case the official
	type was not returned when the method is invoked in the list
	context.

1 2008-10-02 Wakaba <wakaba@suika.fam.cx>
2
3 * ContentType.pm (get_sniffed_type): Support for the "better"
4 content sniffing (HTML5 revision 1927). In a case the official
5 type was not returned when the method is invoked in the list
6 context.
7
8 2008-09-22 Wakaba <wakaba@suika.fam.cx>
9
10 * HTML.pm.src: Character references for non-space C0 characters,
11 including U+000B VT, DEL character, noncharacter code points, are
12 now converted to the U+FFFD character (cf. HTML5 revision 2138).
13
14 2008-09-21 Wakaba <wakaba@suika.fam.cx>
15
16 * ContentChecker.pm: |form=""| check support added.
17
18 2008-09-21 Wakaba <wakaba@suika.fam.cx>
19
20 * ContentChecker.pm: |contextmenu| validness is now checked using
21 |id| and |id_type| properties, and |menu| property is removed.
22
23 2008-09-21 Wakaba <wakaba@suika.fam.cx>
24
25 * ContentChecker.pm: Prepare for |form| |name| attribute's
26 duplication checking.
27
28 2008-09-21 Wakaba <wakaba@suika.fam.cx>
29
30 * HTML.pm.src (parse_byte_stream): Support (or non-support) for
31 unsupported charset="" parameter value (HTML5 revision 2131).
32
33 2008-09-20 Wakaba <wakaba@suika.fam.cx>
34
35 * HTML.pm.src: Reminding places where U+000B is allowed as a space
36 character is fixed (cf. HTML5 revision 1738).
37
38 * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
39 space characters (HTML5 revision 1738).
40
41 2008-09-20 Wakaba <wakaba@suika.fam.cx>
42
43 * HTML.pm.src: The "anything else" case for the "after after body"
44 insertion mode was not updated to swtich to the "in body"
45 insertion mode. U+000B is no longer a space character for the
46 purpose of tree construction phase (HTML5 revision 1738).
47
48 2008-09-20 Wakaba <wakaba@suika.fam.cx>
49
50 * HTML.pm.src: U+000B is no longer a space character (HTML5
51 revision 1738).
52
53 2008-09-20 Wakaba <wakaba@suika.fam.cx>
54
55 * ContentType.pm: 0x0B is no longer a space character (HTML5
56 revision 1738).
57
58 * HTML.pm.src: U+000B is no longer a space character for the
59 algorithm for extracting an encoding from a Content-Type (HTML5
60 revision 1738).
61
62 2008-09-20 Wakaba <wakaba@suika.fam.cx>
63
64 * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
65
66 2008-09-18 Wakaba <wakaba@suika.fam.cx>
67
68 * LangTag.pm: Add checks for remaining requirements from RFC 4646.
69
70 * mklangreg.pl: Sort 'Prefix' values by their length, to ease
71 matching.
72
73 2008-09-18 Wakaba <wakaba@suika.fam.cx>
74
75 * LangTag.pm: Warn for private use language subtags. Error level
76 typos fixed. Support for Suppress-Script field.
77
78 * mklangreg.pl: Support for dumping of nested structure.
79
80 2008-09-18 Wakaba <wakaba@suika.fam.cx>
81
82 * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
83 recommended case as per RFC 4646.
84
85 2008-09-18 Wakaba <wakaba@suika.fam.cx>
86
87 * LangTag.pm (check_rfc4646_langtag): New method.
88
89 2008-09-18 Wakaba <wakaba@suika.fam.cx>
90
91 * mklangreg.pl: New script.
92
93 * Makefile: Updated for creation of the module for language subtag
94 registry.
95
96 2008-09-16 Wakaba <wakaba@suika.fam.cx>
97
98 * Makefile: WebIDL.html added.
99
100 * WebIDL.pod: New documentation.
101
102 2008-09-16 Wakaba <wakaba@suika.fam.cx>
103
104 * WebIDL.pm: Checker's error types are redefined.
105
106 2008-09-16 Wakaba <wakaba@suika.fam.cx>
107
108 * WebIDL.pm: Parser's error types are redefined. Some forward
109 compatible parsing bugs are fixed. Some unreachable codes are
110 commented out.
111
112 2008-09-16 Wakaba <wakaba@suika.fam.cx>
113
114 * WebIDL.pm: Support for the reminding extended attributes are
115 added. It does not satisfy the definition that a forward
116 interface declaration has an extended attribute. It seems that
117 unless explicitly allowed multiple extended attributes with the
118 same name is not allowed, though it is not explicitly mentioned in
119 the spec.
120
121 2008-09-16 Wakaba <wakaba@suika.fam.cx>
122
123 * WebIDL.pm: Unescapes extended attribute names and extended
124 attribute identifiers. Preserve whether an extended attribute has
125 an argument list of not. Support for extended attributes:
126 Constructor, ExceptionConsts, IndexGetter, IndexSetter,
127 NameGetter, NameSetter, and Null.
128 (has_argument_list): New attribute.
129 (idl_text): Stringifies argument lists, if any, even if it is
130 empty.
131
132 2008-09-15 Wakaba <wakaba@suika.fam.cx>
133
134 * HTML.pm.src: New state |PCDATA_STATE|. Use an empty string for
135 |{s_kwd}| in DATA_STATE as default.
136
137 2008-09-15 Wakaba <wakaba@suika.fam.cx>
138
139 * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
140 by |{s_kwd}| in DATA_STATE.
141
142 2008-09-15 Wakaba <wakaba@suika.fam.cx>
143
144 * HTML.pm.src: Shorten keys.
145
146 2008-09-15 Wakaba <wakaba@suika.fam.cx>
147
148 * HTML.pm.src: Remove checking for control character, surrogate
149 pair, or noncharacter code points and non-Unicode code
150 points (they should be handled by Whatpm::Charset::UnicodeChecker).
151 (parse_char_stream): Support for the |$get_wrapper| argument and
152 character stream error handlers.
153
154 2008-09-15 Wakaba <wakaba@suika.fam.cx>
155
156 * ContentChecker.pm: Don't call |loda_ns_module|
157 for null-namespace elements/attributes.
158
159 * HTML.pm.src: Fact out $disallowed_control_chars
160 as a hash.
161
162 2008-09-14 Wakaba <wakaba@suika.fam.cx>
163
164 * HTML.pm.src: Regexp typo fixed. |{prev_char}|
165 and |{next_char}| initializations are moved to initialization
166 method. |{read_until}| now supports buffering. Sync |set_inner_html|
167 with |parse_char_stream|.
168
169 2008-09-14 Wakaba <wakaba@suika.fam.cx>
170
171 * HTML.pm.src (parse_char_stream): Make |set_next_char|
172 invoke |manakai_read_until|, not only |read|, where
173 possible, to decrease the number of |read| method calls.
174
175 * mkhtmlparser.pl: Related changes to the aforementioned
176 modification.
177
178 2008-09-14 Wakaba <wakaba@suika.fam.cx>
179
180 * HTML.pm.src: Use |read| instead of |getc|. |set_inner_html|
181 would report character error from now.
182
183 2008-09-14 Wakaba <wakaba@suika.fam.cx>
184
185 * HTML.pm.src: White-space-leaded non-white-space character
186 tokens in "before head insertion mode" was not
187 correctly handled.
188 (set_inner_html): Reimplemented using CharString decodehandle
189 class. Support for $get_wrapper argument. Support
190 for |{read_until}| feature.
191
192 2008-09-14 Wakaba <wakaba@suika.fam.cx>
193
194 * HTML.pm.src: Make a "bare ero" error for unknown
195 entities point the "&" character.
196
197 2008-09-14 Wakaba <wakaba@suika.fam.cx>
198
199 * HTML.pm.src: It turns out that U+FFFD don't have to
200 be added to the list of excluded characters.
201
202 2008-09-14 Wakaba <wakaba@suika.fam.cx>
203
204 * HTML.pm.src ($char_onerror): Have character decoder's |line|
205 and |column| a higher priority than the one set by the
206 tokenizer's input handler.
207 ($self->{read_until}): Exclude U+FFFD (but this might
208 not be necessary, since now we do line/column fixup in
209 the character decode handle).
210
211 2008-09-14 Wakaba <wakaba@suika.fam.cx>
212
213 * HTML.pm.src: Use |{read_until}| where possible.
214
215 2008-09-14 Wakaba <wakaba@suika.fam.cx>
216
217 * HTML.pm.src: Change |{getc_until}| to |{read_until}|
218 and |manakai_getc_until| to |manakai_read_until| to
219 reduce the number of string copies.
220
221 2008-09-14 Wakaba <wakaba@suika.fam.cx>
222
223 * HTML.pm.src (parse_char_string): Use newly created
224 |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
225 standard feature to |open| a string as a filehandle,
226 since Perl's string filehandle seems not supporting |ungetc|
227 method correctly.
228 (parse_char_stream): Define |{getc_until}| method.
229 (DATA_STATE): Experimental support for |getc_until| feature.
230
231 2008-09-13 Wakaba <wakaba@suika.fam.cx>
232
233 * HTML.pm.src: Check points added to newly added branches.
234
235 2008-09-13 Wakaba <wakaba@suika.fam.cx>
236
237 * HTML.pm.src: Remove |{char}|, which is no longer used.
238 Remove |{entity_in_attr}| and |{last_attribute_value_state}|
239 and replaced by |{prev_state}|.
240
241 * mkhtmlparser.pl: Remove |{char}| feature.
242 Remove |!!!back-next-input-character;| macro.
243
244 2008-09-13 Wakaba <wakaba@suika.fam.cx>
245
246 * HTML.pm.src: Finally we get rid of all the inner loops. Remove
247 entity related tokenizer states in favor of new states
248 implementing the consume character reference algorithm.
249
250 2008-09-13 Wakaba <wakaba@suika.fam.cx>
251
252 * HTML.pm.src: "Consume a character reference" algorithm is
253 now implemented as a tokenizer's state, rather than
254 a method, with minimum changes (more changes will
255 be made, in due course). "Bogus comment state"'s inner
256 loop gets removed.
257
258 2008-09-13 Wakaba <wakaba@suika.fam.cx>
259
260 * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
261 into their own tokenizer states.
262
263 2008-09-13 Wakaba <wakaba@suika.fam.cx>
264
265 * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
266 is split into three states.
267
268 2008-09-13 Wakaba <wakaba@suika.fam.cx>
269
270 * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
271 itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
272 no longer does the tokenizer have to push back next input
273 characters in those states.
274
275 2008-09-13 Wakaba <wakaba@suika.fam.cx>
276
277 * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
278 into four states so that no longer does the tokenizer have to push
279 back next input characters in that state.
280
281 2008-09-11 Wakaba <wakaba@suika.fam.cx>
282
283 * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
284 which can be used to insert some wrapper between the character
285 stream handle and the tokenizer. (It is currently not supported
286 for |set_inner_html| for |Element|s).
287
288 2008-09-10 Wakaba <wakaba@suika.fam.cx>
289
290 * HTML.pm.src: Ignore punctuations in charset names.
291
292 2008-09-10 Wakaba <wakaba@suika.fam.cx>
293
294 * ContentChecker.pm: Support for charset-layer error levels.
295
296 * HTML.pm.src: Don't specify |text| argument for the
297 |chardecode:fallback| error, since it is not the encoding
298 being used alternatively.
299
300 2008-09-06 Wakaba <wakaba@suika.fam.cx>
301
302 * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
303
304 2008-08-31 Wakaba <wakaba@suika.fam.cx>
305
306 * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
307
308 2008-08-31 Wakaba <wakaba@suika.fam.cx>
309
310 * HTML.pm.src: Bug fix and sync with the spec with regard
311 to after after frameset insertion mode processing (HTML5
312 revision 1909). Note that the implementation was wrong
313 per the old spec before the r1909 changes.
314
315 2008-08-30 Wakaba <wakaba@suika.fam.cx>
316
317 * HTMLTable.pm: scope=auto algorithm fix synced with the
318 spec (HTML5 revision 2093).
319 ($process_row): Algorithm step numbers synced with the
320 spec (HTML5 revision 2092).
321
322 2008-08-30 Wakaba <wakaba@suika.fam.cx>
323
324 * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
325 revision 2094).
326
327 2008-08-30 Wakaba <wakaba@suika.fam.cx>
328
329 * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
330
331 2008-08-30 Wakaba <wakaba@suika.fam.cx>
332
333 * HTML.pm.src: '"' and "'" at the end of attribute
334 name (after another attribute) now raise parse error (HTML5
335 revision 2123). Empty unquoted attribute values are no
336 longer allowed (HTML5 revision 2122).
337
338 2008-08-30 Wakaba <wakaba@suika.fam.cx>
339
340 * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
341 revision 2130).
342
343 2008-08-30 Wakaba <wakaba@suika.fam.cx>
344
345 * ContentChecker.pm: |xml:lang| attribute value must be same
346 as |lang| attribute value for HTML elements (HTML5 revision 2062
347 and so on).
348
349 2008-08-30 Wakaba <wakaba@suika.fam.cx>
350
351 * ContentChecker.pm: Error level definition for |xml_id_error|
352 was missing.
353
354 * URIChecker.pm: The end of the URL should be marked as the
355 error location for an empty path error. The position
356 between the userinfo and the port components should be
357 marked as the error location for an empty host error.
358
359 2008-08-30 Wakaba <wakaba@suika.fam.cx>
360
361 * URIChecker.pm: Set parameters representing where in the
362 value the error occurs for errors. Report unknown
363 address format error in warning level, since address
364 formats are rarely added. Path segments starting with "/.."
365 were misinterpreted as a dot-segment.
366
367 2008-08-30 Wakaba <wakaba@suika.fam.cx>
368
369 * URIChecker.pm (check_iri_reference): Requires
370 |Message::DOM::DOMImplementation|.
371
372 2008-08-29 Wakaba <wakaba@suika.fam.cx>
373
374 * IMTChecker.pm: Updated for the new error reporting architecture.
375
376 * ContentChecker.pm: Error levels for IMTs are added.
377
378 2008-08-17 Wakaba <wakaba@suika.fam.cx>
379
380 * H2H.pm (_shift_token): Support for unquoted HTML attribute
381 values.
382
383 2008-08-16 Wakaba <wakaba@suika.fam.cx>
384
385 * CacheManifest.pm: Support for new style of error
386 reports.
387
388 * HTML.pm.src: Set line=1, column=1 to the document node.
389
390 2008-08-16 Wakaba <wakaba@suika.fam.cx>
391
392 * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
393 and URL checkers. Support for more error levels for bogus
394 langauge tag and URL "standards".
395
396 * LangTag.pm, URIChecker.pm: Support for new style error
397 level reporting.
398
399 2008-08-15 Wakaba <wakaba@suika.fam.cx>
400
401 * ContentChecker.pm: Support for RDF/XML error levels.
402
403 * HTMLTable.pm, RDFXML.pm: Support for new style of error level
404 specifying. Error types are revised.
405
406 2008-08-15 Wakaba <wakaba@suika.fam.cx>
407
408 * ContentChecker.pm: All error reporting method calls are
409 renewed.
410
411 2008-08-15 Wakaba <wakaba@suika.fam.cx>
412
413 * HTML.pm.src: All error type names and "text" parameters
414 are revised. Use new style for "level" specification.
415
416 * mkhtmlparser.pl: Use new style for "level" specification.
417
418 2008-08-03 Wakaba <wakaba@suika.fam.cx>
419
420 * WebIDL.pm (parse_char_string): Simplified error
421 reporting process for broken ignored valuetype definition.
422 (Valuetype idl_text): Support for special "DOMString" name.
423
424 2008-08-03 Wakaba <wakaba@suika.fam.cx>
425
426 * WebIDL.pm ($get_scoped_name): Append "::::" if the last
427 terminal of the ScopedName is "DOMString", such that whether
428 the last part of the scoped name is "DOMString" or "_DOMString"
429 later. It is necessary to determine whether a |typedef|
430 definition should be ignored or not.
431 (parse_char_string): Unescape the identifier of
432 exception members.
433 ($resolve): Return undef for builtin types and sequence<T>
434 types (we might not have to do this, however...).
435 (check): Support checking for Exceptions, Valuetypes,
436 and Typedefs.
437 ($serialize_type): Support for "DOMString::::" syntax.
438 (Typedef idl_text): Output Type as "DOMString" if it
439 is really "DOMString" (i.e. its internal representation
440 is "::DOMString::").
441
442 2008-08-03 Wakaba <wakaba@suika.fam.cx>
443
444 * WebIDL.pm ($resolve): New code, based on resolve code
445 for constant types in the |check| method.
446 (check): Support for checking of attributes, operations, and
447 arguments.
448 (Attribute/Operation idl_text): Exception names in getraises,
449 setraises, and raises clauses is serizlied by |$serialize_type|
450 code.
451
452 2008-08-02 Wakaba <wakaba@suika.fam.cx>
453
454 * WebIDL.pm ($integer): Order of selections are changed to match
455 hexadecimal numbers (the original pattern, taken from the spec,
456 was not work for hexadecimal numbers, because the "0" prefix
457 matches to the [0-7]* part (as an empty string) and therefore
458 it does not match with remaining "x..." part of a "0x..." integer
459 literal.
460 ($get_type): It now returns a string, not an array reference,
461 for regular types and |sequence| types (i.e. it in any case
462 returns a string).
463 ($get_next_token): The second item in the array that represents
464 a integer or float token is now a Perl number value, not the
465 original string representation of the number.
466 (check): Support for const value consistency checking.
467 No extended attribute is defined for constants.
468 (Node subclasses): Use simple strings rather than array references
469 for default data type values.
470 ($serialize_type): Type values are now simple strings.
471 (value): If the new attribute value is a false value, then
472 a FALSE value is set to the attribute.
473
474 2008-08-02 Wakaba <wakaba@suika.fam.cx>
475
476 * WebIDL.pm ($get_scoped_name): Now scoped names are stored
477 in its stringified format ("scoped name" as defined in the
478 spec). Note that future version of this module should not use
479 array references for type values and the |type_text| attribute
480 should be made obsolete.
481 (parse_char_string): Unescape attribute names.
482 (check): Support for checking of whether inherited interfaces
483 are actually defined or not. Support for checking of whether
484 interface member identifiers are duplicated or not.
485 ($serialize_type): Scoped names are returned as is. A future
486 version of this code should escape identifiers other than "DOMString",
487 otherwise the idl_text would be non-conforming.
488
489 2008-08-02 Wakaba <wakaba@suika.fam.cx>
490
491 * WebIDL.pm (parse_char_string): Set line/column numbers
492 to generated nodes. Unescape identifiers. Extended attributes
493 for Definition's were ignored.
494 (append_child): Set |parent_node| attribute.
495 (parent_node): New attribute.
496 (check): Support interface/exception members. Support
497 extended attributes. Support definition identifier uniqueness
498 constraint.
499 (qualified_name): New attribute.
500 (Interface/Exception idl_text): Extended attributes were
501 not prepended to the returned text.
502
503 2008-08-02 Wakaba <wakaba@suika.fam.cx>
504
505 * WebIDL.pm (parse_char_string): Set line/column numbers
506 to interface object experimentally. s/shift/pop/g, shift
507 would make things wrong. Support for interface forward
508 declarations was missing. Broken interface declarations
509 with no block were not ignored entirely.
510 (Whatpm::WebIDL::Node): New abstract class. This class
511 makes things easier.
512 (child_nodes): New attribute. Unlike DOM's attribute with
513 same name, this attribute returns a dead list of nodes for
514 simplicity.
515 (get_user_data, set_user_data): New methods.
516 (Module idl_text): A SPACE character should be inserted
517 before the |{| character.
518 (Interface idl_text): Support for interface forward declarations.
519 (is_forward_declaration): New attribute.
520
521 2008-07-19 Wakaba <wakaba@suika.fam.cx>
522
523 * WebIDL.pm (type_text): Better serializer.
524
525 2008-07-19 Wakaba <wakaba@suika.fam.cx>
526
527 * WebIDL.pm: Revise forward-compatible parsing so that
528 it now can handle broken extended attributes and as such.
529
530 2008-07-19 Wakaba <wakaba@suika.fam.cx>
531
532 * WebIDL.pm: Real support for extended attributes.
533 Support for extended attributes with arguments.
534
535 2008-07-19 Wakaba <wakaba@suika.fam.cx>
536
537 * WebIDL.pm: Support for |exception| syntax.
538 (Interface->idl_text): Tentative support for inheritances.
539
540 2008-07-19 Wakaba <wakaba@suika.fam.cx>
541
542 * WebIDL.pm: Hierarchical scoped name support was broken.
543 Support for raises, setraises, and getraises syntaxes.
544
545 2008-07-18 Wakaba <wakaba@suika.fam.cx>
546
547 * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
548 proper support for types, extended attributes, and exceptions yet).
549 WebIDL parser, version 1 (no support for exceptions yet,
550 no proper support for extended attributes yet).
551
552 2008-07-09 Wakaba <wakaba@suika.fam.cx>
553
554 * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
555
556 2008-06-29 Wakaba <wakaba@suika.fam.cx>
557
558 * WebIDL.pm: Support for valuetype and const.
559
560 2008-06-29 Wakaba <wakaba@suika.fam.cx>
561
562 * WebIDL.pm: New module.
563
564 2008-06-15 Wakaba <wakaba@suika.fam.cx>
565
566 * Makefile (Entities.html): URI changed.
567
568 2008-06-08 Wakaba <wakaba@suika.fam.cx>
569
570 * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
571
572 2008-06-01 Wakaba <wakaba@suika.fam.cx>
573
574 * HTML.pm.src (_get_next_token): A parse error was missing.
575
576 2008-06-01 Wakaba <wakaba@suika.fam.cx>
577
578 * mklinktypelist.pl: rel=contact is no longer part of the HTML5
579 spec (commented out). (HTML5 revision 1711).
580
581 2008-05-25 Wakaba <wakaba@suika.fam.cx>
582
583 * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
584
585 * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
586 as UTF-16 (HTML5 revision 1701).
587
588 2008-05-25 Wakaba <wakaba@suika.fam.cx>
589
590 * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
591 1692).
592
593 2008-05-25 Wakaba <wakaba@suika.fam.cx>
594
595 * HTML.pm.src: The secondary insertion mode used when switching
596 to foreign content is the "in body" insertion mode (HTML5 revision
597 1696).
598
599 2008-05-25 Wakaba <wakaba@suika.fam.cx>
600
601 * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
602 revision 1697).
603
604 2008-05-24 Wakaba <wakaba@suika.fam.cx>
605
606 * HTML.pm.src: Support for end-of-file token in foreign content
607 insertion mode (HTML5 revision 1693). Update SVG camelCase
608 attribute list (HTML5 revision 1700). <textarea> closes
609 </select> (HTML5 revision 1699). More start tags close in
610 foreign content insertion mode (HTML5 revision 1698).
611
612 2008-05-24 Wakaba <wakaba@suika.fam.cx>
613
614 * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
615
616 2008-05-24 Wakaba <wakaba@suika.fam.cx>
617
618 * HTML.pm.src: More robust charset parameter detection (HTML5
619 revision 1674).
620
621 2008-05-24 Wakaba <wakaba@suika.fam.cx>
622
623 * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
624 revision 1676).
625
626 2008-05-24 Wakaba <wakaba@suika.fam.cx>
627
628 * HTML.pm.src: Ignore language part of public identifiers for
629 quriks mode detection (HTML5 revision 1679).
630
631 2008-05-24 Wakaba <wakaba@suika.fam.cx>
632
633 * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
634 revision 1685).
635
636 2008-05-24 Wakaba <wakaba@suika.fam.cx>
637
638 * HTML.pm.src: Support for EOF in new states for tags (HTML5
639 revision 1684).
640
641 2008-05-24 Wakaba <wakaba@suika.fam.cx>
642
643 * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
644 work (HTML5 revision 1690).
645
646 2008-05-24 Wakaba <wakaba@suika.fam.cx>
647
648 * HTML.pm.src (_tree_construction_main): Change handling of
649 end tags in head insertion modes (HTML5 revision 1686).
650 (parse_char_string): Bug fix for non-utf8 character string handlings.
651 (parse_char_stream): |ungetc| does not work well for this context.
652
653 2008-05-18 Wakaba <wakaba@suika.fam.cx>
654
655 * HTML.pm.src (parse_byte_string): Redefined to invoke
656 |parse_byte_stream|.
657 (parse_byte_stream): New method.
658
659 2008-05-18 Wakaba <wakaba@suika.fam.cx>
660
661 * HTML.pm.src (parse_byte_string): Fix the column number reported
662 by encoding layer error reporter.
663
664 2008-05-17 Wakaba <wakaba@suika.fam.cx>
665
666 * HTML.pm.src (parse_byte_string): Use streaming decoder
667 rather than converting the whole byte string and then parsing.
668 Propagate errors in character encoding layer.
669 (get_next_token): Precise error reporting for |bare stago| error.
670
671 2008-05-17 Wakaba <wakaba@suika.fam.cx>
672
673 * HTML.pm.src (parse_char_stream): New method.
674 (parse_char_string): This method is now defined as an invocation
675 of the |parse_char_stream| method.
676
677 2008-05-17 Wakaba <wakaba@suika.fam.cx>
678
679 * HTML.pm.src (parse_byte_string): Report various status
680 of the sniffing as info-level errors. Support for new
681 decoding framework in parser resestting.
682 (new): Various default error levels were not set.
683
684 2008-05-17 Wakaba <wakaba@suika.fam.cx>
685
686 * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
687 algorithm, except for the actual sniffing, is implemented
688 with new framework with Message::Charset::Info.
689
690 2008-05-16 Wakaba <wakaba@suika.fam.cx>
691
692 * CacheManifest.pm (_parse): Drop fragment identifiers from
693 URIs in fallback section (HTML5 revision 1596).
694
695 2008-05-10 Wakaba <wakaba@suika.fam.cx>
696
697 * Makefile (Entities.html): URI has changed.
698
699 2008-05-10 Wakaba <wakaba@suika.fam.cx>
700
701 * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
702 1553).
703
704 2008-05-06 Wakaba <wakaba@suika.fam.cx>
705
706 * ContentChecker.pm: Noted that those returned in |table| are
707 no longer table elements, but table objects returned
708 by Whatpm::HTMLTable.
709
710 * HTMLTable.pm (form_table): Return table element node
711 as |$table->{element}|.
712 (assign_header): Support for the |headers=""| attribute.
713
714 2008-05-06 Wakaba <wakaba@suika.fam.cx>
715
716 * HTMLTable.pm (assign_header): New function; first version
717 with no support for headers="".
718 (form_table): Include table width and height to the returned
719 table object for covenience. Indexing in column assignement was wrong.
720 Set whether a data cell is empty or not for convenience.
721
722 2008-05-05 Wakaba <wakaba@suika.fam.cx>
723
724 * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
725
726 2008-05-05 Wakaba <wakaba@suika.fam.cx>
727
728 * HTMLTable.pm: How table model errors are detected is
729 changed (HTML5 revision 1387).
730
731 2008-05-05 Wakaba <wakaba@suika.fam.cx>
732
733 * HTMLTable.pm: The algorithm now moves |tfoot| elements
734 to the end of the table (HTML5 revision 1380).
735
736 2008-05-05 Wakaba <wakaba@suika.fam.cx>
737
738 * HTMLTable.pm: The algorithm is now 0-based indexing, instead
739 of 1-based (HTML5 revision 1376).
740
741 2008-05-05 Wakaba <wakaba@suika.fam.cx>
742
743 * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
744 and "Content-Encoding" no longer prevent sniffing (HTML5
745 revision 1288).
746
747 2008-05-05 Wakaba <wakaba@suika.fam.cx>
748
749 * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
750 1282).
751
752 2008-05-03 Wakaba <wakaba@suika.fam.cx>
753
754 * ContentChecker.pm: Support for global attributes.
755 Status of XML specs are added.
756
757 2008-05-03 Wakaba <wakaba@suika.fam.cx>
758
759 * ContentChecker.pm (check_element): Support for |template=""|
760 and |ref=""| attribute (referent element type checking).
761
762 2008-04-29 Wakaba <wakaba@suika.fam.cx>
763
764 * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
765 1500) is implemented (except for IDNA part and URI-scheme-specific
766 knowledge). Line number counting was wrong for LF-only
767 documents.
768
769 2008-04-13 Wakaba <wakaba@suika.fam.cx>
770
771 * HTML.pm.src: Raise an parse error for any disallowed
772 character (HTML5 revision 1263).
773
774 2008-04-13 Wakaba <wakaba@suika.fam.cx>
775
776 * mkentitylist.pl: Support for new HTML5 entity table format (the
777 definition for |AElig;| was missing).
778
779 2008-04-13 Wakaba <wakaba@suika.fam.cx>
780
781 * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
782 name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
783
784 2008-04-13 Wakaba <wakaba@suika.fam.cx>
785
786 * HTML.pm.src: List of element names that close foreign content
787 insertion mode is added (HTML5 revisions 1412 and 1418).
788
789 2008-04-13 Wakaba <wakaba@suika.fam.cx>
790
791 * HTML.pm.src: Support for |mglyph| and |malignmark|
792 elements (HTML5 revision 1410).
793
794 2008-04-13 Wakaba <wakaba@suika.fam.cx>
795
796 * HTML.pm.src: Support for new long MathML entities (HTML5
797 revision 1406).
798
799 2008-04-13 Wakaba <wakaba@suika.fam.cx>
800
801 * HTML.pm.src: CDATA section support for MathML and SVG
802 elements (HTML5 revision 1404 and 1420).
803
804 2008-04-12 Wakaba <wakaba@suika.fam.cx>
805
806 * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
807 elements (HTML5 revision 1404). Unused !!!macro definitions
808 are removed.
809
810 2008-04-12 Wakaba <wakaba@suika.fam.cx>
811
812 * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
813 are raised is changed (HTML5 revision 1404).
814
815 2008-04-06 Wakaba <wakaba@suika.fam.cx>
816
817 * HTML.pm.src: Category sets that are no longer used are removed.
818
819 2008-04-06 Wakaba <wakaba@suika.fam.cx>
820
821 * HTML.pm.src: The ->[1] property of stack entries are now
822 replaced by constants representing element category.
823
824 2008-04-06 Wakaba <wakaba@suika.fam.cx>
825
826 * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
827 for error reporting. (This is a preparation for using constant
828 value for ->[1].)
829
830 2008-03-22 Wakaba <wakaba@suika.fam.cx>
831
832 * RDFXML.pm: Typo fixed.
833
834 2008-03-22 Wakaba <wakaba@suika.fam.cx>
835
836 * ContentChecker.pm: |fact_level| is now treated
837 as same as |must_level|, i.e. level = |m|.
838 (check_element): Make list of URIs in the DOM.
839
840 2008-03-21 Wakaba <wakaba@suika.fam.cx>
841
842 * RDFXML.pm: Language accessor implemented.
843 Local (null-namespace) attribute support.
844
845 2008-03-21 Wakaba <wakaba@suika.fam.cx>
846
847 * RDFXML.pm: Factored out ID checking code.
848
849 2008-03-21 Wakaba <wakaba@suika.fam.cx>
850
851 * RDFXML.pm: TODO items noted. Validation of ID
852 and URI attributes is implemented. Warn if unknown
853 value is used in rdf:parseType="" attribute.
854
855 * URIChecker.pm (check_rdf_uri_reference): New function.
856
857 2008-03-21 Wakaba <wakaba@suika.fam.cx>
858
859 * RDFXML.pm: bnodeid implemented. Relative references
860 are now resolved.
861
862 2008-03-21 Wakaba <wakaba@suika.fam.cx>
863
864 * ContentChecker.pm: RDF reification implemented.
865
866 * RDFXML.pm: undef vs false bug fixed.
867 Reification implemented.
868
869 2008-03-21 Wakaba <wakaba@suika.fam.cx>
870
871 * RDFXML.pm: s/id/ID/ for attribute name.
872 The |node| arguments are added for |ontriple| calls.
873 Too many "attribute not allowed" errors were raised.
874
875 * ContentChecker.pm: Initial experimental support for rdf:RDF
876 element.
877
878 2008-03-21 Wakaba <wakaba@suika.fam.cx>
879
880 * RDFXML.pm: New module.
881
882 2008-03-20 Wakaba <wakaba@suika.fam.cx>
883
884 * HTML.pm.src (set_inner_html): Line/column number
885 code was old one yet.
886
887 2008-03-20 Wakaba <wakaba@suika.fam.cx>
888
889 * HTML.pm.src: Better line/column reporting for "duplicate attribute"
890 errors. Line/column markings for DOCTYPE, comment, and
891 character tokens are reintroduced; otherwise,
892 error location for "not HTML5" error and errors
893 for implied elements are not attached.
894
895 2008-03-20 Wakaba <wakaba@suika.fam.cx>
896
897 * HTML.pm.src: Set line/column numbers to attributes.
898
899 * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
900 (value): Setter implemented.
901
902 * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
903
904 2008-03-20 Wakaba <wakaba@suika.fam.cx>
905
906 * HTML.pm.src: Unused line/column markings are removed.
907
908 2008-03-20 Wakaba <wakaba@suika.fam.cx>
909
910 * HTML.pm.src (_get_next_token): Remove |first_start_tag|
911 flag, which is no longer used.
912
913 2008-03-17 Wakaba <wakaba@suika.fam.cx>
914
915 * HTML.pm.src: Set line/column information to element nodes.
916
917 * mkhtmlparser.pl (!!!create-element, !!!insert-element,
918 and !!!insert-element-t): Set line/column information to
919 element nodes.
920
921 2008-03-17 Wakaba <wakaba@suika.fam.cx>
922
923 * HTML.pm.src (_get_next_token): The first "<" character
924 in "<?", "<>", or "</>" should be the error point.
925
926 2008-03-16 Wakaba <wakaba@suika.fam.cx>
927
928 * HTML.pm.src: Some more fixes on error position reporting.
929
930 2008-03-16 Wakaba <wakaba@suika.fam.cx>
931
932 * HTML.pm.src: Token-level precious error reporting.
933
934 2008-03-16 Wakaba <wakaba@suika.fam.cx>
935
936 * HTML.pm.src: Preparation for more precious error point
937 reporting.
938
939 2008-03-11 Wakaba <wakaba@suika.fam.cx>
940
941 * HTML.pm.src: Error type revised.
942
943 2008-03-11 Wakaba <wakaba@suika.fam.cx>
944
945 * HTML.pm.src: Similar codes are merged together, again.
946
947 2008-03-11 Wakaba <wakaba@suika.fam.cx>
948
949 * HTML.pm.src: Similar codes are merged together.
950
951 2008-03-10 Wakaba <wakaba@suika.fam.cx>
952
953 * mkhtmlparser.pl: Set "level" parameter to parse errors.
954
955 * HTML.pm.src: Code refined.
956
957 2008-03-09 Wakaba <wakaba@suika.fam.cx>
958
959 * HTML.pm.src: |</body>| treatement has been changed (HTML5
960 revision 1348). Note that I really don't know this makes
961 any difference in the black-box behavior of the parser.
962
963 2008-03-09 Wakaba <wakaba@suika.fam.cx>
964
965 * HTML.pm.src: New end-of-file token implementation (HTML5
966 revision 1348).
967
968 2008-03-09 Wakaba <wakaba@suika.fam.cx>
969
970 * HTML.pm.src: |applet| support (HTML5 revision 1347).
971
972 2008-03-09 Wakaba <wakaba@suika.fam.cx>
973
974 * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
975
976 2008-03-09 Wakaba <wakaba@suika.fam.cx>
977
978 * HTML.pm.src: Support for |<input>| in the "in select" insertion
979 mode and support for the "in select in table" insertion mode (HTML5
980 revision 1342).
981
982 2008-03-09 Wakaba <wakaba@suika.fam.cx>
983
984 * HTML.pm.src: No foster parenting for <script> and <script>
985 in non-tainted <table>s (HTML5 revision 1336).
986
987 2008-03-09 Wakaba <wakaba@suika.fam.cx>
988
989 * HTML.pm.src: Ignore white space characters between <html>
990 and <head> (HTML5 revision 1332).
991
992 2008-03-09 Wakaba <wakaba@suika.fam.cx>
993
994 * HTML.pm.src: Treat <input type=hidden> as if it were a
995 white space (HTML5 revision 1331).
996
997 2008-03-08 Wakaba <wakaba@suika.fam.cx>
998
999 * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
1000 element (HTML5 revision 1330).
1001
1002 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1003
1004 * HTML.pm.src: <title> is always appended to the current
1005 element (HTML5 revision 1328).
1006
1007 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1008
1009 * HTML.pm.src: White space in tainted tables are moved
1010 into foster parents (HTML5 revision 1326).
1011
1012 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1013
1014 * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
1015 revision 1321).
1016
1017 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1018
1019 * HTML.pm.src: |</h/n/>| case code rearranged to align with
1020 the spec (HTML5 revision 1320). Note that we finally complete
1021 all of HTML5 revision 1320 changes.
1022
1023 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1024
1025 * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
1026 tags (HTML5 revision 1320).
1027
1028 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1029
1030 * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
1031
1032 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1033
1034 * HTML.pm.src: A "generate implied end tags" code (t409.1)
1035 could not be reached so that it is now removed (HTML5 revision
1036 1320).
1037
1038 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1039
1040 * HTML.pm.src: Code for the case of |</div>| and so on are revised
1041 to align with new spec text (HTML5 revision 1320).
1042
1043 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1044
1045 * HTML.pm.src: Remove strange |if| condition; however, it should
1046 have had no harm in theory.
1047
1048 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1049
1050 * HTML.pm.src (_tree_construction_main): '</p>' in body
1051 case is split from other end tags for the preparation
1052 of implementing HTML revision 1320.
1053
1054 2008-03-07 Wakaba <wakaba@suika.fam.cx>
1055
1056 * HTML.pm.src: Simplified "generate implied end tag" (HTML5
1057 revision 1320).
1058
1059 2008-03-07 Wakaba <wakaba@suika.fam.cx>
1060
1061 * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
1062 and "div" (HTML5 revision 1318). Add comments to where
1063 |form| pointer association codes should be inserted (HTML5 revision
1064 1319).
1065
1066 2008-03-06 Wakaba <wakaba@suika.fam.cx>
1067
1068 * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
1069
1070 2008-03-05 Wakaba <wakaba@suika.fam.cx>
1071
1072 * HTML.pm.src: Since the case t268 should never be reached (no
1073 other token type, there are), it is replaced by a |die| statement.
1074
1075 2008-03-05 Wakaba <wakaba@suika.fam.cx>
1076
1077 * HTML.pm.src: Typo fixed.
1078
1079 2008-03-04 Wakaba <wakaba@suika.fam.cx>
1080
1081 * HTML.pm.src (_tree_construction_initial): Some limited quirks
1082 doctypes were not uppercased for comparison.
1083
1084 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1085
1086 * HTML.pm.src (tree construction and set_inner_html): Checkpoints
1087 are added.
1088
1089 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1090
1091 * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
1092 are set. Cases that are unlikely reached are noted as so.
1093
1094 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1095
1096 * HTML.pm.src: Checkpoints for debugging are added.
1097
1098 * mkhtmlparser.pl: Support for |!!!cp| syntax.
1099
1100 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1101
1102 * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
1103 for simplicity.
1104
1105 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1106
1107 * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
1108 revision 1307).
1109
1110 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1111
1112 * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
1113 is set are changed (HTML5 revision 1305).
1114
1115 2008-03-02 Wakaba <wakaba@suika.fam.cx>
1116
1117 * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
1118 1292). Entities are not parsed in comment-like part in RCDATA
1119 elements (HTML5 revision 1294). Allow bare & at the end
1120 of attribute value literals (HTML5 revision 1296). More
1121 quirks mode doctypes (HTML5 revision 1302). Requires spaces
1122 between attributes and ban attribute names or unquoted
1123 attribute values containing single or double quotes (HTML5
1124 revision 1303).
1125
1126 2008-03-02 Wakaba <wakaba@suika.fam.cx>
1127
1128 * ContentChecker.pm: Typo fixed. Don't raise "character encoding"
1129 and related errors unless it is an HTML document (though the spec
1130 is unclear on whether it is applied to XHTML document).
1131
1132 * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
1133 are added.
1134
1135 2008-03-02 Wakaba <wakaba@suika.fam.cx>
1136
1137 * HTML.pm.src: s/local_name/manakai_local_name/g.
1138
1139 2008-03-01 Wakaba <wakaba@suika.fam.cx>
1140
1141 * _NamedEntityList.pm: Updated (HTML5 revision 1286).
1142
1143 * HTML.pm.src: |charset| in |content| attribute is
1144 case-insensitive (HTML5 revision 1270).
1145
1146 2008-02-26 Wakaba <wakaba@suika.fam.cx>
1147
1148 * ContentChecker.pm: New status constants are added.
1149 ($ElementDefault): |status| added.
1150 (check_element): Err for non-standard or deprecated elements.
1151 (_attr_status_info): For non-standard or deprecated attributes.
1152
1153 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1154
1155 * ContentChecker.pm (_attr_status_info): New internal method.
1156
1157 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1158
1159 * ContentChecker.pm (check_element): Element standardized
1160 status information is now dispatched.
1161
1162 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1163
1164 * ContentChecker.pm (check_element): Fix |del|-and-significant
1165 problem by adding some more arguments.
1166
1167 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1168
1169 * ContentChecker.pm (check_element): Use context of
1170 container-for-the-purpose-of-content-model element (not transparent
1171 element) for |check_child_element| calling and significant
1172 text flag marking. This reintroduces |<del>|-and-significant
1173 problem again.
1174
1175 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1176
1177 * ContentChecker.pm (check_element): Make semi-transparent
1178 elements ignored for the purpose of phase changes in
1179 content model checking.
1180
1181 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1182
1183 * ContentChecker.pm (check_element): In-element state
1184 was not properly managed for transparent cases.
1185
1186 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1187
1188 * ContentChecker.pm (check_element): Support for |video|
1189 and |audio| as semi-transparent elements.
1190
1191 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1192
1193 * ContentChecker.pm ($HTMLSemiTransparentElements): New.
1194 (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
1195 Support for |html:object| as a semi-transparent element.
1196
1197 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1198
1199 * ContentChecker.pm (check_element): The way to traverse
1200 the tree is entirely revised to make it easier to track
1201 the state of ancestors/descendants. As a result of this
1202 revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1203 support for content model checking for HTML elements |figure|,
1204 |object|, |video|, and |audio| and checking for XML elements (and
1205 some XMLNS checkings) are dropped for now. They will be
1206 reimplemented in due cource.
1207
1208 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1209
1210 * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1211 token (HTML5 revision 1225).
1212
1213 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1214
1215 * ContentChecker.pm ({unsupported_level}): New value.
1216
1217 * HTML.pm.src: Save whether |meta| |content| attribute
1218 contains character references or not.
1219
1220 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1221
1222 * ContentChecker.pm (_get_children): (Incompleted) attempt to
1223 imlement significant content checking for contents
1224 with |del| elements.
1225
1226 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1227
1228 * ContenteChecker.pm ($HTMLTransparentElements): More
1229 elements are added.
1230 (_get_children): HTML |object| elements are now semi-transparent.
1231
1232 * NanoDOM.pm (manakai_html, manakai_head): New methods.
1233
1234 2008-02-16 Wakaba <wakaba@suika.fam.cx>
1235
1236 * CacheManifest.pm: HTML5 revision 1211 implemented.
1237
1238 * CacheManifest.pod: Updated.
1239
1240 2008-02-10 Wakaba <wakaba@suika.fam.cx>
1241
1242 * ContentChecker.pm (check_document, check_element): Support
1243 for second argument ($onsubdoc).
1244 (_get_css_parser): Removed (now it is part of WDCC).
1245
1246 2008-02-09 Wakaba <wakaba@suika.fam.cx>
1247
1248 * ContentChecker.pm (_get_css_parser): New.
1249
1250 2007-11-25 Wakaba <wakaba@suika.fam.cx>
1251
1252 * ContentChecker.pm ($AnyChecker): Old way to add child elements
1253 for checking had been used.
1254
1255 2007-11-25 Wakaba <wakaba@suika.fam.cx>
1256
1257 * ContentChecker.pm (check_element): New todo item type |descendant|.
1258
1259 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1260
1261 * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1262 specced in RFC 2046 and RFC 4288.
1263 (application/atom+xml): Definition added.
1264
1265 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1266
1267 * URIChecker.pm: Make RFC 3986 should-level errors
1268 warnings (rather than SHOULD-level errors).
1269
1270 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1271
1272 * NanoDOM.pm (get_user_data, set_user_data): New methods.
1273
1274 * HTML.pm.src: A flag for character references in attribute
1275 values are added. Set |manakai_has_reference| user data
1276 to |charset| attribute.
1277
1278 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1279
1280 * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1281 attributes.
1282
1283 * ContentChecker.pm (check_document): Warn if charset requirements
1284 cannot be tested.
1285
1286 2007-11-19 Wakaba <wakaba@suika.fam.cx>
1287
1288 * HTML.pm.src (parse_byte_string): Detect charset
1289 by universalchardet if charset parameter is not specified.
1290
1291 * Makefile (Charset-all, Charset-clean): New rules.
1292
1293 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1294
1295 * ContentChecker.pm (check_document): Check the existence
1296 of character encoding declaration and selection of encoding
1297 for HTML document.
1298
1299 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1300
1301 * ContentType.pm (get_sniffed_type): Return also the official
1302 type in list context.
1303
1304 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1305
1306 * ContentType.pm: Sniffing with leading white space
1307 ignoring (HTML5 revisions 1013 and 1016).
1308
1309 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1310
1311 * ContentType.pm: HTML5 revision 1013 changes, except for leading
1312 white spaces, are implemented.
1313
1314 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1315
1316 * HTML.pm.src (parse_char_string): Set |inner_encoding|
1317 attribute if possible.
1318
1319 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1320
1321 * HTML.pm.src (parse_byte_string): New method.
1322 (parse_char_string): New alias for |parse_string|.
1323 (main phase): Invoking "change the encoding" algorithm if desired.
1324
1325 * HTML.pod: Updated.
1326
1327 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1328
1329 * HTML.pod (get_inner_html): Removed.
1330
1331 * Makefile (HTML-all, HTML-clean): New.
1332
1333 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1334
1335 * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1336
1337 2007-11-08 Wakaba <wakaba@suika.fam.cx>
1338
1339 * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1340
1341 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1342
1343 * Makefile: |CacheManifest.html| is added.
1344
1345 * CacheManifest.pod: New file.
1346
1347 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1348
1349 * CacheManifest.pm: New module.
1350
1351 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1352
1353 * HTML.pm.src: Support for application cache selection algorithm
1354 callback.
1355
1356 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1357
1358 * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1359
1360 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1361
1362 * Makefile (clean): New rule.
1363
1364 * NanoDOM.pm (public_id, system_id): New attributes.
1365
1366 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1367
1368 * Makefile (CSS-all, CSS-clean, clean): New rules.
1369
1370 2007-10-14 Wakaba <wakaba@suika.fam.cx>
1371
1372 * ContentChecker.pm (check_document): Support for
1373 new |is_xml_root| flag.
1374 (check_element): Support for new |pluses| state.
1375 (_add_pluses): New method.
1376 (_remove_minuses): Support for new |minus| item.
1377
1378 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1379
1380 * ContentChecker.pm: Raise specific error for invalid
1381 root element.
1382
1383 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1384
1385 * ContentChecker.pm: Set level values for later uses.
1386
1387 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1388
1389 * ContentChecker.pm: Support for language tag validation.
1390
1391 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1392
1393 * LangTag.pm (check_rfc3066_language_tag): New method.
1394
1395 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1396
1397 * LangTag.pm: New module.
1398
1399 2007-09-04 Wakaba <wakaba@suika.fam.cx>
1400
1401 * HTML.pm.src: Some error types were wrong.
1402
1403 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1404
1405 * CSS/: New directory.
1406
1407 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1408
1409 * ContentChecker.pm (_check_get_children): Support
1410 for |noscript| in |head|.
1411
1412 2007-08-12 Wakaba <wakaba@suika.fam.cx>
1413
1414 * URI/: New directory.
1415
1416 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1417
1418 * HTML.pm.src: Tokenizer's states are now represented in
1419 number.
1420
1421 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1422
1423 * HTML.pm.src: |or|s for insertion modes are replaced
1424 by |&|s.
1425
1426 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1427
1428 * HTML.pm.src: Token types are now represented in number.
1429
1430 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1431
1432 * ContentType.pm (SEE ALSO): Updated.
1433
1434 * HTML.pm.src: Insertion modes are now represented in number.
1435
1436 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1437
1438 * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1439 999) is implemented.
1440
1441 2007-08-08 Wakaba <wakaba@suika.fam.cx>
1442
1443 * ContentType.pm: Sniffing for |<script| (HTML5 revision
1444 983) is implemented.
1445
1446 2007-08-06 Wakaba <wakaba@suika.fam.cx>
1447
1448 * ContentChecker.pod: New documentation.
1449
1450 * Makefile: A rule for |ContentChecker.html| is added.
1451
1452 * ContentChecker.pm: A pod "LICENSE" section is added.
1453
1454 * NanoDOM.pm ($VERSION): New variable.
1455
1456 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1457
1458 * H2H.pm: |b|, |i|, and |sub| are added to the
1459 list of allowed HTML elements.
1460
1461 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1462
1463 * H2H.pm: |samp| is added to the list of allowed
1464 HTML elements.
1465
1466 * URIChecker.pm (check_iri): New.
1467 (check_iri_reference): Error type for IRI reference
1468 syntax error is changed.
1469
1470 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1471
1472 * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1473 is added.
1474 (check_document): Load appropriate module before validation.
1475
1476 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1477
1478 * ContentChecker/: New directory.
1479
1480 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1481
1482 * ContentChecker.pm: HTML |time| element is implemented.
1483
1484 * HTMLTable.pm: Comments are updated as HTML5 is revised.
1485
1486 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1487
1488 * ContentChecker.pm (check_document): Return value
1489 even if no document element is found.
1490
1491 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1492
1493 * HTML.pm.src: |$in_body| is no longer a function.
1494
1495 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1496
1497 * HTML.pm.src: The |$in_body| code has been moved down.
1498
1499 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1500
1501 * HTML.pm.src: The "trailing end" insertion mode
1502 is split into "after html body" and "after html frameset"
1503 insertion modes. Their codes are merged with "after body"
1504 and "after frameset" codes. |$previous_insertion_mode|
1505 has been removed. "after frameset" code is
1506 merged with "in frameset" code.
1507
1508 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1509
1510 * HTML.pm.src: The "before head" insertion mode is
1511 merged with the "in head" insertion mode.
1512
1513 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1514
1515 * HTML.pm.src: Cases in "in head" insertion mode are
1516 reorganized.
1517
1518 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1519
1520 * HTML.pm.src: Some cases in "in table" insertion mode
1521 are merged.
1522
1523 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1524
1525 * HTML.pm.src: The "in row" insertion mode is merged
1526 with "in table" insertion mode.
1527
1528 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1529
1530 * HTML.pm.src: The "in table" and "in table body" insertion
1531 modes are merged.
1532
1533 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1534
1535 * HTML.pm.src: There is no "in table head" or "in table foot"
1536 insertion mode!
1537
1538 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1539
1540 * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
1541 now directly invoke the handler.
1542
1543 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1544
1545 * HTML.pm.src: Codes for "in cell" insertion mode
1546 is merged to the "in body" insertion mode code.
1547
1548 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1549
1550 * HTML.pm.src: Codes for "in body" and "in caption"
1551 insertion modes are merged.
1552
1553 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1554
1555 * HTML.pm.src: Two |!!!next-token|s were missing.
1556
1557 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1558
1559 * HTML.pm.src: Use numeric constant for |{content_mode}|
1560 instead of string constant for |{content_model_flag}|.
1561
1562 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1563
1564 * HTML.pm.src: Add the name of the attribute
1565 to the "duplicate attribute" error.
1566
1567 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1568
1569 * ContentChecker.pm: Return the |class| node list.
1570
1571 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1572
1573 * ContentChecker.pm: Return the |id| node list.
1574
1575 * HTML.pm.src: A typo is fixed.
1576
1577 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1578
1579 * ContentChecker.pm: Drop wrong |level => 'error'| specification
1580 from "in HTML:xml:lang" error. Character position
1581 is now the last part of the error type in the URI error
1582 description. Report "unsupported" status for language
1583 tags, media queries, script codes, and style sheets.
1584
1585 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1586
1587 * ContentChecker.pm: Report error if |xml:lang|
1588 in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
1589 in XML.
1590
1591 * NanoDOM.pm (Attr.owner_document): New attribute.
1592
1593 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1594
1595 * HTML.pm.src: The character immediately following
1596 a bare |hcro| was discarded. Fix handling of
1597 entity references in attribute values.
1598
1599 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1600
1601 * HTML.pm.src (main and trailin end phases): Token
1602 types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
1603 factored out. Error types |in html:#DOCTYPE|
1604 and |after html:#DOCTYPE| are merged
1605 into |DOCTYPE in the middle|. |</frameset>|
1606 in fragment parsing mode changed the
1607 insertion mode.
1608
1609 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1610
1611 * HTML.pm.src: |$phase| has been removed; The |trailing end|
1612 phase is now an insertion mode. Treatments for white
1613 space character tokens were incorrect for some
1614 insertion modes. An old |meta| case was not removed.
1615
1616 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1617
1618 * HTML.pm.src: |meta| charset declaration extraction
1619 implemented (but changing the encoding is not yet:-).
1620
1621 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1622
1623 * Charset/: New directory.
1624
1625 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1626
1627 * H2H.pm: New Perl module (created from
1628 manakai's H2H.dis).
1629
1630 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1631
1632 * XMLSerializer.pm: New Perl module (created from
1633 manakai's SimpleLS.dis).
1634
1635 2007-07-07 Wakaba <wakaba@suika.fam.cx>
1636
1637 * HTML.pm.src: |<!---x-->| was not processed correctly.
1638
1639 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1640
1641 * HTML.pm.src: Report correct error message
1642 for |<body></div></body>|.
1643
1644 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1645
1646 * HTMLTable.pm: An error description was incorrect.
1647
1648 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1649
1650 * ContentChecker.pm: Return |{term}| list.
1651
1652 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1653
1654 * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
1655 nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
1656 and <title> outside of head).
1657
1658 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1659
1660 * IMTChecker.pm: Report warning for unregistered
1661 and private types/subtypes.
1662
1663 * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
1664 URIChecker.pm, HTMLTable.pm: Error messages are now
1665 consistent; they are all listed in
1666 <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
1667
1668 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1669
1670 * ContentChecker.pm: |<img ismap>| not in |<a></a>|
1671 is now erred. |<datalist>| is implemented.
1672 Attribute checker for |<command>| and |<menu>| are
1673 added. Support for |contextmenu| global attribute
1674 is added.
1675
1676 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1677
1678 * HTML.pm.src (_reset_insertion_mode): Interpretation
1679 of Step 3 has been changed.
1680
1681 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1682
1683 * HTML.pm.src: Late |<html>| parse error is implemented.
1684
1685 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1686
1687 * URIChecker.pm (check_iri_reference): A |decode| method name was
1688 incorrect.
1689
1690 * ContentChecker.pm: Support for the |footer| element.
1691 Check URI syntax for space-separated URI attributes.
1692 Support for the |tabindex| attribute. Support
1693 for |datetime| attribute.
1694
1695 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1696
1697 * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
1698 character references). HTML5 revision 1.146 (white space
1699 characters before root start tag). HTML5 revision
1700 1.148 (named character references in attribute values).
1701 HTML5 revision 1.152 (<plaintext>.innerHTML get).
1702
1703 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1704
1705 * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
1706 in <head>).
1707
1708 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1709
1710 * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>
1711 in body).
1712
1713 * ContentChecker.pm: HTML5 revision 938 (scoped="").
1714
1715 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1716
1717 * HTML.pm.src: HTML5 revision 923 (matching end tag in CDATA
1718 or RCDATA in fragment parsing mode). HTML5 revision
1719 924 (<!--> and <!--->). HTML5 revision 926 (hn in hn).
1720
1721 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1722
1723 * HTML.pm.src (get_inner_html): HTML5 revision 922 (inner_html
1724 for <pre> and <textarea>).
1725
1726 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1727
1728 * HTML.pm.src: HTML5 revision 920 (<isindex>).
1729
1730 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1731
1732 * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).
1733
1734 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1735
1736 * HTML.pm.src: HTML5 revision 916 (</body>).
1737 HTML5 revision 917 (conforming bare &).
1738
1739 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1740
1741 * NanoDOM.pm (manakai_is_html): Setting to false did not work.
1742
1743 * HTML.pm.src: HTML5 revision 914 (</ in CDATA, RCDATA).
1744 HTML5 revision 915 (<nobr>).
1745
1746 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1747
1748 * HTML.pm.src: HTML5 revisions 908, 909, 912, and 913 (quirks mode).
1749
1750 * NanoDOM.pm (manakai_is_html, manakai_compat_mode, compat_mode):
1751 New attributes.
1752
1753 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1754
1755 * HTML.pm.src: HTML5 revisions 900, 901, 902, and 911 (<
1756 in tags).
1757
1758 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1759
1760 * .cvsignore: |Entities.html| is added.
1761
1762 * HTML.pm.src: |$entity_char| is removed and
1763 requires |Whatpm::_NamedEntityList| instead.
1764 HTML5 revision 898 (refc), except that lack of refc
1765 is parse error.
1766
1767 * mkentitylist.pl: New script.
1768
1769 * Makefile (all): |_NamedEntityList.pm| is added.
1770 (_NamedEntityList.pm, Entities.html): New rules.
1771
1772 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1773
1774 * HTML.pm.src: Parse errors immediately after U+000D
1775 were ignored and U+000D immediately following another
1776 U+000D was not converted to U+000A.
1777
1778 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1779
1780 * HTML.pm.src (set_inner_html): HTML5 revision 892 (adopt
1781 nodes before appended). Parser was not ready for NULL
1782 parse error and escape flag.
1783
1784 * NanoDOM.pm (adopt_node): New.
1785
1786 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1787
1788 * HTML.pm.src: HTML5 revision 886 (insane comment in
1789 CDATA and RCDATA). Note that current implementation
1790 is simply repeating what the spec says and it is maybe not
1791 a best way to do it.
1792
1793 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1794
1795 * HTML.pm.src: HTML5 revision 884 (</form> don't close
1796 the form element if a descendant element without implied end tag has
1797 still been open).
1798
1799 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1800
1801 * ContentChecker.pm: HTML5 revision 881 (Make |id| attribute with
1802 space characters non-conforming).
1803
1804 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1805
1806 * HTML.pm.src: An error message was incorrect.
1807 HTML5 revision 869 (C1 character references).
1808
1809 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1810
1811 * HTML.pm.src: HTML5 revision 867 (a LF at the beginning of
1812 a |textarea| is removed).
1813
1814 2007-06-05 Wakaba <wakaba@suika.fam.cx>
1815
1816 * NanoDOM.pm (get_attribute_node_ns): New method.
1817
1818 * ContentChecker.pm: |script| |async| and |defer|
1819 no longer require |src|. |async| MUST NOT be
1820 specified if |defer|. (HTML5 revision 858).
1821
1822 2007-05-30 Wakaba <wakaba@suika.fam.cx>
1823
1824 * HTML.pm.src: |<form><form>| went to inifinite loop.
1825
1826 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1827
1828 * ContentChecker.pm (html): Set |is_root| (allowed
1829 as a document element) flag on.
1830 (new): Removed.
1831 (check_document): New method.
1832
1833 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1834
1835 * ContentChecker.pm (thead, tfoot): Checker specifications
1836 were incorrect.
1837
1838 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1839
1840 * ContentChecker.pm ($HTMLURIAttrChecker): Include
1841 error position in the |type| option of the error.
1842
1843 * HTMLTable.pm (form_table): The |$onerror| parameter
1844 is now optional. Some bugs are fixed.
1845
1846 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1847
1848 * HTMLTable.pm: New module.
1849
1850 * ContentChecker.pm (table): Invoke table model error checker.
1851
1852 * NanoDOM.pm (first_child, get_attribute_ns): New.
1853
1854 2007-05-26 Wakaba <wakaba@suika.fam.cx>
1855
1856 * ContentChecker.pm ($HTMLLinkTypesAttrChecker): New checker.
1857 (link/@rel, a/@rel, area/@rel): Use new checker.
1858
1859 * Makefile (_LinkTypeList.pm, RelExtensions.html): New rules.
1860
1861 * _LinkTypeList.pm: New file.
1862
1863 * mklinktypelist.pl: New file.
1864
1865 * .cvsignore: |RelExtensions.html| added.
1866
1867 * NanoDOM.pm (child_nodes): Returns an empty array
1868 for non-child-containing node types.
1869 (text_content): New attribute.
1870
1871 2007-05-26 Wakaba <wakaba@suika.fam.cx>
1872
1873 * IMTChecker.pm: New module.
1874
1875 * ContentChecker.pm ($HTMLIMTAttrChecker): Call IMTChecker
1876 to test parameter value validity.
1877
1878 * HTML.pm.src ($style_start_tag): Attributes were
1879 discarded.
1880
1881 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1882
1883 * ContentChecker.pm ($HTMLURIAttrChecker): Implemented.
1884
1885 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1886
1887 * URIChecker.pm: All recommendations from RFC 3986
1888 and RFC 3987 are listed (not all testable items are checked yet).
1889
1890 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1891
1892 * URIChecker.pm: New module.
1893
1894 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1895
1896 * ContentChecker.pm: Now most attributes are implemented
1897 or associated to some placeholder.
1898 ($ElementDefault): Warn unknown attributes
1899 for unknown elements as "attribute not supported".
1900 ($HTMLLanguageTagAttrChecker, $HTMLMQAttrChecker): New placeholders.
1901 ($HTMLUsemapAttrChecker, $HTMLTargetAttrChecker): New checkers.
1902 (|a| attribute checker): Reimplemented.
1903
1904 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1905
1906 * ContentChecker.pm ($HTMLEventHandlerAttrChecker): New placeholder.
1907 ($HTMLAttrChecker): Event handler content attributes
1908 are added.
1909 (link, embed): Required attribute is now checked.
1910 (embed): Unknown local attributes are no longer warned.
1911
1912 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1913
1914 * ContentChecker.pm ($HTMLSpaceURIsAttrChecker): New placeholder.
1915 ($HTMLIMTAttrChecker): New checker.
1916 (link@rel, link@href, link@type, style@type,
1917 a@href, a@ping, a@ping, a@type, embed@src, embed@type,
1918 object@data, object@type, source@src, source@type, area@alt,
1919 area@shape, area@coords, area@href,
1920 area@ping, area@rel, area@type, script@src,
1921 script@defer, script@async, script@type): Checkers added.
1922
1923 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1924
1925 * ContentChecker.pm: Descendant checking was incorrect.
1926
1927 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1928
1929 * ContentChecker.pm: Support |xml:*| and |xmlns:*|
1930 attributes. Report an error if |Element.prefix|
1931 is |xmlns|.
1932
1933 * NanoDOM.pm (prefix): New attribute.
1934
1935 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1936
1937 * HTML.pm.src: In |main| phase, |in body| insertion
1938 mode, action for |<iframe>| was missing.
1939
1940 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1941
1942 * ContentChecker.pm: Support for many of HTML5 elements.
1943 ($GetHTMLNonNegativeIntegerAttrChecker): New.
1944
1945 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1946
1947 * ContentChecker.pm: Support for most elements up to |progress|.
1948 ($HTMLURIAttrChecker): Placeholder.
1949 ($HTMLIntegerAttrChecker, $GetHTMLFloatingPointNumberAttrChecker):
1950 New.
1951
1952 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1953
1954 * ContentChecker.pm: Attribute checkers
1955 for global attributes, |html|, |base|, |style|, and |meta|.
1956
1957 * NanoDOM.pm (insert_before): Weaken reference
1958 to the parent node.
1959 (Attr::new): Set |owner_element| attribute.
1960 (namespace_uri, manakai_local_name): New attribute implementations.
1961 (owner_element): New attribute.
1962
1963 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1964
1965 * ContentChecker.pm ($AttrChecker, $HTMLAttrChecker,
1966 $AnyChecker->{attr_checker}, $HTMLAttrsChecker,
1967 $Element->{$HTML_NS}->{''}): New.
1968 (check_element): Invoke attrs_checker for each element.
1969
1970 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1971
1972 * ContentChecker.pm: Don't use |manakai_element_type_match|.
1973
1974 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1975
1976 * ContentChecker.pm: Use hashs rather than lists for
1977 element type testings.
1978
1979 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1980
1981 * ContentChecker.pm: Don't generate duplicate
1982 error when an element type is put in the "minus" list
1983 and the element type is not allowed explicitly in the particular
1984 element content model.
1985 (html:a checker): New checker.
1986 (html:details, html:datagrid): New checkers.
1987 (html:legend): New checker.
1988
1989 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1990
1991 * ContentChecker.pm (html:li checker): Implemented.
1992
1993 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1994
1995 * ContentChecker.pm ($HTMLInlineOrStriclyInlineChecker): New
1996 checker.
1997 (html:dd checker): New checker.
1998 (html:q, html:em, html:strong, html:small,
1999 html:m, html:dfn, html:code, html:samp, html:span): New checkers.
2000
2001 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2002
2003 * ContentChecker.pm ($AnyChecker): Renamed
2004 from |$ElementDefault->{checker}|.
2005 ($ElementDefault->{checker}): Throw an error that
2006 the element type is not supported by the checker.
2007 ($HTMLMetadataElement): |html:base| was missing.
2008 ($HTMLEmptyChecker): Don't throw an error
2009 for inter-element whitespace nodes.
2010 (html:html checker): Errors were not
2011 thrown even if |html:head| and/or |html:body|
2012 children were missing.
2013 (html:head checker): An error was not
2014 thrown if <meta charset> appered after other
2015 elements.
2016
2017 2007-05-05 Wakaba <wakaba@suika.fam.cx>
2018
2019 * ContentChecker.pm: |footer|, |video|, |audio|, |script|,
2020 and |noscript| elements are implemented.
2021 (new): New method.
2022
2023 2007-05-04 Wakaba <wakaba@suika.fam.cx>
2024
2025 * ContentChecker.pm: New module.
2026
2027 2007-05-04 Wakaba <wakaba@suika.fam.cx>
2028
2029 * NanoDOM.pm (manakai_parent_element,
2030 document_element, manakai_local_name,
2031 manakai_element_type_match): New method.
2032
2033 2007-05-03 Wakaba <wakaba@suika.fam.cx>
2034
2035 * HTML.pm.src: Replace decimal and hexadecimal numeric
2036 entities in C1 range using Windows-1252 mapping. Bare LF
2037 did not count as new line for error reporting.
2038
2039 2007-05-02 Wakaba <wakaba@suika.fam.cx>
2040
2041 * NanoDOM.pm (DOMImplementation): New class.
2042 (append_child): Weaken the |parent_node| reference.
2043 (create_element_ns, Element new): Set the |owner_document|
2044 reference.
2045 (implementation): New attribute.
2046 (owner_document, local_name, namespace_uri): New attributes.
2047
2048 * HTML.pm.src (parse_string): Line and column numbers
2049 are now provided to error handler.
2050 (!!!parse-error): Short descriptions are added.
2051 (_construct_tree): Split into three methods; support
2052 for innerHTML mode.
2053 (set_inner_html): New method.
2054
2055 2007-05-01 Wakaba <wakaba@suika.fam.cx>
2056
2057 * NanoDOM.html: Documentation is added.
2058
2059 * HTML.pod, ContentType.html: Documentation is revised.
2060
2061 * .cvsignore: Pod2html temporary files are added.
2062
2063 * Makefile: Make |NanoDOM.html|.
2064
2065 2007-05-01 Wakaba <wakaba@suika.fam.cx>
2066
2067 * HTML.pm.src (parse_string): New method.
2068 (get_inner_html): Renamed from |inner_html|.
2069
2070 * Makefile: A rule for |HTML.html| is added.
2071
2072 * HTML.pod: New documentation.
2073
2074 2007-05-01 Wakaba <wakaba@suika.fam.cx>
2075
2076 * NanoDOM.pm (last_child, previous_sibling): New attributes.
2077 (clone_node): Attribute nodes were not completely copied.
2078
2079 * HTML.pm.src: Many bugs are fixed.
2080
2081 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2082
2083 * HTML.pm.src: Some typos are fixed.
2084
2085 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2086
2087 * mkhtmlparser.pl, Makefile: References to the |HTML-consume-entity.src|
2088 are removed.
2089
2090 * HTML.pm.src: Tokenizer's handling on named entities are rewritten.
2091
2092 * HTML-consume-entity.src: Removed.
2093
2094 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2095
2096 * HTML.pm.src: Tokenizer's handling on hexadecimal
2097 numeric entities are rewritten.
2098
2099 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2100
2101 * HTML.pm.src: Some tokenizer bugs are fixed.
2102
2103 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2104
2105 * HTML.pm.src: The tree construction stage is implemented.
2106
2107 * mkhtmlparser.pl: New macros are added.
2108
2109 2007-04-28 Wakaba <wakaba@suika.fam.cx>
2110
2111 * ContentType.pm: A note on bug in the specification
2112 is removed since it's been now fixed.
2113
2114 * .cvsignore: New file.
2115
2116 2007-04-28 Wakaba <wakaba@suika.fam.cx>
2117
2118 * HTML.pm.src, HTML-consume-entity.src: New files.
2119
2120 * Makefile (HTML.pm): New rule.
2121
2122 * mkhtmlparser.pl: New script.
2123
2124 2007-04-25 Wakaba <wakaba@suika.fam.cx>
2125
2126 * Makefile: New file.
2127
2128 2007-04-24 Wakaba <wakaba@suika.fam.cx>
2129
2130 * ContentType.pm: An error in pod is fixed.
2131
2132 2007-04-24 Wakaba <wakaba@suika.fam.cx>
2133
2134 * ContentType.pm: Documentation is added.
2135
2136 2007-04-24 Wakaba <wakaba@suika.fam.cx>
2137
2138 * ContentType.pm: New Perl module.
2139
2140 * ChangeLog: New file.
2141
2142

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24