/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Contents of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.331 - (show annotations) (download)
Sun Sep 21 09:45:02 2008 UTC (16 years, 1 month ago) by wakaba
Branch: MAIN
Changes since 1.330: +5 -0 lines
++ whatpm/t/ChangeLog	21 Sep 2008 09:32:48 -0000
2008-09-21  Wakaba  <wakaba@suika.fam.cx>

	* content-model-2.dat: Test data for |form| |name| are added.

++ whatpm/Whatpm/ChangeLog	21 Sep 2008 09:33:18 -0000
	* ContentChecker.pm: Prepare for |form| |name| attribute's
	duplication checking.

2008-09-21  Wakaba  <wakaba@suika.fam.cx>

++ whatpm/Whatpm/ContentChecker/ChangeLog	21 Sep 2008 09:32:26 -0000
2008-09-21  Wakaba  <wakaba@suika.fam.cx>

	* HTML.pm: |form| element's |name| attribute is implemented.

1 2008-09-21 Wakaba <wakaba@suika.fam.cx>
2
3 * ContentChecker.pm: Prepare for |form| |name| attribute's
4 duplication checking.
5
6 2008-09-21 Wakaba <wakaba@suika.fam.cx>
7
8 * HTML.pm.src (parse_byte_stream): Support (or non-support) for
9 unsupported charset="" parameter value (HTML5 revision 2131).
10
11 2008-09-20 Wakaba <wakaba@suika.fam.cx>
12
13 * HTML.pm.src: Reminding places where U+000B is allowed as a space
14 character is fixed (cf. HTML5 revision 1738).
15
16 * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
17 space characters (HTML5 revision 1738).
18
19 2008-09-20 Wakaba <wakaba@suika.fam.cx>
20
21 * HTML.pm.src: The "anything else" case for the "after after body"
22 insertion mode was not updated to swtich to the "in body"
23 insertion mode. U+000B is no longer a space character for the
24 purpose of tree construction phase (HTML5 revision 1738).
25
26 2008-09-20 Wakaba <wakaba@suika.fam.cx>
27
28 * HTML.pm.src: U+000B is no longer a space character (HTML5
29 revision 1738).
30
31 2008-09-20 Wakaba <wakaba@suika.fam.cx>
32
33 * ContentType.pm: 0x0B is no longer a space character (HTML5
34 revision 1738).
35
36 * HTML.pm.src: U+000B is no longer a space character for the
37 algorithm for extracting an encoding from a Content-Type (HTML5
38 revision 1738).
39
40 2008-09-20 Wakaba <wakaba@suika.fam.cx>
41
42 * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
43
44 2008-09-18 Wakaba <wakaba@suika.fam.cx>
45
46 * LangTag.pm: Add checks for remaining requirements from RFC 4646.
47
48 * mklangreg.pl: Sort 'Prefix' values by their length, to ease
49 matching.
50
51 2008-09-18 Wakaba <wakaba@suika.fam.cx>
52
53 * LangTag.pm: Warn for private use language subtags. Error level
54 typos fixed. Support for Suppress-Script field.
55
56 * mklangreg.pl: Support for dumping of nested structure.
57
58 2008-09-18 Wakaba <wakaba@suika.fam.cx>
59
60 * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
61 recommended case as per RFC 4646.
62
63 2008-09-18 Wakaba <wakaba@suika.fam.cx>
64
65 * LangTag.pm (check_rfc4646_langtag): New method.
66
67 2008-09-18 Wakaba <wakaba@suika.fam.cx>
68
69 * mklangreg.pl: New script.
70
71 * Makefile: Updated for creation of the module for language subtag
72 registry.
73
74 2008-09-16 Wakaba <wakaba@suika.fam.cx>
75
76 * Makefile: WebIDL.html added.
77
78 * WebIDL.pod: New documentation.
79
80 2008-09-16 Wakaba <wakaba@suika.fam.cx>
81
82 * WebIDL.pm: Checker's error types are redefined.
83
84 2008-09-16 Wakaba <wakaba@suika.fam.cx>
85
86 * WebIDL.pm: Parser's error types are redefined. Some forward
87 compatible parsing bugs are fixed. Some unreachable codes are
88 commented out.
89
90 2008-09-16 Wakaba <wakaba@suika.fam.cx>
91
92 * WebIDL.pm: Support for the reminding extended attributes are
93 added. It does not satisfy the definition that a forward
94 interface declaration has an extended attribute. It seems that
95 unless explicitly allowed multiple extended attributes with the
96 same name is not allowed, though it is not explicitly mentioned in
97 the spec.
98
99 2008-09-16 Wakaba <wakaba@suika.fam.cx>
100
101 * WebIDL.pm: Unescapes extended attribute names and extended
102 attribute identifiers. Preserve whether an extended attribute has
103 an argument list of not. Support for extended attributes:
104 Constructor, ExceptionConsts, IndexGetter, IndexSetter,
105 NameGetter, NameSetter, and Null.
106 (has_argument_list): New attribute.
107 (idl_text): Stringifies argument lists, if any, even if it is
108 empty.
109
110 2008-09-15 Wakaba <wakaba@suika.fam.cx>
111
112 * HTML.pm.src: New state |PCDATA_STATE|. Use an empty string for
113 |{s_kwd}| in DATA_STATE as default.
114
115 2008-09-15 Wakaba <wakaba@suika.fam.cx>
116
117 * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
118 by |{s_kwd}| in DATA_STATE.
119
120 2008-09-15 Wakaba <wakaba@suika.fam.cx>
121
122 * HTML.pm.src: Shorten keys.
123
124 2008-09-15 Wakaba <wakaba@suika.fam.cx>
125
126 * HTML.pm.src: Remove checking for control character, surrogate
127 pair, or noncharacter code points and non-Unicode code
128 points (they should be handled by Whatpm::Charset::UnicodeChecker).
129 (parse_char_stream): Support for the |$get_wrapper| argument and
130 character stream error handlers.
131
132 2008-09-15 Wakaba <wakaba@suika.fam.cx>
133
134 * ContentChecker.pm: Don't call |loda_ns_module|
135 for null-namespace elements/attributes.
136
137 * HTML.pm.src: Fact out $disallowed_control_chars
138 as a hash.
139
140 2008-09-14 Wakaba <wakaba@suika.fam.cx>
141
142 * HTML.pm.src: Regexp typo fixed. |{prev_char}|
143 and |{next_char}| initializations are moved to initialization
144 method. |{read_until}| now supports buffering. Sync |set_inner_html|
145 with |parse_char_stream|.
146
147 2008-09-14 Wakaba <wakaba@suika.fam.cx>
148
149 * HTML.pm.src (parse_char_stream): Make |set_next_char|
150 invoke |manakai_read_until|, not only |read|, where
151 possible, to decrease the number of |read| method calls.
152
153 * mkhtmlparser.pl: Related changes to the aforementioned
154 modification.
155
156 2008-09-14 Wakaba <wakaba@suika.fam.cx>
157
158 * HTML.pm.src: Use |read| instead of |getc|. |set_inner_html|
159 would report character error from now.
160
161 2008-09-14 Wakaba <wakaba@suika.fam.cx>
162
163 * HTML.pm.src: White-space-leaded non-white-space character
164 tokens in "before head insertion mode" was not
165 correctly handled.
166 (set_inner_html): Reimplemented using CharString decodehandle
167 class. Support for $get_wrapper argument. Support
168 for |{read_until}| feature.
169
170 2008-09-14 Wakaba <wakaba@suika.fam.cx>
171
172 * HTML.pm.src: Make a "bare ero" error for unknown
173 entities point the "&" character.
174
175 2008-09-14 Wakaba <wakaba@suika.fam.cx>
176
177 * HTML.pm.src: It turns out that U+FFFD don't have to
178 be added to the list of excluded characters.
179
180 2008-09-14 Wakaba <wakaba@suika.fam.cx>
181
182 * HTML.pm.src ($char_onerror): Have character decoder's |line|
183 and |column| a higher priority than the one set by the
184 tokenizer's input handler.
185 ($self->{read_until}): Exclude U+FFFD (but this might
186 not be necessary, since now we do line/column fixup in
187 the character decode handle).
188
189 2008-09-14 Wakaba <wakaba@suika.fam.cx>
190
191 * HTML.pm.src: Use |{read_until}| where possible.
192
193 2008-09-14 Wakaba <wakaba@suika.fam.cx>
194
195 * HTML.pm.src: Change |{getc_until}| to |{read_until}|
196 and |manakai_getc_until| to |manakai_read_until| to
197 reduce the number of string copies.
198
199 2008-09-14 Wakaba <wakaba@suika.fam.cx>
200
201 * HTML.pm.src (parse_char_string): Use newly created
202 |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
203 standard feature to |open| a string as a filehandle,
204 since Perl's string filehandle seems not supporting |ungetc|
205 method correctly.
206 (parse_char_stream): Define |{getc_until}| method.
207 (DATA_STATE): Experimental support for |getc_until| feature.
208
209 2008-09-13 Wakaba <wakaba@suika.fam.cx>
210
211 * HTML.pm.src: Check points added to newly added branches.
212
213 2008-09-13 Wakaba <wakaba@suika.fam.cx>
214
215 * HTML.pm.src: Remove |{char}|, which is no longer used.
216 Remove |{entity_in_attr}| and |{last_attribute_value_state}|
217 and replaced by |{prev_state}|.
218
219 * mkhtmlparser.pl: Remove |{char}| feature.
220 Remove |!!!back-next-input-character;| macro.
221
222 2008-09-13 Wakaba <wakaba@suika.fam.cx>
223
224 * HTML.pm.src: Finally we get rid of all the inner loops. Remove
225 entity related tokenizer states in favor of new states
226 implementing the consume character reference algorithm.
227
228 2008-09-13 Wakaba <wakaba@suika.fam.cx>
229
230 * HTML.pm.src: "Consume a character reference" algorithm is
231 now implemented as a tokenizer's state, rather than
232 a method, with minimum changes (more changes will
233 be made, in due course). "Bogus comment state"'s inner
234 loop gets removed.
235
236 2008-09-13 Wakaba <wakaba@suika.fam.cx>
237
238 * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
239 into their own tokenizer states.
240
241 2008-09-13 Wakaba <wakaba@suika.fam.cx>
242
243 * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
244 is split into three states.
245
246 2008-09-13 Wakaba <wakaba@suika.fam.cx>
247
248 * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
249 itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
250 no longer does the tokenizer have to push back next input
251 characters in those states.
252
253 2008-09-13 Wakaba <wakaba@suika.fam.cx>
254
255 * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
256 into four states so that no longer does the tokenizer have to push
257 back next input characters in that state.
258
259 2008-09-11 Wakaba <wakaba@suika.fam.cx>
260
261 * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
262 which can be used to insert some wrapper between the character
263 stream handle and the tokenizer. (It is currently not supported
264 for |set_inner_html| for |Element|s).
265
266 2008-09-10 Wakaba <wakaba@suika.fam.cx>
267
268 * HTML.pm.src: Ignore punctuations in charset names.
269
270 2008-09-10 Wakaba <wakaba@suika.fam.cx>
271
272 * ContentChecker.pm: Support for charset-layer error levels.
273
274 * HTML.pm.src: Don't specify |text| argument for the
275 |chardecode:fallback| error, since it is not the encoding
276 being used alternatively.
277
278 2008-09-06 Wakaba <wakaba@suika.fam.cx>
279
280 * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
281
282 2008-08-31 Wakaba <wakaba@suika.fam.cx>
283
284 * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
285
286 2008-08-31 Wakaba <wakaba@suika.fam.cx>
287
288 * HTML.pm.src: Bug fix and sync with the spec with regard
289 to after after frameset insertion mode processing (HTML5
290 revision 1909). Note that the implementation was wrong
291 per the old spec before the r1909 changes.
292
293 2008-08-30 Wakaba <wakaba@suika.fam.cx>
294
295 * HTMLTable.pm: scope=auto algorithm fix synced with the
296 spec (HTML5 revision 2093).
297 ($process_row): Algorithm step numbers synced with the
298 spec (HTML5 revision 2092).
299
300 2008-08-30 Wakaba <wakaba@suika.fam.cx>
301
302 * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
303 revision 2094).
304
305 2008-08-30 Wakaba <wakaba@suika.fam.cx>
306
307 * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
308
309 2008-08-30 Wakaba <wakaba@suika.fam.cx>
310
311 * HTML.pm.src: '"' and "'" at the end of attribute
312 name (after another attribute) now raise parse error (HTML5
313 revision 2123). Empty unquoted attribute values are no
314 longer allowed (HTML5 revision 2122).
315
316 2008-08-30 Wakaba <wakaba@suika.fam.cx>
317
318 * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
319 revision 2130).
320
321 2008-08-30 Wakaba <wakaba@suika.fam.cx>
322
323 * ContentChecker.pm: |xml:lang| attribute value must be same
324 as |lang| attribute value for HTML elements (HTML5 revision 2062
325 and so on).
326
327 2008-08-30 Wakaba <wakaba@suika.fam.cx>
328
329 * ContentChecker.pm: Error level definition for |xml_id_error|
330 was missing.
331
332 * URIChecker.pm: The end of the URL should be marked as the
333 error location for an empty path error. The position
334 between the userinfo and the port components should be
335 marked as the error location for an empty host error.
336
337 2008-08-30 Wakaba <wakaba@suika.fam.cx>
338
339 * URIChecker.pm: Set parameters representing where in the
340 value the error occurs for errors. Report unknown
341 address format error in warning level, since address
342 formats are rarely added. Path segments starting with "/.."
343 were misinterpreted as a dot-segment.
344
345 2008-08-30 Wakaba <wakaba@suika.fam.cx>
346
347 * URIChecker.pm (check_iri_reference): Requires
348 |Message::DOM::DOMImplementation|.
349
350 2008-08-29 Wakaba <wakaba@suika.fam.cx>
351
352 * IMTChecker.pm: Updated for the new error reporting architecture.
353
354 * ContentChecker.pm: Error levels for IMTs are added.
355
356 2008-08-17 Wakaba <wakaba@suika.fam.cx>
357
358 * H2H.pm (_shift_token): Support for unquoted HTML attribute
359 values.
360
361 2008-08-16 Wakaba <wakaba@suika.fam.cx>
362
363 * CacheManifest.pm: Support for new style of error
364 reports.
365
366 * HTML.pm.src: Set line=1, column=1 to the document node.
367
368 2008-08-16 Wakaba <wakaba@suika.fam.cx>
369
370 * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
371 and URL checkers. Support for more error levels for bogus
372 langauge tag and URL "standards".
373
374 * LangTag.pm, URIChecker.pm: Support for new style error
375 level reporting.
376
377 2008-08-15 Wakaba <wakaba@suika.fam.cx>
378
379 * ContentChecker.pm: Support for RDF/XML error levels.
380
381 * HTMLTable.pm, RDFXML.pm: Support for new style of error level
382 specifying. Error types are revised.
383
384 2008-08-15 Wakaba <wakaba@suika.fam.cx>
385
386 * ContentChecker.pm: All error reporting method calls are
387 renewed.
388
389 2008-08-15 Wakaba <wakaba@suika.fam.cx>
390
391 * HTML.pm.src: All error type names and "text" parameters
392 are revised. Use new style for "level" specification.
393
394 * mkhtmlparser.pl: Use new style for "level" specification.
395
396 2008-08-03 Wakaba <wakaba@suika.fam.cx>
397
398 * WebIDL.pm (parse_char_string): Simplified error
399 reporting process for broken ignored valuetype definition.
400 (Valuetype idl_text): Support for special "DOMString" name.
401
402 2008-08-03 Wakaba <wakaba@suika.fam.cx>
403
404 * WebIDL.pm ($get_scoped_name): Append "::::" if the last
405 terminal of the ScopedName is "DOMString", such that whether
406 the last part of the scoped name is "DOMString" or "_DOMString"
407 later. It is necessary to determine whether a |typedef|
408 definition should be ignored or not.
409 (parse_char_string): Unescape the identifier of
410 exception members.
411 ($resolve): Return undef for builtin types and sequence<T>
412 types (we might not have to do this, however...).
413 (check): Support checking for Exceptions, Valuetypes,
414 and Typedefs.
415 ($serialize_type): Support for "DOMString::::" syntax.
416 (Typedef idl_text): Output Type as "DOMString" if it
417 is really "DOMString" (i.e. its internal representation
418 is "::DOMString::").
419
420 2008-08-03 Wakaba <wakaba@suika.fam.cx>
421
422 * WebIDL.pm ($resolve): New code, based on resolve code
423 for constant types in the |check| method.
424 (check): Support for checking of attributes, operations, and
425 arguments.
426 (Attribute/Operation idl_text): Exception names in getraises,
427 setraises, and raises clauses is serizlied by |$serialize_type|
428 code.
429
430 2008-08-02 Wakaba <wakaba@suika.fam.cx>
431
432 * WebIDL.pm ($integer): Order of selections are changed to match
433 hexadecimal numbers (the original pattern, taken from the spec,
434 was not work for hexadecimal numbers, because the "0" prefix
435 matches to the [0-7]* part (as an empty string) and therefore
436 it does not match with remaining "x..." part of a "0x..." integer
437 literal.
438 ($get_type): It now returns a string, not an array reference,
439 for regular types and |sequence| types (i.e. it in any case
440 returns a string).
441 ($get_next_token): The second item in the array that represents
442 a integer or float token is now a Perl number value, not the
443 original string representation of the number.
444 (check): Support for const value consistency checking.
445 No extended attribute is defined for constants.
446 (Node subclasses): Use simple strings rather than array references
447 for default data type values.
448 ($serialize_type): Type values are now simple strings.
449 (value): If the new attribute value is a false value, then
450 a FALSE value is set to the attribute.
451
452 2008-08-02 Wakaba <wakaba@suika.fam.cx>
453
454 * WebIDL.pm ($get_scoped_name): Now scoped names are stored
455 in its stringified format ("scoped name" as defined in the
456 spec). Note that future version of this module should not use
457 array references for type values and the |type_text| attribute
458 should be made obsolete.
459 (parse_char_string): Unescape attribute names.
460 (check): Support for checking of whether inherited interfaces
461 are actually defined or not. Support for checking of whether
462 interface member identifiers are duplicated or not.
463 ($serialize_type): Scoped names are returned as is. A future
464 version of this code should escape identifiers other than "DOMString",
465 otherwise the idl_text would be non-conforming.
466
467 2008-08-02 Wakaba <wakaba@suika.fam.cx>
468
469 * WebIDL.pm (parse_char_string): Set line/column numbers
470 to generated nodes. Unescape identifiers. Extended attributes
471 for Definition's were ignored.
472 (append_child): Set |parent_node| attribute.
473 (parent_node): New attribute.
474 (check): Support interface/exception members. Support
475 extended attributes. Support definition identifier uniqueness
476 constraint.
477 (qualified_name): New attribute.
478 (Interface/Exception idl_text): Extended attributes were
479 not prepended to the returned text.
480
481 2008-08-02 Wakaba <wakaba@suika.fam.cx>
482
483 * WebIDL.pm (parse_char_string): Set line/column numbers
484 to interface object experimentally. s/shift/pop/g, shift
485 would make things wrong. Support for interface forward
486 declarations was missing. Broken interface declarations
487 with no block were not ignored entirely.
488 (Whatpm::WebIDL::Node): New abstract class. This class
489 makes things easier.
490 (child_nodes): New attribute. Unlike DOM's attribute with
491 same name, this attribute returns a dead list of nodes for
492 simplicity.
493 (get_user_data, set_user_data): New methods.
494 (Module idl_text): A SPACE character should be inserted
495 before the |{| character.
496 (Interface idl_text): Support for interface forward declarations.
497 (is_forward_declaration): New attribute.
498
499 2008-07-19 Wakaba <wakaba@suika.fam.cx>
500
501 * WebIDL.pm (type_text): Better serializer.
502
503 2008-07-19 Wakaba <wakaba@suika.fam.cx>
504
505 * WebIDL.pm: Revise forward-compatible parsing so that
506 it now can handle broken extended attributes and as such.
507
508 2008-07-19 Wakaba <wakaba@suika.fam.cx>
509
510 * WebIDL.pm: Real support for extended attributes.
511 Support for extended attributes with arguments.
512
513 2008-07-19 Wakaba <wakaba@suika.fam.cx>
514
515 * WebIDL.pm: Support for |exception| syntax.
516 (Interface->idl_text): Tentative support for inheritances.
517
518 2008-07-19 Wakaba <wakaba@suika.fam.cx>
519
520 * WebIDL.pm: Hierarchical scoped name support was broken.
521 Support for raises, setraises, and getraises syntaxes.
522
523 2008-07-18 Wakaba <wakaba@suika.fam.cx>
524
525 * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
526 proper support for types, extended attributes, and exceptions yet).
527 WebIDL parser, version 1 (no support for exceptions yet,
528 no proper support for extended attributes yet).
529
530 2008-07-09 Wakaba <wakaba@suika.fam.cx>
531
532 * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
533
534 2008-06-29 Wakaba <wakaba@suika.fam.cx>
535
536 * WebIDL.pm: Support for valuetype and const.
537
538 2008-06-29 Wakaba <wakaba@suika.fam.cx>
539
540 * WebIDL.pm: New module.
541
542 2008-06-15 Wakaba <wakaba@suika.fam.cx>
543
544 * Makefile (Entities.html): URI changed.
545
546 2008-06-08 Wakaba <wakaba@suika.fam.cx>
547
548 * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
549
550 2008-06-01 Wakaba <wakaba@suika.fam.cx>
551
552 * HTML.pm.src (_get_next_token): A parse error was missing.
553
554 2008-06-01 Wakaba <wakaba@suika.fam.cx>
555
556 * mklinktypelist.pl: rel=contact is no longer part of the HTML5
557 spec (commented out). (HTML5 revision 1711).
558
559 2008-05-25 Wakaba <wakaba@suika.fam.cx>
560
561 * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
562
563 * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
564 as UTF-16 (HTML5 revision 1701).
565
566 2008-05-25 Wakaba <wakaba@suika.fam.cx>
567
568 * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
569 1692).
570
571 2008-05-25 Wakaba <wakaba@suika.fam.cx>
572
573 * HTML.pm.src: The secondary insertion mode used when switching
574 to foreign content is the "in body" insertion mode (HTML5 revision
575 1696).
576
577 2008-05-25 Wakaba <wakaba@suika.fam.cx>
578
579 * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
580 revision 1697).
581
582 2008-05-24 Wakaba <wakaba@suika.fam.cx>
583
584 * HTML.pm.src: Support for end-of-file token in foreign content
585 insertion mode (HTML5 revision 1693). Update SVG camelCase
586 attribute list (HTML5 revision 1700). <textarea> closes
587 </select> (HTML5 revision 1699). More start tags close in
588 foreign content insertion mode (HTML5 revision 1698).
589
590 2008-05-24 Wakaba <wakaba@suika.fam.cx>
591
592 * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
593
594 2008-05-24 Wakaba <wakaba@suika.fam.cx>
595
596 * HTML.pm.src: More robust charset parameter detection (HTML5
597 revision 1674).
598
599 2008-05-24 Wakaba <wakaba@suika.fam.cx>
600
601 * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
602 revision 1676).
603
604 2008-05-24 Wakaba <wakaba@suika.fam.cx>
605
606 * HTML.pm.src: Ignore language part of public identifiers for
607 quriks mode detection (HTML5 revision 1679).
608
609 2008-05-24 Wakaba <wakaba@suika.fam.cx>
610
611 * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
612 revision 1685).
613
614 2008-05-24 Wakaba <wakaba@suika.fam.cx>
615
616 * HTML.pm.src: Support for EOF in new states for tags (HTML5
617 revision 1684).
618
619 2008-05-24 Wakaba <wakaba@suika.fam.cx>
620
621 * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
622 work (HTML5 revision 1690).
623
624 2008-05-24 Wakaba <wakaba@suika.fam.cx>
625
626 * HTML.pm.src (_tree_construction_main): Change handling of
627 end tags in head insertion modes (HTML5 revision 1686).
628 (parse_char_string): Bug fix for non-utf8 character string handlings.
629 (parse_char_stream): |ungetc| does not work well for this context.
630
631 2008-05-18 Wakaba <wakaba@suika.fam.cx>
632
633 * HTML.pm.src (parse_byte_string): Redefined to invoke
634 |parse_byte_stream|.
635 (parse_byte_stream): New method.
636
637 2008-05-18 Wakaba <wakaba@suika.fam.cx>
638
639 * HTML.pm.src (parse_byte_string): Fix the column number reported
640 by encoding layer error reporter.
641
642 2008-05-17 Wakaba <wakaba@suika.fam.cx>
643
644 * HTML.pm.src (parse_byte_string): Use streaming decoder
645 rather than converting the whole byte string and then parsing.
646 Propagate errors in character encoding layer.
647 (get_next_token): Precise error reporting for |bare stago| error.
648
649 2008-05-17 Wakaba <wakaba@suika.fam.cx>
650
651 * HTML.pm.src (parse_char_stream): New method.
652 (parse_char_string): This method is now defined as an invocation
653 of the |parse_char_stream| method.
654
655 2008-05-17 Wakaba <wakaba@suika.fam.cx>
656
657 * HTML.pm.src (parse_byte_string): Report various status
658 of the sniffing as info-level errors. Support for new
659 decoding framework in parser resestting.
660 (new): Various default error levels were not set.
661
662 2008-05-17 Wakaba <wakaba@suika.fam.cx>
663
664 * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
665 algorithm, except for the actual sniffing, is implemented
666 with new framework with Message::Charset::Info.
667
668 2008-05-16 Wakaba <wakaba@suika.fam.cx>
669
670 * CacheManifest.pm (_parse): Drop fragment identifiers from
671 URIs in fallback section (HTML5 revision 1596).
672
673 2008-05-10 Wakaba <wakaba@suika.fam.cx>
674
675 * Makefile (Entities.html): URI has changed.
676
677 2008-05-10 Wakaba <wakaba@suika.fam.cx>
678
679 * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
680 1553).
681
682 2008-05-06 Wakaba <wakaba@suika.fam.cx>
683
684 * ContentChecker.pm: Noted that those returned in |table| are
685 no longer table elements, but table objects returned
686 by Whatpm::HTMLTable.
687
688 * HTMLTable.pm (form_table): Return table element node
689 as |$table->{element}|.
690 (assign_header): Support for the |headers=""| attribute.
691
692 2008-05-06 Wakaba <wakaba@suika.fam.cx>
693
694 * HTMLTable.pm (assign_header): New function; first version
695 with no support for headers="".
696 (form_table): Include table width and height to the returned
697 table object for covenience. Indexing in column assignement was wrong.
698 Set whether a data cell is empty or not for convenience.
699
700 2008-05-05 Wakaba <wakaba@suika.fam.cx>
701
702 * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
703
704 2008-05-05 Wakaba <wakaba@suika.fam.cx>
705
706 * HTMLTable.pm: How table model errors are detected is
707 changed (HTML5 revision 1387).
708
709 2008-05-05 Wakaba <wakaba@suika.fam.cx>
710
711 * HTMLTable.pm: The algorithm now moves |tfoot| elements
712 to the end of the table (HTML5 revision 1380).
713
714 2008-05-05 Wakaba <wakaba@suika.fam.cx>
715
716 * HTMLTable.pm: The algorithm is now 0-based indexing, instead
717 of 1-based (HTML5 revision 1376).
718
719 2008-05-05 Wakaba <wakaba@suika.fam.cx>
720
721 * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
722 and "Content-Encoding" no longer prevent sniffing (HTML5
723 revision 1288).
724
725 2008-05-05 Wakaba <wakaba@suika.fam.cx>
726
727 * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
728 1282).
729
730 2008-05-03 Wakaba <wakaba@suika.fam.cx>
731
732 * ContentChecker.pm: Support for global attributes.
733 Status of XML specs are added.
734
735 2008-05-03 Wakaba <wakaba@suika.fam.cx>
736
737 * ContentChecker.pm (check_element): Support for |template=""|
738 and |ref=""| attribute (referent element type checking).
739
740 2008-04-29 Wakaba <wakaba@suika.fam.cx>
741
742 * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
743 1500) is implemented (except for IDNA part and URI-scheme-specific
744 knowledge). Line number counting was wrong for LF-only
745 documents.
746
747 2008-04-13 Wakaba <wakaba@suika.fam.cx>
748
749 * HTML.pm.src: Raise an parse error for any disallowed
750 character (HTML5 revision 1263).
751
752 2008-04-13 Wakaba <wakaba@suika.fam.cx>
753
754 * mkentitylist.pl: Support for new HTML5 entity table format (the
755 definition for |AElig;| was missing).
756
757 2008-04-13 Wakaba <wakaba@suika.fam.cx>
758
759 * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
760 name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
761
762 2008-04-13 Wakaba <wakaba@suika.fam.cx>
763
764 * HTML.pm.src: List of element names that close foreign content
765 insertion mode is added (HTML5 revisions 1412 and 1418).
766
767 2008-04-13 Wakaba <wakaba@suika.fam.cx>
768
769 * HTML.pm.src: Support for |mglyph| and |malignmark|
770 elements (HTML5 revision 1410).
771
772 2008-04-13 Wakaba <wakaba@suika.fam.cx>
773
774 * HTML.pm.src: Support for new long MathML entities (HTML5
775 revision 1406).
776
777 2008-04-13 Wakaba <wakaba@suika.fam.cx>
778
779 * HTML.pm.src: CDATA section support for MathML and SVG
780 elements (HTML5 revision 1404 and 1420).
781
782 2008-04-12 Wakaba <wakaba@suika.fam.cx>
783
784 * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
785 elements (HTML5 revision 1404). Unused !!!macro definitions
786 are removed.
787
788 2008-04-12 Wakaba <wakaba@suika.fam.cx>
789
790 * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
791 are raised is changed (HTML5 revision 1404).
792
793 2008-04-06 Wakaba <wakaba@suika.fam.cx>
794
795 * HTML.pm.src: Category sets that are no longer used are removed.
796
797 2008-04-06 Wakaba <wakaba@suika.fam.cx>
798
799 * HTML.pm.src: The ->[1] property of stack entries are now
800 replaced by constants representing element category.
801
802 2008-04-06 Wakaba <wakaba@suika.fam.cx>
803
804 * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
805 for error reporting. (This is a preparation for using constant
806 value for ->[1].)
807
808 2008-03-22 Wakaba <wakaba@suika.fam.cx>
809
810 * RDFXML.pm: Typo fixed.
811
812 2008-03-22 Wakaba <wakaba@suika.fam.cx>
813
814 * ContentChecker.pm: |fact_level| is now treated
815 as same as |must_level|, i.e. level = |m|.
816 (check_element): Make list of URIs in the DOM.
817
818 2008-03-21 Wakaba <wakaba@suika.fam.cx>
819
820 * RDFXML.pm: Language accessor implemented.
821 Local (null-namespace) attribute support.
822
823 2008-03-21 Wakaba <wakaba@suika.fam.cx>
824
825 * RDFXML.pm: Factored out ID checking code.
826
827 2008-03-21 Wakaba <wakaba@suika.fam.cx>
828
829 * RDFXML.pm: TODO items noted. Validation of ID
830 and URI attributes is implemented. Warn if unknown
831 value is used in rdf:parseType="" attribute.
832
833 * URIChecker.pm (check_rdf_uri_reference): New function.
834
835 2008-03-21 Wakaba <wakaba@suika.fam.cx>
836
837 * RDFXML.pm: bnodeid implemented. Relative references
838 are now resolved.
839
840 2008-03-21 Wakaba <wakaba@suika.fam.cx>
841
842 * ContentChecker.pm: RDF reification implemented.
843
844 * RDFXML.pm: undef vs false bug fixed.
845 Reification implemented.
846
847 2008-03-21 Wakaba <wakaba@suika.fam.cx>
848
849 * RDFXML.pm: s/id/ID/ for attribute name.
850 The |node| arguments are added for |ontriple| calls.
851 Too many "attribute not allowed" errors were raised.
852
853 * ContentChecker.pm: Initial experimental support for rdf:RDF
854 element.
855
856 2008-03-21 Wakaba <wakaba@suika.fam.cx>
857
858 * RDFXML.pm: New module.
859
860 2008-03-20 Wakaba <wakaba@suika.fam.cx>
861
862 * HTML.pm.src (set_inner_html): Line/column number
863 code was old one yet.
864
865 2008-03-20 Wakaba <wakaba@suika.fam.cx>
866
867 * HTML.pm.src: Better line/column reporting for "duplicate attribute"
868 errors. Line/column markings for DOCTYPE, comment, and
869 character tokens are reintroduced; otherwise,
870 error location for "not HTML5" error and errors
871 for implied elements are not attached.
872
873 2008-03-20 Wakaba <wakaba@suika.fam.cx>
874
875 * HTML.pm.src: Set line/column numbers to attributes.
876
877 * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
878 (value): Setter implemented.
879
880 * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
881
882 2008-03-20 Wakaba <wakaba@suika.fam.cx>
883
884 * HTML.pm.src: Unused line/column markings are removed.
885
886 2008-03-20 Wakaba <wakaba@suika.fam.cx>
887
888 * HTML.pm.src (_get_next_token): Remove |first_start_tag|
889 flag, which is no longer used.
890
891 2008-03-17 Wakaba <wakaba@suika.fam.cx>
892
893 * HTML.pm.src: Set line/column information to element nodes.
894
895 * mkhtmlparser.pl (!!!create-element, !!!insert-element,
896 and !!!insert-element-t): Set line/column information to
897 element nodes.
898
899 2008-03-17 Wakaba <wakaba@suika.fam.cx>
900
901 * HTML.pm.src (_get_next_token): The first "<" character
902 in "<?", "<>", or "</>" should be the error point.
903
904 2008-03-16 Wakaba <wakaba@suika.fam.cx>
905
906 * HTML.pm.src: Some more fixes on error position reporting.
907
908 2008-03-16 Wakaba <wakaba@suika.fam.cx>
909
910 * HTML.pm.src: Token-level precious error reporting.
911
912 2008-03-16 Wakaba <wakaba@suika.fam.cx>
913
914 * HTML.pm.src: Preparation for more precious error point
915 reporting.
916
917 2008-03-11 Wakaba <wakaba@suika.fam.cx>
918
919 * HTML.pm.src: Error type revised.
920
921 2008-03-11 Wakaba <wakaba@suika.fam.cx>
922
923 * HTML.pm.src: Similar codes are merged together, again.
924
925 2008-03-11 Wakaba <wakaba@suika.fam.cx>
926
927 * HTML.pm.src: Similar codes are merged together.
928
929 2008-03-10 Wakaba <wakaba@suika.fam.cx>
930
931 * mkhtmlparser.pl: Set "level" parameter to parse errors.
932
933 * HTML.pm.src: Code refined.
934
935 2008-03-09 Wakaba <wakaba@suika.fam.cx>
936
937 * HTML.pm.src: |</body>| treatement has been changed (HTML5
938 revision 1348). Note that I really don't know this makes
939 any difference in the black-box behavior of the parser.
940
941 2008-03-09 Wakaba <wakaba@suika.fam.cx>
942
943 * HTML.pm.src: New end-of-file token implementation (HTML5
944 revision 1348).
945
946 2008-03-09 Wakaba <wakaba@suika.fam.cx>
947
948 * HTML.pm.src: |applet| support (HTML5 revision 1347).
949
950 2008-03-09 Wakaba <wakaba@suika.fam.cx>
951
952 * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
953
954 2008-03-09 Wakaba <wakaba@suika.fam.cx>
955
956 * HTML.pm.src: Support for |<input>| in the "in select" insertion
957 mode and support for the "in select in table" insertion mode (HTML5
958 revision 1342).
959
960 2008-03-09 Wakaba <wakaba@suika.fam.cx>
961
962 * HTML.pm.src: No foster parenting for <script> and <script>
963 in non-tainted <table>s (HTML5 revision 1336).
964
965 2008-03-09 Wakaba <wakaba@suika.fam.cx>
966
967 * HTML.pm.src: Ignore white space characters between <html>
968 and <head> (HTML5 revision 1332).
969
970 2008-03-09 Wakaba <wakaba@suika.fam.cx>
971
972 * HTML.pm.src: Treat <input type=hidden> as if it were a
973 white space (HTML5 revision 1331).
974
975 2008-03-08 Wakaba <wakaba@suika.fam.cx>
976
977 * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
978 element (HTML5 revision 1330).
979
980 2008-03-08 Wakaba <wakaba@suika.fam.cx>
981
982 * HTML.pm.src: <title> is always appended to the current
983 element (HTML5 revision 1328).
984
985 2008-03-08 Wakaba <wakaba@suika.fam.cx>
986
987 * HTML.pm.src: White space in tainted tables are moved
988 into foster parents (HTML5 revision 1326).
989
990 2008-03-08 Wakaba <wakaba@suika.fam.cx>
991
992 * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
993 revision 1321).
994
995 2008-03-08 Wakaba <wakaba@suika.fam.cx>
996
997 * HTML.pm.src: |</h/n/>| case code rearranged to align with
998 the spec (HTML5 revision 1320). Note that we finally complete
999 all of HTML5 revision 1320 changes.
1000
1001 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1002
1003 * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
1004 tags (HTML5 revision 1320).
1005
1006 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1007
1008 * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
1009
1010 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1011
1012 * HTML.pm.src: A "generate implied end tags" code (t409.1)
1013 could not be reached so that it is now removed (HTML5 revision
1014 1320).
1015
1016 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1017
1018 * HTML.pm.src: Code for the case of |</div>| and so on are revised
1019 to align with new spec text (HTML5 revision 1320).
1020
1021 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1022
1023 * HTML.pm.src: Remove strange |if| condition; however, it should
1024 have had no harm in theory.
1025
1026 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1027
1028 * HTML.pm.src (_tree_construction_main): '</p>' in body
1029 case is split from other end tags for the preparation
1030 of implementing HTML revision 1320.
1031
1032 2008-03-07 Wakaba <wakaba@suika.fam.cx>
1033
1034 * HTML.pm.src: Simplified "generate implied end tag" (HTML5
1035 revision 1320).
1036
1037 2008-03-07 Wakaba <wakaba@suika.fam.cx>
1038
1039 * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
1040 and "div" (HTML5 revision 1318). Add comments to where
1041 |form| pointer association codes should be inserted (HTML5 revision
1042 1319).
1043
1044 2008-03-06 Wakaba <wakaba@suika.fam.cx>
1045
1046 * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
1047
1048 2008-03-05 Wakaba <wakaba@suika.fam.cx>
1049
1050 * HTML.pm.src: Since the case t268 should never be reached (no
1051 other token type, there are), it is replaced by a |die| statement.
1052
1053 2008-03-05 Wakaba <wakaba@suika.fam.cx>
1054
1055 * HTML.pm.src: Typo fixed.
1056
1057 2008-03-04 Wakaba <wakaba@suika.fam.cx>
1058
1059 * HTML.pm.src (_tree_construction_initial): Some limited quirks
1060 doctypes were not uppercased for comparison.
1061
1062 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1063
1064 * HTML.pm.src (tree construction and set_inner_html): Checkpoints
1065 are added.
1066
1067 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1068
1069 * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
1070 are set. Cases that are unlikely reached are noted as so.
1071
1072 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1073
1074 * HTML.pm.src: Checkpoints for debugging are added.
1075
1076 * mkhtmlparser.pl: Support for |!!!cp| syntax.
1077
1078 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1079
1080 * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
1081 for simplicity.
1082
1083 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1084
1085 * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
1086 revision 1307).
1087
1088 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1089
1090 * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
1091 is set are changed (HTML5 revision 1305).
1092
1093 2008-03-02 Wakaba <wakaba@suika.fam.cx>
1094
1095 * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
1096 1292). Entities are not parsed in comment-like part in RCDATA
1097 elements (HTML5 revision 1294). Allow bare & at the end
1098 of attribute value literals (HTML5 revision 1296). More
1099 quirks mode doctypes (HTML5 revision 1302). Requires spaces
1100 between attributes and ban attribute names or unquoted
1101 attribute values containing single or double quotes (HTML5
1102 revision 1303).
1103
1104 2008-03-02 Wakaba <wakaba@suika.fam.cx>
1105
1106 * ContentChecker.pm: Typo fixed. Don't raise "character encoding"
1107 and related errors unless it is an HTML document (though the spec
1108 is unclear on whether it is applied to XHTML document).
1109
1110 * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
1111 are added.
1112
1113 2008-03-02 Wakaba <wakaba@suika.fam.cx>
1114
1115 * HTML.pm.src: s/local_name/manakai_local_name/g.
1116
1117 2008-03-01 Wakaba <wakaba@suika.fam.cx>
1118
1119 * _NamedEntityList.pm: Updated (HTML5 revision 1286).
1120
1121 * HTML.pm.src: |charset| in |content| attribute is
1122 case-insensitive (HTML5 revision 1270).
1123
1124 2008-02-26 Wakaba <wakaba@suika.fam.cx>
1125
1126 * ContentChecker.pm: New status constants are added.
1127 ($ElementDefault): |status| added.
1128 (check_element): Err for non-standard or deprecated elements.
1129 (_attr_status_info): For non-standard or deprecated attributes.
1130
1131 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1132
1133 * ContentChecker.pm (_attr_status_info): New internal method.
1134
1135 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1136
1137 * ContentChecker.pm (check_element): Element standardized
1138 status information is now dispatched.
1139
1140 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1141
1142 * ContentChecker.pm (check_element): Fix |del|-and-significant
1143 problem by adding some more arguments.
1144
1145 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1146
1147 * ContentChecker.pm (check_element): Use context of
1148 container-for-the-purpose-of-content-model element (not transparent
1149 element) for |check_child_element| calling and significant
1150 text flag marking. This reintroduces |<del>|-and-significant
1151 problem again.
1152
1153 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1154
1155 * ContentChecker.pm (check_element): Make semi-transparent
1156 elements ignored for the purpose of phase changes in
1157 content model checking.
1158
1159 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1160
1161 * ContentChecker.pm (check_element): In-element state
1162 was not properly managed for transparent cases.
1163
1164 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1165
1166 * ContentChecker.pm (check_element): Support for |video|
1167 and |audio| as semi-transparent elements.
1168
1169 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1170
1171 * ContentChecker.pm ($HTMLSemiTransparentElements): New.
1172 (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
1173 Support for |html:object| as a semi-transparent element.
1174
1175 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1176
1177 * ContentChecker.pm (check_element): The way to traverse
1178 the tree is entirely revised to make it easier to track
1179 the state of ancestors/descendants. As a result of this
1180 revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1181 support for content model checking for HTML elements |figure|,
1182 |object|, |video|, and |audio| and checking for XML elements (and
1183 some XMLNS checkings) are dropped for now. They will be
1184 reimplemented in due cource.
1185
1186 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1187
1188 * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1189 token (HTML5 revision 1225).
1190
1191 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1192
1193 * ContentChecker.pm ({unsupported_level}): New value.
1194
1195 * HTML.pm.src: Save whether |meta| |content| attribute
1196 contains character references or not.
1197
1198 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1199
1200 * ContentChecker.pm (_get_children): (Incompleted) attempt to
1201 imlement significant content checking for contents
1202 with |del| elements.
1203
1204 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1205
1206 * ContenteChecker.pm ($HTMLTransparentElements): More
1207 elements are added.
1208 (_get_children): HTML |object| elements are now semi-transparent.
1209
1210 * NanoDOM.pm (manakai_html, manakai_head): New methods.
1211
1212 2008-02-16 Wakaba <wakaba@suika.fam.cx>
1213
1214 * CacheManifest.pm: HTML5 revision 1211 implemented.
1215
1216 * CacheManifest.pod: Updated.
1217
1218 2008-02-10 Wakaba <wakaba@suika.fam.cx>
1219
1220 * ContentChecker.pm (check_document, check_element): Support
1221 for second argument ($onsubdoc).
1222 (_get_css_parser): Removed (now it is part of WDCC).
1223
1224 2008-02-09 Wakaba <wakaba@suika.fam.cx>
1225
1226 * ContentChecker.pm (_get_css_parser): New.
1227
1228 2007-11-25 Wakaba <wakaba@suika.fam.cx>
1229
1230 * ContentChecker.pm ($AnyChecker): Old way to add child elements
1231 for checking had been used.
1232
1233 2007-11-25 Wakaba <wakaba@suika.fam.cx>
1234
1235 * ContentChecker.pm (check_element): New todo item type |descendant|.
1236
1237 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1238
1239 * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1240 specced in RFC 2046 and RFC 4288.
1241 (application/atom+xml): Definition added.
1242
1243 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1244
1245 * URIChecker.pm: Make RFC 3986 should-level errors
1246 warnings (rather than SHOULD-level errors).
1247
1248 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1249
1250 * NanoDOM.pm (get_user_data, set_user_data): New methods.
1251
1252 * HTML.pm.src: A flag for character references in attribute
1253 values are added. Set |manakai_has_reference| user data
1254 to |charset| attribute.
1255
1256 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1257
1258 * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1259 attributes.
1260
1261 * ContentChecker.pm (check_document): Warn if charset requirements
1262 cannot be tested.
1263
1264 2007-11-19 Wakaba <wakaba@suika.fam.cx>
1265
1266 * HTML.pm.src (parse_byte_string): Detect charset
1267 by universalchardet if charset parameter is not specified.
1268
1269 * Makefile (Charset-all, Charset-clean): New rules.
1270
1271 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1272
1273 * ContentChecker.pm (check_document): Check the existence
1274 of character encoding declaration and selection of encoding
1275 for HTML document.
1276
1277 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1278
1279 * ContentType.pm (get_sniffed_type): Return also the official
1280 type in list context.
1281
1282 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1283
1284 * ContentType.pm: Sniffing with leading white space
1285 ignoring (HTML5 revisions 1013 and 1016).
1286
1287 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1288
1289 * ContentType.pm: HTML5 revision 1013 changes, except for leading
1290 white spaces, are implemented.
1291
1292 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1293
1294 * HTML.pm.src (parse_char_string): Set |inner_encoding|
1295 attribute if possible.
1296
1297 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1298
1299 * HTML.pm.src (parse_byte_string): New method.
1300 (parse_char_string): New alias for |parse_string|.
1301 (main phase): Invoking "change the encoding" algorithm if desired.
1302
1303 * HTML.pod: Updated.
1304
1305 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1306
1307 * HTML.pod (get_inner_html): Removed.
1308
1309 * Makefile (HTML-all, HTML-clean): New.
1310
1311 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1312
1313 * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1314
1315 2007-11-08 Wakaba <wakaba@suika.fam.cx>
1316
1317 * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1318
1319 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1320
1321 * Makefile: |CacheManifest.html| is added.
1322
1323 * CacheManifest.pod: New file.
1324
1325 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1326
1327 * CacheManifest.pm: New module.
1328
1329 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1330
1331 * HTML.pm.src: Support for application cache selection algorithm
1332 callback.
1333
1334 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1335
1336 * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1337
1338 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1339
1340 * Makefile (clean): New rule.
1341
1342 * NanoDOM.pm (public_id, system_id): New attributes.
1343
1344 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1345
1346 * Makefile (CSS-all, CSS-clean, clean): New rules.
1347
1348 2007-10-14 Wakaba <wakaba@suika.fam.cx>
1349
1350 * ContentChecker.pm (check_document): Support for
1351 new |is_xml_root| flag.
1352 (check_element): Support for new |pluses| state.
1353 (_add_pluses): New method.
1354 (_remove_minuses): Support for new |minus| item.
1355
1356 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1357
1358 * ContentChecker.pm: Raise specific error for invalid
1359 root element.
1360
1361 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1362
1363 * ContentChecker.pm: Set level values for later uses.
1364
1365 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1366
1367 * ContentChecker.pm: Support for language tag validation.
1368
1369 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1370
1371 * LangTag.pm (check_rfc3066_language_tag): New method.
1372
1373 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1374
1375 * LangTag.pm: New module.
1376
1377 2007-09-04 Wakaba <wakaba@suika.fam.cx>
1378
1379 * HTML.pm.src: Some error types were wrong.
1380
1381 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1382
1383 * CSS/: New directory.
1384
1385 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1386
1387 * ContentChecker.pm (_check_get_children): Support
1388 for |noscript| in |head|.
1389
1390 2007-08-12 Wakaba <wakaba@suika.fam.cx>
1391
1392 * URI/: New directory.
1393
1394 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1395
1396 * HTML.pm.src: Tokenizer's states are now represented in
1397 number.
1398
1399 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1400
1401 * HTML.pm.src: |or|s for insertion modes are replaced
1402 by |&|s.
1403
1404 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1405
1406 * HTML.pm.src: Token types are now represented in number.
1407
1408 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1409
1410 * ContentType.pm (SEE ALSO): Updated.
1411
1412 * HTML.pm.src: Insertion modes are now represented in number.
1413
1414 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1415
1416 * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1417 999) is implemented.
1418
1419 2007-08-08 Wakaba <wakaba@suika.fam.cx>
1420
1421 * ContentType.pm: Sniffing for |<script| (HTML5 revision
1422 983) is implemented.
1423
1424 2007-08-06 Wakaba <wakaba@suika.fam.cx>
1425
1426 * ContentChecker.pod: New documentation.
1427
1428 * Makefile: A rule for |ContentChecker.html| is added.
1429
1430 * ContentChecker.pm: A pod "LICENSE" section is added.
1431
1432 * NanoDOM.pm ($VERSION): New variable.
1433
1434 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1435
1436 * H2H.pm: |b|, |i|, and |sub| are added to the
1437 list of allowed HTML elements.
1438
1439 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1440
1441 * H2H.pm: |samp| is added to the list of allowed
1442 HTML elements.
1443
1444 * URIChecker.pm (check_iri): New.
1445 (check_iri_reference): Error type for IRI reference
1446 syntax error is changed.
1447
1448 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1449
1450 * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1451 is added.
1452 (check_document): Load appropriate module before validation.
1453
1454 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1455
1456 * ContentChecker/: New directory.
1457
1458 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1459
1460 * ContentChecker.pm: HTML |time| element is implemented.
1461
1462 * HTMLTable.pm: Comments are updated as HTML5 is revised.
1463
1464 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1465
1466 * ContentChecker.pm (check_document): Return value
1467 even if no document element is found.
1468
1469 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1470
1471 * HTML.pm.src: |$in_body| is no longer a function.
1472
1473 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1474
1475 * HTML.pm.src: The |$in_body| code has been moved down.
1476
1477 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1478
1479 * HTML.pm.src: The "trailing end" insertion mode
1480 is split into "after html body" and "after html frameset"
1481 insertion modes. Their codes are merged with "after body"
1482 and "after frameset" codes. |$previous_insertion_mode|
1483 has been removed. "after frameset" code is
1484 merged with "in frameset" code.
1485
1486 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1487
1488 * HTML.pm.src: The "before head" insertion mode is
1489 merged with the "in head" insertion mode.
1490
1491 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1492
1493 * HTML.pm.src: Cases in "in head" insertion mode are
1494 reorganized.
1495
1496 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1497
1498 * HTML.pm.src: Some cases in "in table" insertion mode
1499 are merged.
1500
1501 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1502
1503 * HTML.pm.src: The "in row" insertion mode is merged
1504 with "in table" insertion mode.
1505
1506 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1507
1508 * HTML.pm.src: The "in table" and "in table body" insertion
1509 modes are merged.
1510
1511 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1512
1513 * HTML.pm.src: There is no "in table head" or "in table foot"
1514 insertion mode!
1515
1516 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1517
1518 * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
1519 now directly invoke the handler.
1520
1521 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1522
1523 * HTML.pm.src: Codes for "in cell" insertion mode
1524 is merged to the "in body" insertion mode code.
1525
1526 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1527
1528 * HTML.pm.src: Codes for "in body" and "in caption"
1529 insertion modes are merged.
1530
1531 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1532
1533 * HTML.pm.src: Two |!!!next-token|s were missing.
1534
1535 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1536
1537 * HTML.pm.src: Use numeric constant for |{content_mode}|
1538 instead of string constant for |{content_model_flag}|.
1539
1540 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1541
1542 * HTML.pm.src: Add the name of the attribute
1543 to the "duplicate attribute" error.
1544
1545 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1546
1547 * ContentChecker.pm: Return the |class| node list.
1548
1549 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1550
1551 * ContentChecker.pm: Return the |id| node list.
1552
1553 * HTML.pm.src: A typo is fixed.
1554
1555 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1556
1557 * ContentChecker.pm: Drop wrong |level => 'error'| specification
1558 from "in HTML:xml:lang" error. Character position
1559 is now the last part of the error type in the URI error
1560 description. Report "unsupported" status for language
1561 tags, media queries, script codes, and style sheets.
1562
1563 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1564
1565 * ContentChecker.pm: Report error if |xml:lang|
1566 in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
1567 in XML.
1568
1569 * NanoDOM.pm (Attr.owner_document): New attribute.
1570
1571 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1572
1573 * HTML.pm.src: The character immediately following
1574 a bare |hcro| was discarded. Fix handling of
1575 entity references in attribute values.
1576
1577 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1578
1579 * HTML.pm.src (main and trailin end phases): Token
1580 types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
1581 factored out. Error types |in html:#DOCTYPE|
1582 and |after html:#DOCTYPE| are merged
1583 into |DOCTYPE in the middle|. |</frameset>|
1584 in fragment parsing mode changed the
1585 insertion mode.
1586
1587 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1588
1589 * HTML.pm.src: |$phase| has been removed; The |trailing end|
1590 phase is now an insertion mode. Treatments for white
1591 space character tokens were incorrect for some
1592 insertion modes. An old |meta| case was not removed.
1593
1594 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1595
1596 * HTML.pm.src: |meta| charset declaration extraction
1597 implemented (but changing the encoding is not yet:-).
1598
1599 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1600
1601 * Charset/: New directory.
1602
1603 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1604
1605 * H2H.pm: New Perl module (created from
1606 manakai's H2H.dis).
1607
1608 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1609
1610 * XMLSerializer.pm: New Perl module (created from
1611 manakai's SimpleLS.dis).
1612
1613 2007-07-07 Wakaba <wakaba@suika.fam.cx>
1614
1615 * HTML.pm.src: |<!---x-->| was not processed correctly.
1616
1617 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1618
1619 * HTML.pm.src: Report correct error message
1620 for |<body></div></body>|.
1621
1622 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1623
1624 * HTMLTable.pm: An error description was incorrect.
1625
1626 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1627
1628 * ContentChecker.pm: Return |{term}| list.
1629
1630 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1631
1632 * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
1633 nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
1634 and <title> outside of head).
1635
1636 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1637
1638 * IMTChecker.pm: Report warning for unregistered
1639 and private types/subtypes.
1640
1641 * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
1642 URIChecker.pm, HTMLTable.pm: Error messages are now
1643 consistent; they are all listed in
1644 <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
1645
1646 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1647
1648 * ContentChecker.pm: |<img ismap>| not in |<a></a>|
1649 is now erred. |<datalist>| is implemented.
1650 Attribute checker for |<command>| and |<menu>| are
1651 added. Support for |contextmenu| global attribute
1652 is added.
1653
1654 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1655
1656 * HTML.pm.src (_reset_insertion_mode): Interpretation
1657 of Step 3 has been changed.
1658
1659 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1660
1661 * HTML.pm.src: Late |<html>| parse error is implemented.
1662
1663 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1664
1665 * URIChecker.pm (check_iri_reference): A |decode| method name was
1666 incorrect.
1667
1668 * ContentChecker.pm: Support for the |footer| element.
1669 Check URI syntax for space-separated URI attributes.
1670 Support for the |tabindex| attribute. Support
1671 for |datetime| attribute.
1672
1673 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1674
1675 * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
1676 character references). HTML5 revision 1.146 (white space
1677 characters before root start tag). HTML5 revision
1678 1.148 (named character references in attribute values).
1679 HTML5 revision 1.152 (<plaintext>.innerHTML get).
1680
1681 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1682
1683 * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
1684 in <head>).
1685
1686 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1687
1688 * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>
1689 in body).
1690
1691 * ContentChecker.pm: HTML5 revision 938 (scoped="").
1692
1693 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1694
1695 * HTML.pm.src: HTML5 revision 923 (matching end tag in CDATA
1696 or RCDATA in fragment parsing mode). HTML5 revision
1697 924 (<!--> and <!--->). HTML5 revision 926 (hn in hn).
1698
1699 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1700
1701 * HTML.pm.src (get_inner_html): HTML5 revision 922 (inner_html
1702 for <pre> and <textarea>).
1703
1704 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1705
1706 * HTML.pm.src: HTML5 revision 920 (<isindex>).
1707
1708 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1709
1710 * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).
1711
1712 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1713
1714 * HTML.pm.src: HTML5 revision 916 (</body>).
1715 HTML5 revision 917 (conforming bare &).
1716
1717 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1718
1719 * NanoDOM.pm (manakai_is_html): Setting to false did not work.
1720
1721 * HTML.pm.src: HTML5 revision 914 (</ in CDATA, RCDATA).
1722 HTML5 revision 915 (<nobr>).
1723
1724 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1725
1726 * HTML.pm.src: HTML5 revisions 908, 909, 912, and 913 (quirks mode).
1727
1728 * NanoDOM.pm (manakai_is_html, manakai_compat_mode, compat_mode):
1729 New attributes.
1730
1731 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1732
1733 * HTML.pm.src: HTML5 revisions 900, 901, 902, and 911 (<
1734 in tags).
1735
1736 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1737
1738 * .cvsignore: |Entities.html| is added.
1739
1740 * HTML.pm.src: |$entity_char| is removed and
1741 requires |Whatpm::_NamedEntityList| instead.
1742 HTML5 revision 898 (refc), except that lack of refc
1743 is parse error.
1744
1745 * mkentitylist.pl: New script.
1746
1747 * Makefile (all): |_NamedEntityList.pm| is added.
1748 (_NamedEntityList.pm, Entities.html): New rules.
1749
1750 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1751
1752 * HTML.pm.src: Parse errors immediately after U+000D
1753 were ignored and U+000D immediately following another
1754 U+000D was not converted to U+000A.
1755
1756 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1757
1758 * HTML.pm.src (set_inner_html): HTML5 revision 892 (adopt
1759 nodes before appended). Parser was not ready for NULL
1760 parse error and escape flag.
1761
1762 * NanoDOM.pm (adopt_node): New.
1763
1764 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1765
1766 * HTML.pm.src: HTML5 revision 886 (insane comment in
1767 CDATA and RCDATA). Note that current implementation
1768 is simply repeating what the spec says and it is maybe not
1769 a best way to do it.
1770
1771 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1772
1773 * HTML.pm.src: HTML5 revision 884 (</form> don't close
1774 the form element if a descendant element without implied end tag has
1775 still been open).
1776
1777 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1778
1779 * ContentChecker.pm: HTML5 revision 881 (Make |id| attribute with
1780 space characters non-conforming).
1781
1782 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1783
1784 * HTML.pm.src: An error message was incorrect.
1785 HTML5 revision 869 (C1 character references).
1786
1787 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1788
1789 * HTML.pm.src: HTML5 revision 867 (a LF at the beginning of
1790 a |textarea| is removed).
1791
1792 2007-06-05 Wakaba <wakaba@suika.fam.cx>
1793
1794 * NanoDOM.pm (get_attribute_node_ns): New method.
1795
1796 * ContentChecker.pm: |script| |async| and |defer|
1797 no longer require |src|. |async| MUST NOT be
1798 specified if |defer|. (HTML5 revision 858).
1799
1800 2007-05-30 Wakaba <wakaba@suika.fam.cx>
1801
1802 * HTML.pm.src: |<form><form>| went to inifinite loop.
1803
1804 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1805
1806 * ContentChecker.pm (html): Set |is_root| (allowed
1807 as a document element) flag on.
1808 (new): Removed.
1809 (check_document): New method.
1810
1811 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1812
1813 * ContentChecker.pm (thead, tfoot): Checker specifications
1814 were incorrect.
1815
1816 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1817
1818 * ContentChecker.pm ($HTMLURIAttrChecker): Include
1819 error position in the |type| option of the error.
1820
1821 * HTMLTable.pm (form_table): The |$onerror| parameter
1822 is now optional. Some bugs are fixed.
1823
1824 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1825
1826 * HTMLTable.pm: New module.
1827
1828 * ContentChecker.pm (table): Invoke table model error checker.
1829
1830 * NanoDOM.pm (first_child, get_attribute_ns): New.
1831
1832 2007-05-26 Wakaba <wakaba@suika.fam.cx>
1833
1834 * ContentChecker.pm ($HTMLLinkTypesAttrChecker): New checker.
1835 (link/@rel, a/@rel, area/@rel): Use new checker.
1836
1837 * Makefile (_LinkTypeList.pm, RelExtensions.html): New rules.
1838
1839 * _LinkTypeList.pm: New file.
1840
1841 * mklinktypelist.pl: New file.
1842
1843 * .cvsignore: |RelExtensions.html| added.
1844
1845 * NanoDOM.pm (child_nodes): Returns an empty array
1846 for non-child-containing node types.
1847 (text_content): New attribute.
1848
1849 2007-05-26 Wakaba <wakaba@suika.fam.cx>
1850
1851 * IMTChecker.pm: New module.
1852
1853 * ContentChecker.pm ($HTMLIMTAttrChecker): Call IMTChecker
1854 to test parameter value validity.
1855
1856 * HTML.pm.src ($style_start_tag): Attributes were
1857 discarded.
1858
1859 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1860
1861 * ContentChecker.pm ($HTMLURIAttrChecker): Implemented.
1862
1863 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1864
1865 * URIChecker.pm: All recommendations from RFC 3986
1866 and RFC 3987 are listed (not all testable items are checked yet).
1867
1868 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1869
1870 * URIChecker.pm: New module.
1871
1872 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1873
1874 * ContentChecker.pm: Now most attributes are implemented
1875 or associated to some placeholder.
1876 ($ElementDefault): Warn unknown attributes
1877 for unknown elements as "attribute not supported".
1878 ($HTMLLanguageTagAttrChecker, $HTMLMQAttrChecker): New placeholders.
1879 ($HTMLUsemapAttrChecker, $HTMLTargetAttrChecker): New checkers.
1880 (|a| attribute checker): Reimplemented.
1881
1882 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1883
1884 * ContentChecker.pm ($HTMLEventHandlerAttrChecker): New placeholder.
1885 ($HTMLAttrChecker): Event handler content attributes
1886 are added.
1887 (link, embed): Required attribute is now checked.
1888 (embed): Unknown local attributes are no longer warned.
1889
1890 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1891
1892 * ContentChecker.pm ($HTMLSpaceURIsAttrChecker): New placeholder.
1893 ($HTMLIMTAttrChecker): New checker.
1894 (link@rel, link@href, link@type, style@type,
1895 a@href, a@ping, a@ping, a@type, embed@src, embed@type,
1896 object@data, object@type, source@src, source@type, area@alt,
1897 area@shape, area@coords, area@href,
1898 area@ping, area@rel, area@type, script@src,
1899 script@defer, script@async, script@type): Checkers added.
1900
1901 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1902
1903 * ContentChecker.pm: Descendant checking was incorrect.
1904
1905 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1906
1907 * ContentChecker.pm: Support |xml:*| and |xmlns:*|
1908 attributes. Report an error if |Element.prefix|
1909 is |xmlns|.
1910
1911 * NanoDOM.pm (prefix): New attribute.
1912
1913 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1914
1915 * HTML.pm.src: In |main| phase, |in body| insertion
1916 mode, action for |<iframe>| was missing.
1917
1918 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1919
1920 * ContentChecker.pm: Support for many of HTML5 elements.
1921 ($GetHTMLNonNegativeIntegerAttrChecker): New.
1922
1923 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1924
1925 * ContentChecker.pm: Support for most elements up to |progress|.
1926 ($HTMLURIAttrChecker): Placeholder.
1927 ($HTMLIntegerAttrChecker, $GetHTMLFloatingPointNumberAttrChecker):
1928 New.
1929
1930 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1931
1932 * ContentChecker.pm: Attribute checkers
1933 for global attributes, |html|, |base|, |style|, and |meta|.
1934
1935 * NanoDOM.pm (insert_before): Weaken reference
1936 to the parent node.
1937 (Attr::new): Set |owner_element| attribute.
1938 (namespace_uri, manakai_local_name): New attribute implementations.
1939 (owner_element): New attribute.
1940
1941 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1942
1943 * ContentChecker.pm ($AttrChecker, $HTMLAttrChecker,
1944 $AnyChecker->{attr_checker}, $HTMLAttrsChecker,
1945 $Element->{$HTML_NS}->{''}): New.
1946 (check_element): Invoke attrs_checker for each element.
1947
1948 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1949
1950 * ContentChecker.pm: Don't use |manakai_element_type_match|.
1951
1952 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1953
1954 * ContentChecker.pm: Use hashs rather than lists for
1955 element type testings.
1956
1957 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1958
1959 * ContentChecker.pm: Don't generate duplicate
1960 error when an element type is put in the "minus" list
1961 and the element type is not allowed explicitly in the particular
1962 element content model.
1963 (html:a checker): New checker.
1964 (html:details, html:datagrid): New checkers.
1965 (html:legend): New checker.
1966
1967 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1968
1969 * ContentChecker.pm (html:li checker): Implemented.
1970
1971 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1972
1973 * ContentChecker.pm ($HTMLInlineOrStriclyInlineChecker): New
1974 checker.
1975 (html:dd checker): New checker.
1976 (html:q, html:em, html:strong, html:small,
1977 html:m, html:dfn, html:code, html:samp, html:span): New checkers.
1978
1979 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1980
1981 * ContentChecker.pm ($AnyChecker): Renamed
1982 from |$ElementDefault->{checker}|.
1983 ($ElementDefault->{checker}): Throw an error that
1984 the element type is not supported by the checker.
1985 ($HTMLMetadataElement): |html:base| was missing.
1986 ($HTMLEmptyChecker): Don't throw an error
1987 for inter-element whitespace nodes.
1988 (html:html checker): Errors were not
1989 thrown even if |html:head| and/or |html:body|
1990 children were missing.
1991 (html:head checker): An error was not
1992 thrown if <meta charset> appered after other
1993 elements.
1994
1995 2007-05-05 Wakaba <wakaba@suika.fam.cx>
1996
1997 * ContentChecker.pm: |footer|, |video|, |audio|, |script|,
1998 and |noscript| elements are implemented.
1999 (new): New method.
2000
2001 2007-05-04 Wakaba <wakaba@suika.fam.cx>
2002
2003 * ContentChecker.pm: New module.
2004
2005 2007-05-04 Wakaba <wakaba@suika.fam.cx>
2006
2007 * NanoDOM.pm (manakai_parent_element,
2008 document_element, manakai_local_name,
2009 manakai_element_type_match): New method.
2010
2011 2007-05-03 Wakaba <wakaba@suika.fam.cx>
2012
2013 * HTML.pm.src: Replace decimal and hexadecimal numeric
2014 entities in C1 range using Windows-1252 mapping. Bare LF
2015 did not count as new line for error reporting.
2016
2017 2007-05-02 Wakaba <wakaba@suika.fam.cx>
2018
2019 * NanoDOM.pm (DOMImplementation): New class.
2020 (append_child): Weaken the |parent_node| reference.
2021 (create_element_ns, Element new): Set the |owner_document|
2022 reference.
2023 (implementation): New attribute.
2024 (owner_document, local_name, namespace_uri): New attributes.
2025
2026 * HTML.pm.src (parse_string): Line and column numbers
2027 are now provided to error handler.
2028 (!!!parse-error): Short descriptions are added.
2029 (_construct_tree): Split into three methods; support
2030 for innerHTML mode.
2031 (set_inner_html): New method.
2032
2033 2007-05-01 Wakaba <wakaba@suika.fam.cx>
2034
2035 * NanoDOM.html: Documentation is added.
2036
2037 * HTML.pod, ContentType.html: Documentation is revised.
2038
2039 * .cvsignore: Pod2html temporary files are added.
2040
2041 * Makefile: Make |NanoDOM.html|.
2042
2043 2007-05-01 Wakaba <wakaba@suika.fam.cx>
2044
2045 * HTML.pm.src (parse_string): New method.
2046 (get_inner_html): Renamed from |inner_html|.
2047
2048 * Makefile: A rule for |HTML.html| is added.
2049
2050 * HTML.pod: New documentation.
2051
2052 2007-05-01 Wakaba <wakaba@suika.fam.cx>
2053
2054 * NanoDOM.pm (last_child, previous_sibling): New attributes.
2055 (clone_node): Attribute nodes were not completely copied.
2056
2057 * HTML.pm.src: Many bugs are fixed.
2058
2059 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2060
2061 * HTML.pm.src: Some typos are fixed.
2062
2063 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2064
2065 * mkhtmlparser.pl, Makefile: References to the |HTML-consume-entity.src|
2066 are removed.
2067
2068 * HTML.pm.src: Tokenizer's handling on named entities are rewritten.
2069
2070 * HTML-consume-entity.src: Removed.
2071
2072 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2073
2074 * HTML.pm.src: Tokenizer's handling on hexadecimal
2075 numeric entities are rewritten.
2076
2077 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2078
2079 * HTML.pm.src: Some tokenizer bugs are fixed.
2080
2081 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2082
2083 * HTML.pm.src: The tree construction stage is implemented.
2084
2085 * mkhtmlparser.pl: New macros are added.
2086
2087 2007-04-28 Wakaba <wakaba@suika.fam.cx>
2088
2089 * ContentType.pm: A note on bug in the specification
2090 is removed since it's been now fixed.
2091
2092 * .cvsignore: New file.
2093
2094 2007-04-28 Wakaba <wakaba@suika.fam.cx>
2095
2096 * HTML.pm.src, HTML-consume-entity.src: New files.
2097
2098 * Makefile (HTML.pm): New rule.
2099
2100 * mkhtmlparser.pl: New script.
2101
2102 2007-04-25 Wakaba <wakaba@suika.fam.cx>
2103
2104 * Makefile: New file.
2105
2106 2007-04-24 Wakaba <wakaba@suika.fam.cx>
2107
2108 * ContentType.pm: An error in pod is fixed.
2109
2110 2007-04-24 Wakaba <wakaba@suika.fam.cx>
2111
2112 * ContentType.pm: Documentation is added.
2113
2114 2007-04-24 Wakaba <wakaba@suika.fam.cx>
2115
2116 * ContentType.pm: New Perl module.
2117
2118 * ChangeLog: New file.
2119
2120

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24