/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Contents of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.332 - (show annotations) (download)
Sun Sep 21 11:55:49 2008 UTC (16 years, 1 month ago) by wakaba
Branch: MAIN
Changes since 1.331: +5 -0 lines
++ whatpm/Whatpm/ChangeLog	21 Sep 2008 11:41:39 -0000
	* ContentChecker.pm: |contextmenu| validness is now checked using
	|id| and |id_type| properties, and |menu| property is removed.

2008-09-21  Wakaba  <wakaba@suika.fam.cx>

++ whatpm/Whatpm/ContentChecker/ChangeLog	21 Sep 2008 11:44:45 -0000
	* HTML.pm: |id| attribute checkers for the |menu| and |map|
	elements are removed; the |id| attribute of those elements is now
	checked by the global |id| attribute.

2008-09-21  Wakaba  <wakaba@suika.fam.cx>

1 2008-09-21 Wakaba <wakaba@suika.fam.cx>
2
3 * ContentChecker.pm: |contextmenu| validness is now checked using
4 |id| and |id_type| properties, and |menu| property is removed.
5
6 2008-09-21 Wakaba <wakaba@suika.fam.cx>
7
8 * ContentChecker.pm: Prepare for |form| |name| attribute's
9 duplication checking.
10
11 2008-09-21 Wakaba <wakaba@suika.fam.cx>
12
13 * HTML.pm.src (parse_byte_stream): Support (or non-support) for
14 unsupported charset="" parameter value (HTML5 revision 2131).
15
16 2008-09-20 Wakaba <wakaba@suika.fam.cx>
17
18 * HTML.pm.src: Reminding places where U+000B is allowed as a space
19 character is fixed (cf. HTML5 revision 1738).
20
21 * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
22 space characters (HTML5 revision 1738).
23
24 2008-09-20 Wakaba <wakaba@suika.fam.cx>
25
26 * HTML.pm.src: The "anything else" case for the "after after body"
27 insertion mode was not updated to swtich to the "in body"
28 insertion mode. U+000B is no longer a space character for the
29 purpose of tree construction phase (HTML5 revision 1738).
30
31 2008-09-20 Wakaba <wakaba@suika.fam.cx>
32
33 * HTML.pm.src: U+000B is no longer a space character (HTML5
34 revision 1738).
35
36 2008-09-20 Wakaba <wakaba@suika.fam.cx>
37
38 * ContentType.pm: 0x0B is no longer a space character (HTML5
39 revision 1738).
40
41 * HTML.pm.src: U+000B is no longer a space character for the
42 algorithm for extracting an encoding from a Content-Type (HTML5
43 revision 1738).
44
45 2008-09-20 Wakaba <wakaba@suika.fam.cx>
46
47 * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
48
49 2008-09-18 Wakaba <wakaba@suika.fam.cx>
50
51 * LangTag.pm: Add checks for remaining requirements from RFC 4646.
52
53 * mklangreg.pl: Sort 'Prefix' values by their length, to ease
54 matching.
55
56 2008-09-18 Wakaba <wakaba@suika.fam.cx>
57
58 * LangTag.pm: Warn for private use language subtags. Error level
59 typos fixed. Support for Suppress-Script field.
60
61 * mklangreg.pl: Support for dumping of nested structure.
62
63 2008-09-18 Wakaba <wakaba@suika.fam.cx>
64
65 * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
66 recommended case as per RFC 4646.
67
68 2008-09-18 Wakaba <wakaba@suika.fam.cx>
69
70 * LangTag.pm (check_rfc4646_langtag): New method.
71
72 2008-09-18 Wakaba <wakaba@suika.fam.cx>
73
74 * mklangreg.pl: New script.
75
76 * Makefile: Updated for creation of the module for language subtag
77 registry.
78
79 2008-09-16 Wakaba <wakaba@suika.fam.cx>
80
81 * Makefile: WebIDL.html added.
82
83 * WebIDL.pod: New documentation.
84
85 2008-09-16 Wakaba <wakaba@suika.fam.cx>
86
87 * WebIDL.pm: Checker's error types are redefined.
88
89 2008-09-16 Wakaba <wakaba@suika.fam.cx>
90
91 * WebIDL.pm: Parser's error types are redefined. Some forward
92 compatible parsing bugs are fixed. Some unreachable codes are
93 commented out.
94
95 2008-09-16 Wakaba <wakaba@suika.fam.cx>
96
97 * WebIDL.pm: Support for the reminding extended attributes are
98 added. It does not satisfy the definition that a forward
99 interface declaration has an extended attribute. It seems that
100 unless explicitly allowed multiple extended attributes with the
101 same name is not allowed, though it is not explicitly mentioned in
102 the spec.
103
104 2008-09-16 Wakaba <wakaba@suika.fam.cx>
105
106 * WebIDL.pm: Unescapes extended attribute names and extended
107 attribute identifiers. Preserve whether an extended attribute has
108 an argument list of not. Support for extended attributes:
109 Constructor, ExceptionConsts, IndexGetter, IndexSetter,
110 NameGetter, NameSetter, and Null.
111 (has_argument_list): New attribute.
112 (idl_text): Stringifies argument lists, if any, even if it is
113 empty.
114
115 2008-09-15 Wakaba <wakaba@suika.fam.cx>
116
117 * HTML.pm.src: New state |PCDATA_STATE|. Use an empty string for
118 |{s_kwd}| in DATA_STATE as default.
119
120 2008-09-15 Wakaba <wakaba@suika.fam.cx>
121
122 * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
123 by |{s_kwd}| in DATA_STATE.
124
125 2008-09-15 Wakaba <wakaba@suika.fam.cx>
126
127 * HTML.pm.src: Shorten keys.
128
129 2008-09-15 Wakaba <wakaba@suika.fam.cx>
130
131 * HTML.pm.src: Remove checking for control character, surrogate
132 pair, or noncharacter code points and non-Unicode code
133 points (they should be handled by Whatpm::Charset::UnicodeChecker).
134 (parse_char_stream): Support for the |$get_wrapper| argument and
135 character stream error handlers.
136
137 2008-09-15 Wakaba <wakaba@suika.fam.cx>
138
139 * ContentChecker.pm: Don't call |loda_ns_module|
140 for null-namespace elements/attributes.
141
142 * HTML.pm.src: Fact out $disallowed_control_chars
143 as a hash.
144
145 2008-09-14 Wakaba <wakaba@suika.fam.cx>
146
147 * HTML.pm.src: Regexp typo fixed. |{prev_char}|
148 and |{next_char}| initializations are moved to initialization
149 method. |{read_until}| now supports buffering. Sync |set_inner_html|
150 with |parse_char_stream|.
151
152 2008-09-14 Wakaba <wakaba@suika.fam.cx>
153
154 * HTML.pm.src (parse_char_stream): Make |set_next_char|
155 invoke |manakai_read_until|, not only |read|, where
156 possible, to decrease the number of |read| method calls.
157
158 * mkhtmlparser.pl: Related changes to the aforementioned
159 modification.
160
161 2008-09-14 Wakaba <wakaba@suika.fam.cx>
162
163 * HTML.pm.src: Use |read| instead of |getc|. |set_inner_html|
164 would report character error from now.
165
166 2008-09-14 Wakaba <wakaba@suika.fam.cx>
167
168 * HTML.pm.src: White-space-leaded non-white-space character
169 tokens in "before head insertion mode" was not
170 correctly handled.
171 (set_inner_html): Reimplemented using CharString decodehandle
172 class. Support for $get_wrapper argument. Support
173 for |{read_until}| feature.
174
175 2008-09-14 Wakaba <wakaba@suika.fam.cx>
176
177 * HTML.pm.src: Make a "bare ero" error for unknown
178 entities point the "&" character.
179
180 2008-09-14 Wakaba <wakaba@suika.fam.cx>
181
182 * HTML.pm.src: It turns out that U+FFFD don't have to
183 be added to the list of excluded characters.
184
185 2008-09-14 Wakaba <wakaba@suika.fam.cx>
186
187 * HTML.pm.src ($char_onerror): Have character decoder's |line|
188 and |column| a higher priority than the one set by the
189 tokenizer's input handler.
190 ($self->{read_until}): Exclude U+FFFD (but this might
191 not be necessary, since now we do line/column fixup in
192 the character decode handle).
193
194 2008-09-14 Wakaba <wakaba@suika.fam.cx>
195
196 * HTML.pm.src: Use |{read_until}| where possible.
197
198 2008-09-14 Wakaba <wakaba@suika.fam.cx>
199
200 * HTML.pm.src: Change |{getc_until}| to |{read_until}|
201 and |manakai_getc_until| to |manakai_read_until| to
202 reduce the number of string copies.
203
204 2008-09-14 Wakaba <wakaba@suika.fam.cx>
205
206 * HTML.pm.src (parse_char_string): Use newly created
207 |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
208 standard feature to |open| a string as a filehandle,
209 since Perl's string filehandle seems not supporting |ungetc|
210 method correctly.
211 (parse_char_stream): Define |{getc_until}| method.
212 (DATA_STATE): Experimental support for |getc_until| feature.
213
214 2008-09-13 Wakaba <wakaba@suika.fam.cx>
215
216 * HTML.pm.src: Check points added to newly added branches.
217
218 2008-09-13 Wakaba <wakaba@suika.fam.cx>
219
220 * HTML.pm.src: Remove |{char}|, which is no longer used.
221 Remove |{entity_in_attr}| and |{last_attribute_value_state}|
222 and replaced by |{prev_state}|.
223
224 * mkhtmlparser.pl: Remove |{char}| feature.
225 Remove |!!!back-next-input-character;| macro.
226
227 2008-09-13 Wakaba <wakaba@suika.fam.cx>
228
229 * HTML.pm.src: Finally we get rid of all the inner loops. Remove
230 entity related tokenizer states in favor of new states
231 implementing the consume character reference algorithm.
232
233 2008-09-13 Wakaba <wakaba@suika.fam.cx>
234
235 * HTML.pm.src: "Consume a character reference" algorithm is
236 now implemented as a tokenizer's state, rather than
237 a method, with minimum changes (more changes will
238 be made, in due course). "Bogus comment state"'s inner
239 loop gets removed.
240
241 2008-09-13 Wakaba <wakaba@suika.fam.cx>
242
243 * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
244 into their own tokenizer states.
245
246 2008-09-13 Wakaba <wakaba@suika.fam.cx>
247
248 * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
249 is split into three states.
250
251 2008-09-13 Wakaba <wakaba@suika.fam.cx>
252
253 * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
254 itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
255 no longer does the tokenizer have to push back next input
256 characters in those states.
257
258 2008-09-13 Wakaba <wakaba@suika.fam.cx>
259
260 * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
261 into four states so that no longer does the tokenizer have to push
262 back next input characters in that state.
263
264 2008-09-11 Wakaba <wakaba@suika.fam.cx>
265
266 * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
267 which can be used to insert some wrapper between the character
268 stream handle and the tokenizer. (It is currently not supported
269 for |set_inner_html| for |Element|s).
270
271 2008-09-10 Wakaba <wakaba@suika.fam.cx>
272
273 * HTML.pm.src: Ignore punctuations in charset names.
274
275 2008-09-10 Wakaba <wakaba@suika.fam.cx>
276
277 * ContentChecker.pm: Support for charset-layer error levels.
278
279 * HTML.pm.src: Don't specify |text| argument for the
280 |chardecode:fallback| error, since it is not the encoding
281 being used alternatively.
282
283 2008-09-06 Wakaba <wakaba@suika.fam.cx>
284
285 * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
286
287 2008-08-31 Wakaba <wakaba@suika.fam.cx>
288
289 * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
290
291 2008-08-31 Wakaba <wakaba@suika.fam.cx>
292
293 * HTML.pm.src: Bug fix and sync with the spec with regard
294 to after after frameset insertion mode processing (HTML5
295 revision 1909). Note that the implementation was wrong
296 per the old spec before the r1909 changes.
297
298 2008-08-30 Wakaba <wakaba@suika.fam.cx>
299
300 * HTMLTable.pm: scope=auto algorithm fix synced with the
301 spec (HTML5 revision 2093).
302 ($process_row): Algorithm step numbers synced with the
303 spec (HTML5 revision 2092).
304
305 2008-08-30 Wakaba <wakaba@suika.fam.cx>
306
307 * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
308 revision 2094).
309
310 2008-08-30 Wakaba <wakaba@suika.fam.cx>
311
312 * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
313
314 2008-08-30 Wakaba <wakaba@suika.fam.cx>
315
316 * HTML.pm.src: '"' and "'" at the end of attribute
317 name (after another attribute) now raise parse error (HTML5
318 revision 2123). Empty unquoted attribute values are no
319 longer allowed (HTML5 revision 2122).
320
321 2008-08-30 Wakaba <wakaba@suika.fam.cx>
322
323 * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
324 revision 2130).
325
326 2008-08-30 Wakaba <wakaba@suika.fam.cx>
327
328 * ContentChecker.pm: |xml:lang| attribute value must be same
329 as |lang| attribute value for HTML elements (HTML5 revision 2062
330 and so on).
331
332 2008-08-30 Wakaba <wakaba@suika.fam.cx>
333
334 * ContentChecker.pm: Error level definition for |xml_id_error|
335 was missing.
336
337 * URIChecker.pm: The end of the URL should be marked as the
338 error location for an empty path error. The position
339 between the userinfo and the port components should be
340 marked as the error location for an empty host error.
341
342 2008-08-30 Wakaba <wakaba@suika.fam.cx>
343
344 * URIChecker.pm: Set parameters representing where in the
345 value the error occurs for errors. Report unknown
346 address format error in warning level, since address
347 formats are rarely added. Path segments starting with "/.."
348 were misinterpreted as a dot-segment.
349
350 2008-08-30 Wakaba <wakaba@suika.fam.cx>
351
352 * URIChecker.pm (check_iri_reference): Requires
353 |Message::DOM::DOMImplementation|.
354
355 2008-08-29 Wakaba <wakaba@suika.fam.cx>
356
357 * IMTChecker.pm: Updated for the new error reporting architecture.
358
359 * ContentChecker.pm: Error levels for IMTs are added.
360
361 2008-08-17 Wakaba <wakaba@suika.fam.cx>
362
363 * H2H.pm (_shift_token): Support for unquoted HTML attribute
364 values.
365
366 2008-08-16 Wakaba <wakaba@suika.fam.cx>
367
368 * CacheManifest.pm: Support for new style of error
369 reports.
370
371 * HTML.pm.src: Set line=1, column=1 to the document node.
372
373 2008-08-16 Wakaba <wakaba@suika.fam.cx>
374
375 * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
376 and URL checkers. Support for more error levels for bogus
377 langauge tag and URL "standards".
378
379 * LangTag.pm, URIChecker.pm: Support for new style error
380 level reporting.
381
382 2008-08-15 Wakaba <wakaba@suika.fam.cx>
383
384 * ContentChecker.pm: Support for RDF/XML error levels.
385
386 * HTMLTable.pm, RDFXML.pm: Support for new style of error level
387 specifying. Error types are revised.
388
389 2008-08-15 Wakaba <wakaba@suika.fam.cx>
390
391 * ContentChecker.pm: All error reporting method calls are
392 renewed.
393
394 2008-08-15 Wakaba <wakaba@suika.fam.cx>
395
396 * HTML.pm.src: All error type names and "text" parameters
397 are revised. Use new style for "level" specification.
398
399 * mkhtmlparser.pl: Use new style for "level" specification.
400
401 2008-08-03 Wakaba <wakaba@suika.fam.cx>
402
403 * WebIDL.pm (parse_char_string): Simplified error
404 reporting process for broken ignored valuetype definition.
405 (Valuetype idl_text): Support for special "DOMString" name.
406
407 2008-08-03 Wakaba <wakaba@suika.fam.cx>
408
409 * WebIDL.pm ($get_scoped_name): Append "::::" if the last
410 terminal of the ScopedName is "DOMString", such that whether
411 the last part of the scoped name is "DOMString" or "_DOMString"
412 later. It is necessary to determine whether a |typedef|
413 definition should be ignored or not.
414 (parse_char_string): Unescape the identifier of
415 exception members.
416 ($resolve): Return undef for builtin types and sequence<T>
417 types (we might not have to do this, however...).
418 (check): Support checking for Exceptions, Valuetypes,
419 and Typedefs.
420 ($serialize_type): Support for "DOMString::::" syntax.
421 (Typedef idl_text): Output Type as "DOMString" if it
422 is really "DOMString" (i.e. its internal representation
423 is "::DOMString::").
424
425 2008-08-03 Wakaba <wakaba@suika.fam.cx>
426
427 * WebIDL.pm ($resolve): New code, based on resolve code
428 for constant types in the |check| method.
429 (check): Support for checking of attributes, operations, and
430 arguments.
431 (Attribute/Operation idl_text): Exception names in getraises,
432 setraises, and raises clauses is serizlied by |$serialize_type|
433 code.
434
435 2008-08-02 Wakaba <wakaba@suika.fam.cx>
436
437 * WebIDL.pm ($integer): Order of selections are changed to match
438 hexadecimal numbers (the original pattern, taken from the spec,
439 was not work for hexadecimal numbers, because the "0" prefix
440 matches to the [0-7]* part (as an empty string) and therefore
441 it does not match with remaining "x..." part of a "0x..." integer
442 literal.
443 ($get_type): It now returns a string, not an array reference,
444 for regular types and |sequence| types (i.e. it in any case
445 returns a string).
446 ($get_next_token): The second item in the array that represents
447 a integer or float token is now a Perl number value, not the
448 original string representation of the number.
449 (check): Support for const value consistency checking.
450 No extended attribute is defined for constants.
451 (Node subclasses): Use simple strings rather than array references
452 for default data type values.
453 ($serialize_type): Type values are now simple strings.
454 (value): If the new attribute value is a false value, then
455 a FALSE value is set to the attribute.
456
457 2008-08-02 Wakaba <wakaba@suika.fam.cx>
458
459 * WebIDL.pm ($get_scoped_name): Now scoped names are stored
460 in its stringified format ("scoped name" as defined in the
461 spec). Note that future version of this module should not use
462 array references for type values and the |type_text| attribute
463 should be made obsolete.
464 (parse_char_string): Unescape attribute names.
465 (check): Support for checking of whether inherited interfaces
466 are actually defined or not. Support for checking of whether
467 interface member identifiers are duplicated or not.
468 ($serialize_type): Scoped names are returned as is. A future
469 version of this code should escape identifiers other than "DOMString",
470 otherwise the idl_text would be non-conforming.
471
472 2008-08-02 Wakaba <wakaba@suika.fam.cx>
473
474 * WebIDL.pm (parse_char_string): Set line/column numbers
475 to generated nodes. Unescape identifiers. Extended attributes
476 for Definition's were ignored.
477 (append_child): Set |parent_node| attribute.
478 (parent_node): New attribute.
479 (check): Support interface/exception members. Support
480 extended attributes. Support definition identifier uniqueness
481 constraint.
482 (qualified_name): New attribute.
483 (Interface/Exception idl_text): Extended attributes were
484 not prepended to the returned text.
485
486 2008-08-02 Wakaba <wakaba@suika.fam.cx>
487
488 * WebIDL.pm (parse_char_string): Set line/column numbers
489 to interface object experimentally. s/shift/pop/g, shift
490 would make things wrong. Support for interface forward
491 declarations was missing. Broken interface declarations
492 with no block were not ignored entirely.
493 (Whatpm::WebIDL::Node): New abstract class. This class
494 makes things easier.
495 (child_nodes): New attribute. Unlike DOM's attribute with
496 same name, this attribute returns a dead list of nodes for
497 simplicity.
498 (get_user_data, set_user_data): New methods.
499 (Module idl_text): A SPACE character should be inserted
500 before the |{| character.
501 (Interface idl_text): Support for interface forward declarations.
502 (is_forward_declaration): New attribute.
503
504 2008-07-19 Wakaba <wakaba@suika.fam.cx>
505
506 * WebIDL.pm (type_text): Better serializer.
507
508 2008-07-19 Wakaba <wakaba@suika.fam.cx>
509
510 * WebIDL.pm: Revise forward-compatible parsing so that
511 it now can handle broken extended attributes and as such.
512
513 2008-07-19 Wakaba <wakaba@suika.fam.cx>
514
515 * WebIDL.pm: Real support for extended attributes.
516 Support for extended attributes with arguments.
517
518 2008-07-19 Wakaba <wakaba@suika.fam.cx>
519
520 * WebIDL.pm: Support for |exception| syntax.
521 (Interface->idl_text): Tentative support for inheritances.
522
523 2008-07-19 Wakaba <wakaba@suika.fam.cx>
524
525 * WebIDL.pm: Hierarchical scoped name support was broken.
526 Support for raises, setraises, and getraises syntaxes.
527
528 2008-07-18 Wakaba <wakaba@suika.fam.cx>
529
530 * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
531 proper support for types, extended attributes, and exceptions yet).
532 WebIDL parser, version 1 (no support for exceptions yet,
533 no proper support for extended attributes yet).
534
535 2008-07-09 Wakaba <wakaba@suika.fam.cx>
536
537 * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
538
539 2008-06-29 Wakaba <wakaba@suika.fam.cx>
540
541 * WebIDL.pm: Support for valuetype and const.
542
543 2008-06-29 Wakaba <wakaba@suika.fam.cx>
544
545 * WebIDL.pm: New module.
546
547 2008-06-15 Wakaba <wakaba@suika.fam.cx>
548
549 * Makefile (Entities.html): URI changed.
550
551 2008-06-08 Wakaba <wakaba@suika.fam.cx>
552
553 * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
554
555 2008-06-01 Wakaba <wakaba@suika.fam.cx>
556
557 * HTML.pm.src (_get_next_token): A parse error was missing.
558
559 2008-06-01 Wakaba <wakaba@suika.fam.cx>
560
561 * mklinktypelist.pl: rel=contact is no longer part of the HTML5
562 spec (commented out). (HTML5 revision 1711).
563
564 2008-05-25 Wakaba <wakaba@suika.fam.cx>
565
566 * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
567
568 * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
569 as UTF-16 (HTML5 revision 1701).
570
571 2008-05-25 Wakaba <wakaba@suika.fam.cx>
572
573 * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
574 1692).
575
576 2008-05-25 Wakaba <wakaba@suika.fam.cx>
577
578 * HTML.pm.src: The secondary insertion mode used when switching
579 to foreign content is the "in body" insertion mode (HTML5 revision
580 1696).
581
582 2008-05-25 Wakaba <wakaba@suika.fam.cx>
583
584 * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
585 revision 1697).
586
587 2008-05-24 Wakaba <wakaba@suika.fam.cx>
588
589 * HTML.pm.src: Support for end-of-file token in foreign content
590 insertion mode (HTML5 revision 1693). Update SVG camelCase
591 attribute list (HTML5 revision 1700). <textarea> closes
592 </select> (HTML5 revision 1699). More start tags close in
593 foreign content insertion mode (HTML5 revision 1698).
594
595 2008-05-24 Wakaba <wakaba@suika.fam.cx>
596
597 * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
598
599 2008-05-24 Wakaba <wakaba@suika.fam.cx>
600
601 * HTML.pm.src: More robust charset parameter detection (HTML5
602 revision 1674).
603
604 2008-05-24 Wakaba <wakaba@suika.fam.cx>
605
606 * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
607 revision 1676).
608
609 2008-05-24 Wakaba <wakaba@suika.fam.cx>
610
611 * HTML.pm.src: Ignore language part of public identifiers for
612 quriks mode detection (HTML5 revision 1679).
613
614 2008-05-24 Wakaba <wakaba@suika.fam.cx>
615
616 * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
617 revision 1685).
618
619 2008-05-24 Wakaba <wakaba@suika.fam.cx>
620
621 * HTML.pm.src: Support for EOF in new states for tags (HTML5
622 revision 1684).
623
624 2008-05-24 Wakaba <wakaba@suika.fam.cx>
625
626 * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
627 work (HTML5 revision 1690).
628
629 2008-05-24 Wakaba <wakaba@suika.fam.cx>
630
631 * HTML.pm.src (_tree_construction_main): Change handling of
632 end tags in head insertion modes (HTML5 revision 1686).
633 (parse_char_string): Bug fix for non-utf8 character string handlings.
634 (parse_char_stream): |ungetc| does not work well for this context.
635
636 2008-05-18 Wakaba <wakaba@suika.fam.cx>
637
638 * HTML.pm.src (parse_byte_string): Redefined to invoke
639 |parse_byte_stream|.
640 (parse_byte_stream): New method.
641
642 2008-05-18 Wakaba <wakaba@suika.fam.cx>
643
644 * HTML.pm.src (parse_byte_string): Fix the column number reported
645 by encoding layer error reporter.
646
647 2008-05-17 Wakaba <wakaba@suika.fam.cx>
648
649 * HTML.pm.src (parse_byte_string): Use streaming decoder
650 rather than converting the whole byte string and then parsing.
651 Propagate errors in character encoding layer.
652 (get_next_token): Precise error reporting for |bare stago| error.
653
654 2008-05-17 Wakaba <wakaba@suika.fam.cx>
655
656 * HTML.pm.src (parse_char_stream): New method.
657 (parse_char_string): This method is now defined as an invocation
658 of the |parse_char_stream| method.
659
660 2008-05-17 Wakaba <wakaba@suika.fam.cx>
661
662 * HTML.pm.src (parse_byte_string): Report various status
663 of the sniffing as info-level errors. Support for new
664 decoding framework in parser resestting.
665 (new): Various default error levels were not set.
666
667 2008-05-17 Wakaba <wakaba@suika.fam.cx>
668
669 * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
670 algorithm, except for the actual sniffing, is implemented
671 with new framework with Message::Charset::Info.
672
673 2008-05-16 Wakaba <wakaba@suika.fam.cx>
674
675 * CacheManifest.pm (_parse): Drop fragment identifiers from
676 URIs in fallback section (HTML5 revision 1596).
677
678 2008-05-10 Wakaba <wakaba@suika.fam.cx>
679
680 * Makefile (Entities.html): URI has changed.
681
682 2008-05-10 Wakaba <wakaba@suika.fam.cx>
683
684 * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
685 1553).
686
687 2008-05-06 Wakaba <wakaba@suika.fam.cx>
688
689 * ContentChecker.pm: Noted that those returned in |table| are
690 no longer table elements, but table objects returned
691 by Whatpm::HTMLTable.
692
693 * HTMLTable.pm (form_table): Return table element node
694 as |$table->{element}|.
695 (assign_header): Support for the |headers=""| attribute.
696
697 2008-05-06 Wakaba <wakaba@suika.fam.cx>
698
699 * HTMLTable.pm (assign_header): New function; first version
700 with no support for headers="".
701 (form_table): Include table width and height to the returned
702 table object for covenience. Indexing in column assignement was wrong.
703 Set whether a data cell is empty or not for convenience.
704
705 2008-05-05 Wakaba <wakaba@suika.fam.cx>
706
707 * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
708
709 2008-05-05 Wakaba <wakaba@suika.fam.cx>
710
711 * HTMLTable.pm: How table model errors are detected is
712 changed (HTML5 revision 1387).
713
714 2008-05-05 Wakaba <wakaba@suika.fam.cx>
715
716 * HTMLTable.pm: The algorithm now moves |tfoot| elements
717 to the end of the table (HTML5 revision 1380).
718
719 2008-05-05 Wakaba <wakaba@suika.fam.cx>
720
721 * HTMLTable.pm: The algorithm is now 0-based indexing, instead
722 of 1-based (HTML5 revision 1376).
723
724 2008-05-05 Wakaba <wakaba@suika.fam.cx>
725
726 * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
727 and "Content-Encoding" no longer prevent sniffing (HTML5
728 revision 1288).
729
730 2008-05-05 Wakaba <wakaba@suika.fam.cx>
731
732 * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
733 1282).
734
735 2008-05-03 Wakaba <wakaba@suika.fam.cx>
736
737 * ContentChecker.pm: Support for global attributes.
738 Status of XML specs are added.
739
740 2008-05-03 Wakaba <wakaba@suika.fam.cx>
741
742 * ContentChecker.pm (check_element): Support for |template=""|
743 and |ref=""| attribute (referent element type checking).
744
745 2008-04-29 Wakaba <wakaba@suika.fam.cx>
746
747 * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
748 1500) is implemented (except for IDNA part and URI-scheme-specific
749 knowledge). Line number counting was wrong for LF-only
750 documents.
751
752 2008-04-13 Wakaba <wakaba@suika.fam.cx>
753
754 * HTML.pm.src: Raise an parse error for any disallowed
755 character (HTML5 revision 1263).
756
757 2008-04-13 Wakaba <wakaba@suika.fam.cx>
758
759 * mkentitylist.pl: Support for new HTML5 entity table format (the
760 definition for |AElig;| was missing).
761
762 2008-04-13 Wakaba <wakaba@suika.fam.cx>
763
764 * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
765 name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
766
767 2008-04-13 Wakaba <wakaba@suika.fam.cx>
768
769 * HTML.pm.src: List of element names that close foreign content
770 insertion mode is added (HTML5 revisions 1412 and 1418).
771
772 2008-04-13 Wakaba <wakaba@suika.fam.cx>
773
774 * HTML.pm.src: Support for |mglyph| and |malignmark|
775 elements (HTML5 revision 1410).
776
777 2008-04-13 Wakaba <wakaba@suika.fam.cx>
778
779 * HTML.pm.src: Support for new long MathML entities (HTML5
780 revision 1406).
781
782 2008-04-13 Wakaba <wakaba@suika.fam.cx>
783
784 * HTML.pm.src: CDATA section support for MathML and SVG
785 elements (HTML5 revision 1404 and 1420).
786
787 2008-04-12 Wakaba <wakaba@suika.fam.cx>
788
789 * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
790 elements (HTML5 revision 1404). Unused !!!macro definitions
791 are removed.
792
793 2008-04-12 Wakaba <wakaba@suika.fam.cx>
794
795 * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
796 are raised is changed (HTML5 revision 1404).
797
798 2008-04-06 Wakaba <wakaba@suika.fam.cx>
799
800 * HTML.pm.src: Category sets that are no longer used are removed.
801
802 2008-04-06 Wakaba <wakaba@suika.fam.cx>
803
804 * HTML.pm.src: The ->[1] property of stack entries are now
805 replaced by constants representing element category.
806
807 2008-04-06 Wakaba <wakaba@suika.fam.cx>
808
809 * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
810 for error reporting. (This is a preparation for using constant
811 value for ->[1].)
812
813 2008-03-22 Wakaba <wakaba@suika.fam.cx>
814
815 * RDFXML.pm: Typo fixed.
816
817 2008-03-22 Wakaba <wakaba@suika.fam.cx>
818
819 * ContentChecker.pm: |fact_level| is now treated
820 as same as |must_level|, i.e. level = |m|.
821 (check_element): Make list of URIs in the DOM.
822
823 2008-03-21 Wakaba <wakaba@suika.fam.cx>
824
825 * RDFXML.pm: Language accessor implemented.
826 Local (null-namespace) attribute support.
827
828 2008-03-21 Wakaba <wakaba@suika.fam.cx>
829
830 * RDFXML.pm: Factored out ID checking code.
831
832 2008-03-21 Wakaba <wakaba@suika.fam.cx>
833
834 * RDFXML.pm: TODO items noted. Validation of ID
835 and URI attributes is implemented. Warn if unknown
836 value is used in rdf:parseType="" attribute.
837
838 * URIChecker.pm (check_rdf_uri_reference): New function.
839
840 2008-03-21 Wakaba <wakaba@suika.fam.cx>
841
842 * RDFXML.pm: bnodeid implemented. Relative references
843 are now resolved.
844
845 2008-03-21 Wakaba <wakaba@suika.fam.cx>
846
847 * ContentChecker.pm: RDF reification implemented.
848
849 * RDFXML.pm: undef vs false bug fixed.
850 Reification implemented.
851
852 2008-03-21 Wakaba <wakaba@suika.fam.cx>
853
854 * RDFXML.pm: s/id/ID/ for attribute name.
855 The |node| arguments are added for |ontriple| calls.
856 Too many "attribute not allowed" errors were raised.
857
858 * ContentChecker.pm: Initial experimental support for rdf:RDF
859 element.
860
861 2008-03-21 Wakaba <wakaba@suika.fam.cx>
862
863 * RDFXML.pm: New module.
864
865 2008-03-20 Wakaba <wakaba@suika.fam.cx>
866
867 * HTML.pm.src (set_inner_html): Line/column number
868 code was old one yet.
869
870 2008-03-20 Wakaba <wakaba@suika.fam.cx>
871
872 * HTML.pm.src: Better line/column reporting for "duplicate attribute"
873 errors. Line/column markings for DOCTYPE, comment, and
874 character tokens are reintroduced; otherwise,
875 error location for "not HTML5" error and errors
876 for implied elements are not attached.
877
878 2008-03-20 Wakaba <wakaba@suika.fam.cx>
879
880 * HTML.pm.src: Set line/column numbers to attributes.
881
882 * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
883 (value): Setter implemented.
884
885 * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
886
887 2008-03-20 Wakaba <wakaba@suika.fam.cx>
888
889 * HTML.pm.src: Unused line/column markings are removed.
890
891 2008-03-20 Wakaba <wakaba@suika.fam.cx>
892
893 * HTML.pm.src (_get_next_token): Remove |first_start_tag|
894 flag, which is no longer used.
895
896 2008-03-17 Wakaba <wakaba@suika.fam.cx>
897
898 * HTML.pm.src: Set line/column information to element nodes.
899
900 * mkhtmlparser.pl (!!!create-element, !!!insert-element,
901 and !!!insert-element-t): Set line/column information to
902 element nodes.
903
904 2008-03-17 Wakaba <wakaba@suika.fam.cx>
905
906 * HTML.pm.src (_get_next_token): The first "<" character
907 in "<?", "<>", or "</>" should be the error point.
908
909 2008-03-16 Wakaba <wakaba@suika.fam.cx>
910
911 * HTML.pm.src: Some more fixes on error position reporting.
912
913 2008-03-16 Wakaba <wakaba@suika.fam.cx>
914
915 * HTML.pm.src: Token-level precious error reporting.
916
917 2008-03-16 Wakaba <wakaba@suika.fam.cx>
918
919 * HTML.pm.src: Preparation for more precious error point
920 reporting.
921
922 2008-03-11 Wakaba <wakaba@suika.fam.cx>
923
924 * HTML.pm.src: Error type revised.
925
926 2008-03-11 Wakaba <wakaba@suika.fam.cx>
927
928 * HTML.pm.src: Similar codes are merged together, again.
929
930 2008-03-11 Wakaba <wakaba@suika.fam.cx>
931
932 * HTML.pm.src: Similar codes are merged together.
933
934 2008-03-10 Wakaba <wakaba@suika.fam.cx>
935
936 * mkhtmlparser.pl: Set "level" parameter to parse errors.
937
938 * HTML.pm.src: Code refined.
939
940 2008-03-09 Wakaba <wakaba@suika.fam.cx>
941
942 * HTML.pm.src: |</body>| treatement has been changed (HTML5
943 revision 1348). Note that I really don't know this makes
944 any difference in the black-box behavior of the parser.
945
946 2008-03-09 Wakaba <wakaba@suika.fam.cx>
947
948 * HTML.pm.src: New end-of-file token implementation (HTML5
949 revision 1348).
950
951 2008-03-09 Wakaba <wakaba@suika.fam.cx>
952
953 * HTML.pm.src: |applet| support (HTML5 revision 1347).
954
955 2008-03-09 Wakaba <wakaba@suika.fam.cx>
956
957 * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
958
959 2008-03-09 Wakaba <wakaba@suika.fam.cx>
960
961 * HTML.pm.src: Support for |<input>| in the "in select" insertion
962 mode and support for the "in select in table" insertion mode (HTML5
963 revision 1342).
964
965 2008-03-09 Wakaba <wakaba@suika.fam.cx>
966
967 * HTML.pm.src: No foster parenting for <script> and <script>
968 in non-tainted <table>s (HTML5 revision 1336).
969
970 2008-03-09 Wakaba <wakaba@suika.fam.cx>
971
972 * HTML.pm.src: Ignore white space characters between <html>
973 and <head> (HTML5 revision 1332).
974
975 2008-03-09 Wakaba <wakaba@suika.fam.cx>
976
977 * HTML.pm.src: Treat <input type=hidden> as if it were a
978 white space (HTML5 revision 1331).
979
980 2008-03-08 Wakaba <wakaba@suika.fam.cx>
981
982 * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
983 element (HTML5 revision 1330).
984
985 2008-03-08 Wakaba <wakaba@suika.fam.cx>
986
987 * HTML.pm.src: <title> is always appended to the current
988 element (HTML5 revision 1328).
989
990 2008-03-08 Wakaba <wakaba@suika.fam.cx>
991
992 * HTML.pm.src: White space in tainted tables are moved
993 into foster parents (HTML5 revision 1326).
994
995 2008-03-08 Wakaba <wakaba@suika.fam.cx>
996
997 * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
998 revision 1321).
999
1000 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1001
1002 * HTML.pm.src: |</h/n/>| case code rearranged to align with
1003 the spec (HTML5 revision 1320). Note that we finally complete
1004 all of HTML5 revision 1320 changes.
1005
1006 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1007
1008 * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
1009 tags (HTML5 revision 1320).
1010
1011 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1012
1013 * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
1014
1015 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1016
1017 * HTML.pm.src: A "generate implied end tags" code (t409.1)
1018 could not be reached so that it is now removed (HTML5 revision
1019 1320).
1020
1021 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1022
1023 * HTML.pm.src: Code for the case of |</div>| and so on are revised
1024 to align with new spec text (HTML5 revision 1320).
1025
1026 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1027
1028 * HTML.pm.src: Remove strange |if| condition; however, it should
1029 have had no harm in theory.
1030
1031 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1032
1033 * HTML.pm.src (_tree_construction_main): '</p>' in body
1034 case is split from other end tags for the preparation
1035 of implementing HTML revision 1320.
1036
1037 2008-03-07 Wakaba <wakaba@suika.fam.cx>
1038
1039 * HTML.pm.src: Simplified "generate implied end tag" (HTML5
1040 revision 1320).
1041
1042 2008-03-07 Wakaba <wakaba@suika.fam.cx>
1043
1044 * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
1045 and "div" (HTML5 revision 1318). Add comments to where
1046 |form| pointer association codes should be inserted (HTML5 revision
1047 1319).
1048
1049 2008-03-06 Wakaba <wakaba@suika.fam.cx>
1050
1051 * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
1052
1053 2008-03-05 Wakaba <wakaba@suika.fam.cx>
1054
1055 * HTML.pm.src: Since the case t268 should never be reached (no
1056 other token type, there are), it is replaced by a |die| statement.
1057
1058 2008-03-05 Wakaba <wakaba@suika.fam.cx>
1059
1060 * HTML.pm.src: Typo fixed.
1061
1062 2008-03-04 Wakaba <wakaba@suika.fam.cx>
1063
1064 * HTML.pm.src (_tree_construction_initial): Some limited quirks
1065 doctypes were not uppercased for comparison.
1066
1067 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1068
1069 * HTML.pm.src (tree construction and set_inner_html): Checkpoints
1070 are added.
1071
1072 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1073
1074 * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
1075 are set. Cases that are unlikely reached are noted as so.
1076
1077 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1078
1079 * HTML.pm.src: Checkpoints for debugging are added.
1080
1081 * mkhtmlparser.pl: Support for |!!!cp| syntax.
1082
1083 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1084
1085 * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
1086 for simplicity.
1087
1088 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1089
1090 * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
1091 revision 1307).
1092
1093 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1094
1095 * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
1096 is set are changed (HTML5 revision 1305).
1097
1098 2008-03-02 Wakaba <wakaba@suika.fam.cx>
1099
1100 * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
1101 1292). Entities are not parsed in comment-like part in RCDATA
1102 elements (HTML5 revision 1294). Allow bare & at the end
1103 of attribute value literals (HTML5 revision 1296). More
1104 quirks mode doctypes (HTML5 revision 1302). Requires spaces
1105 between attributes and ban attribute names or unquoted
1106 attribute values containing single or double quotes (HTML5
1107 revision 1303).
1108
1109 2008-03-02 Wakaba <wakaba@suika.fam.cx>
1110
1111 * ContentChecker.pm: Typo fixed. Don't raise "character encoding"
1112 and related errors unless it is an HTML document (though the spec
1113 is unclear on whether it is applied to XHTML document).
1114
1115 * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
1116 are added.
1117
1118 2008-03-02 Wakaba <wakaba@suika.fam.cx>
1119
1120 * HTML.pm.src: s/local_name/manakai_local_name/g.
1121
1122 2008-03-01 Wakaba <wakaba@suika.fam.cx>
1123
1124 * _NamedEntityList.pm: Updated (HTML5 revision 1286).
1125
1126 * HTML.pm.src: |charset| in |content| attribute is
1127 case-insensitive (HTML5 revision 1270).
1128
1129 2008-02-26 Wakaba <wakaba@suika.fam.cx>
1130
1131 * ContentChecker.pm: New status constants are added.
1132 ($ElementDefault): |status| added.
1133 (check_element): Err for non-standard or deprecated elements.
1134 (_attr_status_info): For non-standard or deprecated attributes.
1135
1136 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1137
1138 * ContentChecker.pm (_attr_status_info): New internal method.
1139
1140 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1141
1142 * ContentChecker.pm (check_element): Element standardized
1143 status information is now dispatched.
1144
1145 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1146
1147 * ContentChecker.pm (check_element): Fix |del|-and-significant
1148 problem by adding some more arguments.
1149
1150 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1151
1152 * ContentChecker.pm (check_element): Use context of
1153 container-for-the-purpose-of-content-model element (not transparent
1154 element) for |check_child_element| calling and significant
1155 text flag marking. This reintroduces |<del>|-and-significant
1156 problem again.
1157
1158 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1159
1160 * ContentChecker.pm (check_element): Make semi-transparent
1161 elements ignored for the purpose of phase changes in
1162 content model checking.
1163
1164 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1165
1166 * ContentChecker.pm (check_element): In-element state
1167 was not properly managed for transparent cases.
1168
1169 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1170
1171 * ContentChecker.pm (check_element): Support for |video|
1172 and |audio| as semi-transparent elements.
1173
1174 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1175
1176 * ContentChecker.pm ($HTMLSemiTransparentElements): New.
1177 (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
1178 Support for |html:object| as a semi-transparent element.
1179
1180 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1181
1182 * ContentChecker.pm (check_element): The way to traverse
1183 the tree is entirely revised to make it easier to track
1184 the state of ancestors/descendants. As a result of this
1185 revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1186 support for content model checking for HTML elements |figure|,
1187 |object|, |video|, and |audio| and checking for XML elements (and
1188 some XMLNS checkings) are dropped for now. They will be
1189 reimplemented in due cource.
1190
1191 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1192
1193 * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1194 token (HTML5 revision 1225).
1195
1196 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1197
1198 * ContentChecker.pm ({unsupported_level}): New value.
1199
1200 * HTML.pm.src: Save whether |meta| |content| attribute
1201 contains character references or not.
1202
1203 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1204
1205 * ContentChecker.pm (_get_children): (Incompleted) attempt to
1206 imlement significant content checking for contents
1207 with |del| elements.
1208
1209 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1210
1211 * ContenteChecker.pm ($HTMLTransparentElements): More
1212 elements are added.
1213 (_get_children): HTML |object| elements are now semi-transparent.
1214
1215 * NanoDOM.pm (manakai_html, manakai_head): New methods.
1216
1217 2008-02-16 Wakaba <wakaba@suika.fam.cx>
1218
1219 * CacheManifest.pm: HTML5 revision 1211 implemented.
1220
1221 * CacheManifest.pod: Updated.
1222
1223 2008-02-10 Wakaba <wakaba@suika.fam.cx>
1224
1225 * ContentChecker.pm (check_document, check_element): Support
1226 for second argument ($onsubdoc).
1227 (_get_css_parser): Removed (now it is part of WDCC).
1228
1229 2008-02-09 Wakaba <wakaba@suika.fam.cx>
1230
1231 * ContentChecker.pm (_get_css_parser): New.
1232
1233 2007-11-25 Wakaba <wakaba@suika.fam.cx>
1234
1235 * ContentChecker.pm ($AnyChecker): Old way to add child elements
1236 for checking had been used.
1237
1238 2007-11-25 Wakaba <wakaba@suika.fam.cx>
1239
1240 * ContentChecker.pm (check_element): New todo item type |descendant|.
1241
1242 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1243
1244 * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1245 specced in RFC 2046 and RFC 4288.
1246 (application/atom+xml): Definition added.
1247
1248 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1249
1250 * URIChecker.pm: Make RFC 3986 should-level errors
1251 warnings (rather than SHOULD-level errors).
1252
1253 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1254
1255 * NanoDOM.pm (get_user_data, set_user_data): New methods.
1256
1257 * HTML.pm.src: A flag for character references in attribute
1258 values are added. Set |manakai_has_reference| user data
1259 to |charset| attribute.
1260
1261 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1262
1263 * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1264 attributes.
1265
1266 * ContentChecker.pm (check_document): Warn if charset requirements
1267 cannot be tested.
1268
1269 2007-11-19 Wakaba <wakaba@suika.fam.cx>
1270
1271 * HTML.pm.src (parse_byte_string): Detect charset
1272 by universalchardet if charset parameter is not specified.
1273
1274 * Makefile (Charset-all, Charset-clean): New rules.
1275
1276 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1277
1278 * ContentChecker.pm (check_document): Check the existence
1279 of character encoding declaration and selection of encoding
1280 for HTML document.
1281
1282 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1283
1284 * ContentType.pm (get_sniffed_type): Return also the official
1285 type in list context.
1286
1287 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1288
1289 * ContentType.pm: Sniffing with leading white space
1290 ignoring (HTML5 revisions 1013 and 1016).
1291
1292 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1293
1294 * ContentType.pm: HTML5 revision 1013 changes, except for leading
1295 white spaces, are implemented.
1296
1297 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1298
1299 * HTML.pm.src (parse_char_string): Set |inner_encoding|
1300 attribute if possible.
1301
1302 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1303
1304 * HTML.pm.src (parse_byte_string): New method.
1305 (parse_char_string): New alias for |parse_string|.
1306 (main phase): Invoking "change the encoding" algorithm if desired.
1307
1308 * HTML.pod: Updated.
1309
1310 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1311
1312 * HTML.pod (get_inner_html): Removed.
1313
1314 * Makefile (HTML-all, HTML-clean): New.
1315
1316 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1317
1318 * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1319
1320 2007-11-08 Wakaba <wakaba@suika.fam.cx>
1321
1322 * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1323
1324 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1325
1326 * Makefile: |CacheManifest.html| is added.
1327
1328 * CacheManifest.pod: New file.
1329
1330 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1331
1332 * CacheManifest.pm: New module.
1333
1334 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1335
1336 * HTML.pm.src: Support for application cache selection algorithm
1337 callback.
1338
1339 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1340
1341 * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1342
1343 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1344
1345 * Makefile (clean): New rule.
1346
1347 * NanoDOM.pm (public_id, system_id): New attributes.
1348
1349 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1350
1351 * Makefile (CSS-all, CSS-clean, clean): New rules.
1352
1353 2007-10-14 Wakaba <wakaba@suika.fam.cx>
1354
1355 * ContentChecker.pm (check_document): Support for
1356 new |is_xml_root| flag.
1357 (check_element): Support for new |pluses| state.
1358 (_add_pluses): New method.
1359 (_remove_minuses): Support for new |minus| item.
1360
1361 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1362
1363 * ContentChecker.pm: Raise specific error for invalid
1364 root element.
1365
1366 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1367
1368 * ContentChecker.pm: Set level values for later uses.
1369
1370 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1371
1372 * ContentChecker.pm: Support for language tag validation.
1373
1374 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1375
1376 * LangTag.pm (check_rfc3066_language_tag): New method.
1377
1378 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1379
1380 * LangTag.pm: New module.
1381
1382 2007-09-04 Wakaba <wakaba@suika.fam.cx>
1383
1384 * HTML.pm.src: Some error types were wrong.
1385
1386 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1387
1388 * CSS/: New directory.
1389
1390 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1391
1392 * ContentChecker.pm (_check_get_children): Support
1393 for |noscript| in |head|.
1394
1395 2007-08-12 Wakaba <wakaba@suika.fam.cx>
1396
1397 * URI/: New directory.
1398
1399 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1400
1401 * HTML.pm.src: Tokenizer's states are now represented in
1402 number.
1403
1404 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1405
1406 * HTML.pm.src: |or|s for insertion modes are replaced
1407 by |&|s.
1408
1409 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1410
1411 * HTML.pm.src: Token types are now represented in number.
1412
1413 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1414
1415 * ContentType.pm (SEE ALSO): Updated.
1416
1417 * HTML.pm.src: Insertion modes are now represented in number.
1418
1419 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1420
1421 * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1422 999) is implemented.
1423
1424 2007-08-08 Wakaba <wakaba@suika.fam.cx>
1425
1426 * ContentType.pm: Sniffing for |<script| (HTML5 revision
1427 983) is implemented.
1428
1429 2007-08-06 Wakaba <wakaba@suika.fam.cx>
1430
1431 * ContentChecker.pod: New documentation.
1432
1433 * Makefile: A rule for |ContentChecker.html| is added.
1434
1435 * ContentChecker.pm: A pod "LICENSE" section is added.
1436
1437 * NanoDOM.pm ($VERSION): New variable.
1438
1439 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1440
1441 * H2H.pm: |b|, |i|, and |sub| are added to the
1442 list of allowed HTML elements.
1443
1444 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1445
1446 * H2H.pm: |samp| is added to the list of allowed
1447 HTML elements.
1448
1449 * URIChecker.pm (check_iri): New.
1450 (check_iri_reference): Error type for IRI reference
1451 syntax error is changed.
1452
1453 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1454
1455 * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1456 is added.
1457 (check_document): Load appropriate module before validation.
1458
1459 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1460
1461 * ContentChecker/: New directory.
1462
1463 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1464
1465 * ContentChecker.pm: HTML |time| element is implemented.
1466
1467 * HTMLTable.pm: Comments are updated as HTML5 is revised.
1468
1469 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1470
1471 * ContentChecker.pm (check_document): Return value
1472 even if no document element is found.
1473
1474 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1475
1476 * HTML.pm.src: |$in_body| is no longer a function.
1477
1478 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1479
1480 * HTML.pm.src: The |$in_body| code has been moved down.
1481
1482 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1483
1484 * HTML.pm.src: The "trailing end" insertion mode
1485 is split into "after html body" and "after html frameset"
1486 insertion modes. Their codes are merged with "after body"
1487 and "after frameset" codes. |$previous_insertion_mode|
1488 has been removed. "after frameset" code is
1489 merged with "in frameset" code.
1490
1491 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1492
1493 * HTML.pm.src: The "before head" insertion mode is
1494 merged with the "in head" insertion mode.
1495
1496 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1497
1498 * HTML.pm.src: Cases in "in head" insertion mode are
1499 reorganized.
1500
1501 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1502
1503 * HTML.pm.src: Some cases in "in table" insertion mode
1504 are merged.
1505
1506 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1507
1508 * HTML.pm.src: The "in row" insertion mode is merged
1509 with "in table" insertion mode.
1510
1511 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1512
1513 * HTML.pm.src: The "in table" and "in table body" insertion
1514 modes are merged.
1515
1516 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1517
1518 * HTML.pm.src: There is no "in table head" or "in table foot"
1519 insertion mode!
1520
1521 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1522
1523 * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
1524 now directly invoke the handler.
1525
1526 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1527
1528 * HTML.pm.src: Codes for "in cell" insertion mode
1529 is merged to the "in body" insertion mode code.
1530
1531 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1532
1533 * HTML.pm.src: Codes for "in body" and "in caption"
1534 insertion modes are merged.
1535
1536 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1537
1538 * HTML.pm.src: Two |!!!next-token|s were missing.
1539
1540 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1541
1542 * HTML.pm.src: Use numeric constant for |{content_mode}|
1543 instead of string constant for |{content_model_flag}|.
1544
1545 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1546
1547 * HTML.pm.src: Add the name of the attribute
1548 to the "duplicate attribute" error.
1549
1550 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1551
1552 * ContentChecker.pm: Return the |class| node list.
1553
1554 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1555
1556 * ContentChecker.pm: Return the |id| node list.
1557
1558 * HTML.pm.src: A typo is fixed.
1559
1560 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1561
1562 * ContentChecker.pm: Drop wrong |level => 'error'| specification
1563 from "in HTML:xml:lang" error. Character position
1564 is now the last part of the error type in the URI error
1565 description. Report "unsupported" status for language
1566 tags, media queries, script codes, and style sheets.
1567
1568 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1569
1570 * ContentChecker.pm: Report error if |xml:lang|
1571 in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
1572 in XML.
1573
1574 * NanoDOM.pm (Attr.owner_document): New attribute.
1575
1576 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1577
1578 * HTML.pm.src: The character immediately following
1579 a bare |hcro| was discarded. Fix handling of
1580 entity references in attribute values.
1581
1582 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1583
1584 * HTML.pm.src (main and trailin end phases): Token
1585 types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
1586 factored out. Error types |in html:#DOCTYPE|
1587 and |after html:#DOCTYPE| are merged
1588 into |DOCTYPE in the middle|. |</frameset>|
1589 in fragment parsing mode changed the
1590 insertion mode.
1591
1592 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1593
1594 * HTML.pm.src: |$phase| has been removed; The |trailing end|
1595 phase is now an insertion mode. Treatments for white
1596 space character tokens were incorrect for some
1597 insertion modes. An old |meta| case was not removed.
1598
1599 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1600
1601 * HTML.pm.src: |meta| charset declaration extraction
1602 implemented (but changing the encoding is not yet:-).
1603
1604 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1605
1606 * Charset/: New directory.
1607
1608 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1609
1610 * H2H.pm: New Perl module (created from
1611 manakai's H2H.dis).
1612
1613 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1614
1615 * XMLSerializer.pm: New Perl module (created from
1616 manakai's SimpleLS.dis).
1617
1618 2007-07-07 Wakaba <wakaba@suika.fam.cx>
1619
1620 * HTML.pm.src: |<!---x-->| was not processed correctly.
1621
1622 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1623
1624 * HTML.pm.src: Report correct error message
1625 for |<body></div></body>|.
1626
1627 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1628
1629 * HTMLTable.pm: An error description was incorrect.
1630
1631 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1632
1633 * ContentChecker.pm: Return |{term}| list.
1634
1635 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1636
1637 * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
1638 nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
1639 and <title> outside of head).
1640
1641 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1642
1643 * IMTChecker.pm: Report warning for unregistered
1644 and private types/subtypes.
1645
1646 * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
1647 URIChecker.pm, HTMLTable.pm: Error messages are now
1648 consistent; they are all listed in
1649 <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
1650
1651 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1652
1653 * ContentChecker.pm: |<img ismap>| not in |<a></a>|
1654 is now erred. |<datalist>| is implemented.
1655 Attribute checker for |<command>| and |<menu>| are
1656 added. Support for |contextmenu| global attribute
1657 is added.
1658
1659 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1660
1661 * HTML.pm.src (_reset_insertion_mode): Interpretation
1662 of Step 3 has been changed.
1663
1664 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1665
1666 * HTML.pm.src: Late |<html>| parse error is implemented.
1667
1668 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1669
1670 * URIChecker.pm (check_iri_reference): A |decode| method name was
1671 incorrect.
1672
1673 * ContentChecker.pm: Support for the |footer| element.
1674 Check URI syntax for space-separated URI attributes.
1675 Support for the |tabindex| attribute. Support
1676 for |datetime| attribute.
1677
1678 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1679
1680 * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
1681 character references). HTML5 revision 1.146 (white space
1682 characters before root start tag). HTML5 revision
1683 1.148 (named character references in attribute values).
1684 HTML5 revision 1.152 (<plaintext>.innerHTML get).
1685
1686 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1687
1688 * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
1689 in <head>).
1690
1691 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1692
1693 * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>
1694 in body).
1695
1696 * ContentChecker.pm: HTML5 revision 938 (scoped="").
1697
1698 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1699
1700 * HTML.pm.src: HTML5 revision 923 (matching end tag in CDATA
1701 or RCDATA in fragment parsing mode). HTML5 revision
1702 924 (<!--> and <!--->). HTML5 revision 926 (hn in hn).
1703
1704 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1705
1706 * HTML.pm.src (get_inner_html): HTML5 revision 922 (inner_html
1707 for <pre> and <textarea>).
1708
1709 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1710
1711 * HTML.pm.src: HTML5 revision 920 (<isindex>).
1712
1713 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1714
1715 * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).
1716
1717 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1718
1719 * HTML.pm.src: HTML5 revision 916 (</body>).
1720 HTML5 revision 917 (conforming bare &).
1721
1722 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1723
1724 * NanoDOM.pm (manakai_is_html): Setting to false did not work.
1725
1726 * HTML.pm.src: HTML5 revision 914 (</ in CDATA, RCDATA).
1727 HTML5 revision 915 (<nobr>).
1728
1729 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1730
1731 * HTML.pm.src: HTML5 revisions 908, 909, 912, and 913 (quirks mode).
1732
1733 * NanoDOM.pm (manakai_is_html, manakai_compat_mode, compat_mode):
1734 New attributes.
1735
1736 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1737
1738 * HTML.pm.src: HTML5 revisions 900, 901, 902, and 911 (<
1739 in tags).
1740
1741 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1742
1743 * .cvsignore: |Entities.html| is added.
1744
1745 * HTML.pm.src: |$entity_char| is removed and
1746 requires |Whatpm::_NamedEntityList| instead.
1747 HTML5 revision 898 (refc), except that lack of refc
1748 is parse error.
1749
1750 * mkentitylist.pl: New script.
1751
1752 * Makefile (all): |_NamedEntityList.pm| is added.
1753 (_NamedEntityList.pm, Entities.html): New rules.
1754
1755 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1756
1757 * HTML.pm.src: Parse errors immediately after U+000D
1758 were ignored and U+000D immediately following another
1759 U+000D was not converted to U+000A.
1760
1761 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1762
1763 * HTML.pm.src (set_inner_html): HTML5 revision 892 (adopt
1764 nodes before appended). Parser was not ready for NULL
1765 parse error and escape flag.
1766
1767 * NanoDOM.pm (adopt_node): New.
1768
1769 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1770
1771 * HTML.pm.src: HTML5 revision 886 (insane comment in
1772 CDATA and RCDATA). Note that current implementation
1773 is simply repeating what the spec says and it is maybe not
1774 a best way to do it.
1775
1776 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1777
1778 * HTML.pm.src: HTML5 revision 884 (</form> don't close
1779 the form element if a descendant element without implied end tag has
1780 still been open).
1781
1782 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1783
1784 * ContentChecker.pm: HTML5 revision 881 (Make |id| attribute with
1785 space characters non-conforming).
1786
1787 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1788
1789 * HTML.pm.src: An error message was incorrect.
1790 HTML5 revision 869 (C1 character references).
1791
1792 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1793
1794 * HTML.pm.src: HTML5 revision 867 (a LF at the beginning of
1795 a |textarea| is removed).
1796
1797 2007-06-05 Wakaba <wakaba@suika.fam.cx>
1798
1799 * NanoDOM.pm (get_attribute_node_ns): New method.
1800
1801 * ContentChecker.pm: |script| |async| and |defer|
1802 no longer require |src|. |async| MUST NOT be
1803 specified if |defer|. (HTML5 revision 858).
1804
1805 2007-05-30 Wakaba <wakaba@suika.fam.cx>
1806
1807 * HTML.pm.src: |<form><form>| went to inifinite loop.
1808
1809 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1810
1811 * ContentChecker.pm (html): Set |is_root| (allowed
1812 as a document element) flag on.
1813 (new): Removed.
1814 (check_document): New method.
1815
1816 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1817
1818 * ContentChecker.pm (thead, tfoot): Checker specifications
1819 were incorrect.
1820
1821 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1822
1823 * ContentChecker.pm ($HTMLURIAttrChecker): Include
1824 error position in the |type| option of the error.
1825
1826 * HTMLTable.pm (form_table): The |$onerror| parameter
1827 is now optional. Some bugs are fixed.
1828
1829 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1830
1831 * HTMLTable.pm: New module.
1832
1833 * ContentChecker.pm (table): Invoke table model error checker.
1834
1835 * NanoDOM.pm (first_child, get_attribute_ns): New.
1836
1837 2007-05-26 Wakaba <wakaba@suika.fam.cx>
1838
1839 * ContentChecker.pm ($HTMLLinkTypesAttrChecker): New checker.
1840 (link/@rel, a/@rel, area/@rel): Use new checker.
1841
1842 * Makefile (_LinkTypeList.pm, RelExtensions.html): New rules.
1843
1844 * _LinkTypeList.pm: New file.
1845
1846 * mklinktypelist.pl: New file.
1847
1848 * .cvsignore: |RelExtensions.html| added.
1849
1850 * NanoDOM.pm (child_nodes): Returns an empty array
1851 for non-child-containing node types.
1852 (text_content): New attribute.
1853
1854 2007-05-26 Wakaba <wakaba@suika.fam.cx>
1855
1856 * IMTChecker.pm: New module.
1857
1858 * ContentChecker.pm ($HTMLIMTAttrChecker): Call IMTChecker
1859 to test parameter value validity.
1860
1861 * HTML.pm.src ($style_start_tag): Attributes were
1862 discarded.
1863
1864 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1865
1866 * ContentChecker.pm ($HTMLURIAttrChecker): Implemented.
1867
1868 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1869
1870 * URIChecker.pm: All recommendations from RFC 3986
1871 and RFC 3987 are listed (not all testable items are checked yet).
1872
1873 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1874
1875 * URIChecker.pm: New module.
1876
1877 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1878
1879 * ContentChecker.pm: Now most attributes are implemented
1880 or associated to some placeholder.
1881 ($ElementDefault): Warn unknown attributes
1882 for unknown elements as "attribute not supported".
1883 ($HTMLLanguageTagAttrChecker, $HTMLMQAttrChecker): New placeholders.
1884 ($HTMLUsemapAttrChecker, $HTMLTargetAttrChecker): New checkers.
1885 (|a| attribute checker): Reimplemented.
1886
1887 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1888
1889 * ContentChecker.pm ($HTMLEventHandlerAttrChecker): New placeholder.
1890 ($HTMLAttrChecker): Event handler content attributes
1891 are added.
1892 (link, embed): Required attribute is now checked.
1893 (embed): Unknown local attributes are no longer warned.
1894
1895 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1896
1897 * ContentChecker.pm ($HTMLSpaceURIsAttrChecker): New placeholder.
1898 ($HTMLIMTAttrChecker): New checker.
1899 (link@rel, link@href, link@type, style@type,
1900 a@href, a@ping, a@ping, a@type, embed@src, embed@type,
1901 object@data, object@type, source@src, source@type, area@alt,
1902 area@shape, area@coords, area@href,
1903 area@ping, area@rel, area@type, script@src,
1904 script@defer, script@async, script@type): Checkers added.
1905
1906 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1907
1908 * ContentChecker.pm: Descendant checking was incorrect.
1909
1910 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1911
1912 * ContentChecker.pm: Support |xml:*| and |xmlns:*|
1913 attributes. Report an error if |Element.prefix|
1914 is |xmlns|.
1915
1916 * NanoDOM.pm (prefix): New attribute.
1917
1918 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1919
1920 * HTML.pm.src: In |main| phase, |in body| insertion
1921 mode, action for |<iframe>| was missing.
1922
1923 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1924
1925 * ContentChecker.pm: Support for many of HTML5 elements.
1926 ($GetHTMLNonNegativeIntegerAttrChecker): New.
1927
1928 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1929
1930 * ContentChecker.pm: Support for most elements up to |progress|.
1931 ($HTMLURIAttrChecker): Placeholder.
1932 ($HTMLIntegerAttrChecker, $GetHTMLFloatingPointNumberAttrChecker):
1933 New.
1934
1935 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1936
1937 * ContentChecker.pm: Attribute checkers
1938 for global attributes, |html|, |base|, |style|, and |meta|.
1939
1940 * NanoDOM.pm (insert_before): Weaken reference
1941 to the parent node.
1942 (Attr::new): Set |owner_element| attribute.
1943 (namespace_uri, manakai_local_name): New attribute implementations.
1944 (owner_element): New attribute.
1945
1946 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1947
1948 * ContentChecker.pm ($AttrChecker, $HTMLAttrChecker,
1949 $AnyChecker->{attr_checker}, $HTMLAttrsChecker,
1950 $Element->{$HTML_NS}->{''}): New.
1951 (check_element): Invoke attrs_checker for each element.
1952
1953 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1954
1955 * ContentChecker.pm: Don't use |manakai_element_type_match|.
1956
1957 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1958
1959 * ContentChecker.pm: Use hashs rather than lists for
1960 element type testings.
1961
1962 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1963
1964 * ContentChecker.pm: Don't generate duplicate
1965 error when an element type is put in the "minus" list
1966 and the element type is not allowed explicitly in the particular
1967 element content model.
1968 (html:a checker): New checker.
1969 (html:details, html:datagrid): New checkers.
1970 (html:legend): New checker.
1971
1972 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1973
1974 * ContentChecker.pm (html:li checker): Implemented.
1975
1976 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1977
1978 * ContentChecker.pm ($HTMLInlineOrStriclyInlineChecker): New
1979 checker.
1980 (html:dd checker): New checker.
1981 (html:q, html:em, html:strong, html:small,
1982 html:m, html:dfn, html:code, html:samp, html:span): New checkers.
1983
1984 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1985
1986 * ContentChecker.pm ($AnyChecker): Renamed
1987 from |$ElementDefault->{checker}|.
1988 ($ElementDefault->{checker}): Throw an error that
1989 the element type is not supported by the checker.
1990 ($HTMLMetadataElement): |html:base| was missing.
1991 ($HTMLEmptyChecker): Don't throw an error
1992 for inter-element whitespace nodes.
1993 (html:html checker): Errors were not
1994 thrown even if |html:head| and/or |html:body|
1995 children were missing.
1996 (html:head checker): An error was not
1997 thrown if <meta charset> appered after other
1998 elements.
1999
2000 2007-05-05 Wakaba <wakaba@suika.fam.cx>
2001
2002 * ContentChecker.pm: |footer|, |video|, |audio|, |script|,
2003 and |noscript| elements are implemented.
2004 (new): New method.
2005
2006 2007-05-04 Wakaba <wakaba@suika.fam.cx>
2007
2008 * ContentChecker.pm: New module.
2009
2010 2007-05-04 Wakaba <wakaba@suika.fam.cx>
2011
2012 * NanoDOM.pm (manakai_parent_element,
2013 document_element, manakai_local_name,
2014 manakai_element_type_match): New method.
2015
2016 2007-05-03 Wakaba <wakaba@suika.fam.cx>
2017
2018 * HTML.pm.src: Replace decimal and hexadecimal numeric
2019 entities in C1 range using Windows-1252 mapping. Bare LF
2020 did not count as new line for error reporting.
2021
2022 2007-05-02 Wakaba <wakaba@suika.fam.cx>
2023
2024 * NanoDOM.pm (DOMImplementation): New class.
2025 (append_child): Weaken the |parent_node| reference.
2026 (create_element_ns, Element new): Set the |owner_document|
2027 reference.
2028 (implementation): New attribute.
2029 (owner_document, local_name, namespace_uri): New attributes.
2030
2031 * HTML.pm.src (parse_string): Line and column numbers
2032 are now provided to error handler.
2033 (!!!parse-error): Short descriptions are added.
2034 (_construct_tree): Split into three methods; support
2035 for innerHTML mode.
2036 (set_inner_html): New method.
2037
2038 2007-05-01 Wakaba <wakaba@suika.fam.cx>
2039
2040 * NanoDOM.html: Documentation is added.
2041
2042 * HTML.pod, ContentType.html: Documentation is revised.
2043
2044 * .cvsignore: Pod2html temporary files are added.
2045
2046 * Makefile: Make |NanoDOM.html|.
2047
2048 2007-05-01 Wakaba <wakaba@suika.fam.cx>
2049
2050 * HTML.pm.src (parse_string): New method.
2051 (get_inner_html): Renamed from |inner_html|.
2052
2053 * Makefile: A rule for |HTML.html| is added.
2054
2055 * HTML.pod: New documentation.
2056
2057 2007-05-01 Wakaba <wakaba@suika.fam.cx>
2058
2059 * NanoDOM.pm (last_child, previous_sibling): New attributes.
2060 (clone_node): Attribute nodes were not completely copied.
2061
2062 * HTML.pm.src: Many bugs are fixed.
2063
2064 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2065
2066 * HTML.pm.src: Some typos are fixed.
2067
2068 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2069
2070 * mkhtmlparser.pl, Makefile: References to the |HTML-consume-entity.src|
2071 are removed.
2072
2073 * HTML.pm.src: Tokenizer's handling on named entities are rewritten.
2074
2075 * HTML-consume-entity.src: Removed.
2076
2077 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2078
2079 * HTML.pm.src: Tokenizer's handling on hexadecimal
2080 numeric entities are rewritten.
2081
2082 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2083
2084 * HTML.pm.src: Some tokenizer bugs are fixed.
2085
2086 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2087
2088 * HTML.pm.src: The tree construction stage is implemented.
2089
2090 * mkhtmlparser.pl: New macros are added.
2091
2092 2007-04-28 Wakaba <wakaba@suika.fam.cx>
2093
2094 * ContentType.pm: A note on bug in the specification
2095 is removed since it's been now fixed.
2096
2097 * .cvsignore: New file.
2098
2099 2007-04-28 Wakaba <wakaba@suika.fam.cx>
2100
2101 * HTML.pm.src, HTML-consume-entity.src: New files.
2102
2103 * Makefile (HTML.pm): New rule.
2104
2105 * mkhtmlparser.pl: New script.
2106
2107 2007-04-25 Wakaba <wakaba@suika.fam.cx>
2108
2109 * Makefile: New file.
2110
2111 2007-04-24 Wakaba <wakaba@suika.fam.cx>
2112
2113 * ContentType.pm: An error in pod is fixed.
2114
2115 2007-04-24 Wakaba <wakaba@suika.fam.cx>
2116
2117 * ContentType.pm: Documentation is added.
2118
2119 2007-04-24 Wakaba <wakaba@suika.fam.cx>
2120
2121 * ContentType.pm: New Perl module.
2122
2123 * ChangeLog: New file.
2124
2125

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24