/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Contents of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.312 - (show annotations) (download)
Mon Sep 15 08:09:39 2008 UTC (16 years, 10 months ago) by wakaba
Branch: MAIN
Changes since 1.311: +4 -0 lines
++ whatpm/Whatpm/ChangeLog	15 Sep 2008 08:09:35 -0000
	* HTML.pm.src: Shorten keys.

2008-09-15  Wakaba  <wakaba@suika.fam.cx>

1 2008-09-15 Wakaba <wakaba@suika.fam.cx>
2
3 * HTML.pm.src: Shorten keys.
4
5 2008-09-15 Wakaba <wakaba@suika.fam.cx>
6
7 * HTML.pm.src: Remove checking for control character, surrogate
8 pair, or noncharacter code points and non-Unicode code
9 points (they should be handled by Whatpm::Charset::UnicodeChecker).
10 (parse_char_stream): Support for the |$get_wrapper| argument and
11 character stream error handlers.
12
13 2008-09-15 Wakaba <wakaba@suika.fam.cx>
14
15 * ContentChecker.pm: Don't call |loda_ns_module|
16 for null-namespace elements/attributes.
17
18 * HTML.pm.src: Fact out $disallowed_control_chars
19 as a hash.
20
21 2008-09-14 Wakaba <wakaba@suika.fam.cx>
22
23 * HTML.pm.src: Regexp typo fixed. |{prev_char}|
24 and |{next_char}| initializations are moved to initialization
25 method. |{read_until}| now supports buffering. Sync |set_inner_html|
26 with |parse_char_stream|.
27
28 2008-09-14 Wakaba <wakaba@suika.fam.cx>
29
30 * HTML.pm.src (parse_char_stream): Make |set_next_char|
31 invoke |manakai_read_until|, not only |read|, where
32 possible, to decrease the number of |read| method calls.
33
34 * mkhtmlparser.pl: Related changes to the aforementioned
35 modification.
36
37 2008-09-14 Wakaba <wakaba@suika.fam.cx>
38
39 * HTML.pm.src: Use |read| instead of |getc|. |set_inner_html|
40 would report character error from now.
41
42 2008-09-14 Wakaba <wakaba@suika.fam.cx>
43
44 * HTML.pm.src: White-space-leaded non-white-space character
45 tokens in "before head insertion mode" was not
46 correctly handled.
47 (set_inner_html): Reimplemented using CharString decodehandle
48 class. Support for $get_wrapper argument. Support
49 for |{read_until}| feature.
50
51 2008-09-14 Wakaba <wakaba@suika.fam.cx>
52
53 * HTML.pm.src: Make a "bare ero" error for unknown
54 entities point the "&" character.
55
56 2008-09-14 Wakaba <wakaba@suika.fam.cx>
57
58 * HTML.pm.src: It turns out that U+FFFD don't have to
59 be added to the list of excluded characters.
60
61 2008-09-14 Wakaba <wakaba@suika.fam.cx>
62
63 * HTML.pm.src ($char_onerror): Have character decoder's |line|
64 and |column| a higher priority than the one set by the
65 tokenizer's input handler.
66 ($self->{read_until}): Exclude U+FFFD (but this might
67 not be necessary, since now we do line/column fixup in
68 the character decode handle).
69
70 2008-09-14 Wakaba <wakaba@suika.fam.cx>
71
72 * HTML.pm.src: Use |{read_until}| where possible.
73
74 2008-09-14 Wakaba <wakaba@suika.fam.cx>
75
76 * HTML.pm.src: Change |{getc_until}| to |{read_until}|
77 and |manakai_getc_until| to |manakai_read_until| to
78 reduce the number of string copies.
79
80 2008-09-14 Wakaba <wakaba@suika.fam.cx>
81
82 * HTML.pm.src (parse_char_string): Use newly created
83 |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
84 standard feature to |open| a string as a filehandle,
85 since Perl's string filehandle seems not supporting |ungetc|
86 method correctly.
87 (parse_char_stream): Define |{getc_until}| method.
88 (DATA_STATE): Experimental support for |getc_until| feature.
89
90 2008-09-13 Wakaba <wakaba@suika.fam.cx>
91
92 * HTML.pm.src: Check points added to newly added branches.
93
94 2008-09-13 Wakaba <wakaba@suika.fam.cx>
95
96 * HTML.pm.src: Remove |{char}|, which is no longer used.
97 Remove |{entity_in_attr}| and |{last_attribute_value_state}|
98 and replaced by |{prev_state}|.
99
100 * mkhtmlparser.pl: Remove |{char}| feature.
101 Remove |!!!back-next-input-character;| macro.
102
103 2008-09-13 Wakaba <wakaba@suika.fam.cx>
104
105 * HTML.pm.src: Finally we get rid of all the inner loops. Remove
106 entity related tokenizer states in favor of new states
107 implementing the consume character reference algorithm.
108
109 2008-09-13 Wakaba <wakaba@suika.fam.cx>
110
111 * HTML.pm.src: "Consume a character reference" algorithm is
112 now implemented as a tokenizer's state, rather than
113 a method, with minimum changes (more changes will
114 be made, in due course). "Bogus comment state"'s inner
115 loop gets removed.
116
117 2008-09-13 Wakaba <wakaba@suika.fam.cx>
118
119 * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
120 into their own tokenizer states.
121
122 2008-09-13 Wakaba <wakaba@suika.fam.cx>
123
124 * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
125 is split into three states.
126
127 2008-09-13 Wakaba <wakaba@suika.fam.cx>
128
129 * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
130 itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
131 no longer does the tokenizer have to push back next input
132 characters in those states.
133
134 2008-09-13 Wakaba <wakaba@suika.fam.cx>
135
136 * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
137 into four states so that no longer does the tokenizer have to push
138 back next input characters in that state.
139
140 2008-09-11 Wakaba <wakaba@suika.fam.cx>
141
142 * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
143 which can be used to insert some wrapper between the character
144 stream handle and the tokenizer. (It is currently not supported
145 for |set_inner_html| for |Element|s).
146
147 2008-09-10 Wakaba <wakaba@suika.fam.cx>
148
149 * HTML.pm.src: Ignore punctuations in charset names.
150
151 2008-09-10 Wakaba <wakaba@suika.fam.cx>
152
153 * ContentChecker.pm: Support for charset-layer error levels.
154
155 * HTML.pm.src: Don't specify |text| argument for the
156 |chardecode:fallback| error, since it is not the encoding
157 being used alternatively.
158
159 2008-09-06 Wakaba <wakaba@suika.fam.cx>
160
161 * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
162
163 2008-08-31 Wakaba <wakaba@suika.fam.cx>
164
165 * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
166
167 2008-08-31 Wakaba <wakaba@suika.fam.cx>
168
169 * HTML.pm.src: Bug fix and sync with the spec with regard
170 to after after frameset insertion mode processing (HTML5
171 revision 1909). Note that the implementation was wrong
172 per the old spec before the r1909 changes.
173
174 2008-08-30 Wakaba <wakaba@suika.fam.cx>
175
176 * HTMLTable.pm: scope=auto algorithm fix synced with the
177 spec (HTML5 revision 2093).
178 ($process_row): Algorithm step numbers synced with the
179 spec (HTML5 revision 2092).
180
181 2008-08-30 Wakaba <wakaba@suika.fam.cx>
182
183 * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
184 revision 2094).
185
186 2008-08-30 Wakaba <wakaba@suika.fam.cx>
187
188 * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
189
190 2008-08-30 Wakaba <wakaba@suika.fam.cx>
191
192 * HTML.pm.src: '"' and "'" at the end of attribute
193 name (after another attribute) now raise parse error (HTML5
194 revision 2123). Empty unquoted attribute values are no
195 longer allowed (HTML5 revision 2122).
196
197 2008-08-30 Wakaba <wakaba@suika.fam.cx>
198
199 * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
200 revision 2130).
201
202 2008-08-30 Wakaba <wakaba@suika.fam.cx>
203
204 * ContentChecker.pm: |xml:lang| attribute value must be same
205 as |lang| attribute value for HTML elements (HTML5 revision 2062
206 and so on).
207
208 2008-08-30 Wakaba <wakaba@suika.fam.cx>
209
210 * ContentChecker.pm: Error level definition for |xml_id_error|
211 was missing.
212
213 * URIChecker.pm: The end of the URL should be marked as the
214 error location for an empty path error. The position
215 between the userinfo and the port components should be
216 marked as the error location for an empty host error.
217
218 2008-08-30 Wakaba <wakaba@suika.fam.cx>
219
220 * URIChecker.pm: Set parameters representing where in the
221 value the error occurs for errors. Report unknown
222 address format error in warning level, since address
223 formats are rarely added. Path segments starting with "/.."
224 were misinterpreted as a dot-segment.
225
226 2008-08-30 Wakaba <wakaba@suika.fam.cx>
227
228 * URIChecker.pm (check_iri_reference): Requires
229 |Message::DOM::DOMImplementation|.
230
231 2008-08-29 Wakaba <wakaba@suika.fam.cx>
232
233 * IMTChecker.pm: Updated for the new error reporting architecture.
234
235 * ContentChecker.pm: Error levels for IMTs are added.
236
237 2008-08-17 Wakaba <wakaba@suika.fam.cx>
238
239 * H2H.pm (_shift_token): Support for unquoted HTML attribute
240 values.
241
242 2008-08-16 Wakaba <wakaba@suika.fam.cx>
243
244 * CacheManifest.pm: Support for new style of error
245 reports.
246
247 * HTML.pm.src: Set line=1, column=1 to the document node.
248
249 2008-08-16 Wakaba <wakaba@suika.fam.cx>
250
251 * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
252 and URL checkers. Support for more error levels for bogus
253 langauge tag and URL "standards".
254
255 * LangTag.pm, URIChecker.pm: Support for new style error
256 level reporting.
257
258 2008-08-15 Wakaba <wakaba@suika.fam.cx>
259
260 * ContentChecker.pm: Support for RDF/XML error levels.
261
262 * HTMLTable.pm, RDFXML.pm: Support for new style of error level
263 specifying. Error types are revised.
264
265 2008-08-15 Wakaba <wakaba@suika.fam.cx>
266
267 * ContentChecker.pm: All error reporting method calls are
268 renewed.
269
270 2008-08-15 Wakaba <wakaba@suika.fam.cx>
271
272 * HTML.pm.src: All error type names and "text" parameters
273 are revised. Use new style for "level" specification.
274
275 * mkhtmlparser.pl: Use new style for "level" specification.
276
277 2008-08-03 Wakaba <wakaba@suika.fam.cx>
278
279 * WebIDL.pm (parse_char_string): Simplified error
280 reporting process for broken ignored valuetype definition.
281 (Valuetype idl_text): Support for special "DOMString" name.
282
283 2008-08-03 Wakaba <wakaba@suika.fam.cx>
284
285 * WebIDL.pm ($get_scoped_name): Append "::::" if the last
286 terminal of the ScopedName is "DOMString", such that whether
287 the last part of the scoped name is "DOMString" or "_DOMString"
288 later. It is necessary to determine whether a |typedef|
289 definition should be ignored or not.
290 (parse_char_string): Unescape the identifier of
291 exception members.
292 ($resolve): Return undef for builtin types and sequence<T>
293 types (we might not have to do this, however...).
294 (check): Support checking for Exceptions, Valuetypes,
295 and Typedefs.
296 ($serialize_type): Support for "DOMString::::" syntax.
297 (Typedef idl_text): Output Type as "DOMString" if it
298 is really "DOMString" (i.e. its internal representation
299 is "::DOMString::").
300
301 2008-08-03 Wakaba <wakaba@suika.fam.cx>
302
303 * WebIDL.pm ($resolve): New code, based on resolve code
304 for constant types in the |check| method.
305 (check): Support for checking of attributes, operations, and
306 arguments.
307 (Attribute/Operation idl_text): Exception names in getraises,
308 setraises, and raises clauses is serizlied by |$serialize_type|
309 code.
310
311 2008-08-02 Wakaba <wakaba@suika.fam.cx>
312
313 * WebIDL.pm ($integer): Order of selections are changed to match
314 hexadecimal numbers (the original pattern, taken from the spec,
315 was not work for hexadecimal numbers, because the "0" prefix
316 matches to the [0-7]* part (as an empty string) and therefore
317 it does not match with remaining "x..." part of a "0x..." integer
318 literal.
319 ($get_type): It now returns a string, not an array reference,
320 for regular types and |sequence| types (i.e. it in any case
321 returns a string).
322 ($get_next_token): The second item in the array that represents
323 a integer or float token is now a Perl number value, not the
324 original string representation of the number.
325 (check): Support for const value consistency checking.
326 No extended attribute is defined for constants.
327 (Node subclasses): Use simple strings rather than array references
328 for default data type values.
329 ($serialize_type): Type values are now simple strings.
330 (value): If the new attribute value is a false value, then
331 a FALSE value is set to the attribute.
332
333 2008-08-02 Wakaba <wakaba@suika.fam.cx>
334
335 * WebIDL.pm ($get_scoped_name): Now scoped names are stored
336 in its stringified format ("scoped name" as defined in the
337 spec). Note that future version of this module should not use
338 array references for type values and the |type_text| attribute
339 should be made obsolete.
340 (parse_char_string): Unescape attribute names.
341 (check): Support for checking of whether inherited interfaces
342 are actually defined or not. Support for checking of whether
343 interface member identifiers are duplicated or not.
344 ($serialize_type): Scoped names are returned as is. A future
345 version of this code should escape identifiers other than "DOMString",
346 otherwise the idl_text would be non-conforming.
347
348 2008-08-02 Wakaba <wakaba@suika.fam.cx>
349
350 * WebIDL.pm (parse_char_string): Set line/column numbers
351 to generated nodes. Unescape identifiers. Extended attributes
352 for Definition's were ignored.
353 (append_child): Set |parent_node| attribute.
354 (parent_node): New attribute.
355 (check): Support interface/exception members. Support
356 extended attributes. Support definition identifier uniqueness
357 constraint.
358 (qualified_name): New attribute.
359 (Interface/Exception idl_text): Extended attributes were
360 not prepended to the returned text.
361
362 2008-08-02 Wakaba <wakaba@suika.fam.cx>
363
364 * WebIDL.pm (parse_char_string): Set line/column numbers
365 to interface object experimentally. s/shift/pop/g, shift
366 would make things wrong. Support for interface forward
367 declarations was missing. Broken interface declarations
368 with no block were not ignored entirely.
369 (Whatpm::WebIDL::Node): New abstract class. This class
370 makes things easier.
371 (child_nodes): New attribute. Unlike DOM's attribute with
372 same name, this attribute returns a dead list of nodes for
373 simplicity.
374 (get_user_data, set_user_data): New methods.
375 (Module idl_text): A SPACE character should be inserted
376 before the |{| character.
377 (Interface idl_text): Support for interface forward declarations.
378 (is_forward_declaration): New attribute.
379
380 2008-07-19 Wakaba <wakaba@suika.fam.cx>
381
382 * WebIDL.pm (type_text): Better serializer.
383
384 2008-07-19 Wakaba <wakaba@suika.fam.cx>
385
386 * WebIDL.pm: Revise forward-compatible parsing so that
387 it now can handle broken extended attributes and as such.
388
389 2008-07-19 Wakaba <wakaba@suika.fam.cx>
390
391 * WebIDL.pm: Real support for extended attributes.
392 Support for extended attributes with arguments.
393
394 2008-07-19 Wakaba <wakaba@suika.fam.cx>
395
396 * WebIDL.pm: Support for |exception| syntax.
397 (Interface->idl_text): Tentative support for inheritances.
398
399 2008-07-19 Wakaba <wakaba@suika.fam.cx>
400
401 * WebIDL.pm: Hierarchical scoped name support was broken.
402 Support for raises, setraises, and getraises syntaxes.
403
404 2008-07-18 Wakaba <wakaba@suika.fam.cx>
405
406 * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
407 proper support for types, extended attributes, and exceptions yet).
408 WebIDL parser, version 1 (no support for exceptions yet,
409 no proper support for extended attributes yet).
410
411 2008-07-09 Wakaba <wakaba@suika.fam.cx>
412
413 * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
414
415 2008-06-29 Wakaba <wakaba@suika.fam.cx>
416
417 * WebIDL.pm: Support for valuetype and const.
418
419 2008-06-29 Wakaba <wakaba@suika.fam.cx>
420
421 * WebIDL.pm: New module.
422
423 2008-06-15 Wakaba <wakaba@suika.fam.cx>
424
425 * Makefile (Entities.html): URI changed.
426
427 2008-06-08 Wakaba <wakaba@suika.fam.cx>
428
429 * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
430
431 2008-06-01 Wakaba <wakaba@suika.fam.cx>
432
433 * HTML.pm.src (_get_next_token): A parse error was missing.
434
435 2008-06-01 Wakaba <wakaba@suika.fam.cx>
436
437 * mklinktypelist.pl: rel=contact is no longer part of the HTML5
438 spec (commented out). (HTML5 revision 1711).
439
440 2008-05-25 Wakaba <wakaba@suika.fam.cx>
441
442 * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
443
444 * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
445 as UTF-16 (HTML5 revision 1701).
446
447 2008-05-25 Wakaba <wakaba@suika.fam.cx>
448
449 * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
450 1692).
451
452 2008-05-25 Wakaba <wakaba@suika.fam.cx>
453
454 * HTML.pm.src: The secondary insertion mode used when switching
455 to foreign content is the "in body" insertion mode (HTML5 revision
456 1696).
457
458 2008-05-25 Wakaba <wakaba@suika.fam.cx>
459
460 * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
461 revision 1697).
462
463 2008-05-24 Wakaba <wakaba@suika.fam.cx>
464
465 * HTML.pm.src: Support for end-of-file token in foreign content
466 insertion mode (HTML5 revision 1693). Update SVG camelCase
467 attribute list (HTML5 revision 1700). <textarea> closes
468 </select> (HTML5 revision 1699). More start tags close in
469 foreign content insertion mode (HTML5 revision 1698).
470
471 2008-05-24 Wakaba <wakaba@suika.fam.cx>
472
473 * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
474
475 2008-05-24 Wakaba <wakaba@suika.fam.cx>
476
477 * HTML.pm.src: More robust charset parameter detection (HTML5
478 revision 1674).
479
480 2008-05-24 Wakaba <wakaba@suika.fam.cx>
481
482 * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
483 revision 1676).
484
485 2008-05-24 Wakaba <wakaba@suika.fam.cx>
486
487 * HTML.pm.src: Ignore language part of public identifiers for
488 quriks mode detection (HTML5 revision 1679).
489
490 2008-05-24 Wakaba <wakaba@suika.fam.cx>
491
492 * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
493 revision 1685).
494
495 2008-05-24 Wakaba <wakaba@suika.fam.cx>
496
497 * HTML.pm.src: Support for EOF in new states for tags (HTML5
498 revision 1684).
499
500 2008-05-24 Wakaba <wakaba@suika.fam.cx>
501
502 * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
503 work (HTML5 revision 1690).
504
505 2008-05-24 Wakaba <wakaba@suika.fam.cx>
506
507 * HTML.pm.src (_tree_construction_main): Change handling of
508 end tags in head insertion modes (HTML5 revision 1686).
509 (parse_char_string): Bug fix for non-utf8 character string handlings.
510 (parse_char_stream): |ungetc| does not work well for this context.
511
512 2008-05-18 Wakaba <wakaba@suika.fam.cx>
513
514 * HTML.pm.src (parse_byte_string): Redefined to invoke
515 |parse_byte_stream|.
516 (parse_byte_stream): New method.
517
518 2008-05-18 Wakaba <wakaba@suika.fam.cx>
519
520 * HTML.pm.src (parse_byte_string): Fix the column number reported
521 by encoding layer error reporter.
522
523 2008-05-17 Wakaba <wakaba@suika.fam.cx>
524
525 * HTML.pm.src (parse_byte_string): Use streaming decoder
526 rather than converting the whole byte string and then parsing.
527 Propagate errors in character encoding layer.
528 (get_next_token): Precise error reporting for |bare stago| error.
529
530 2008-05-17 Wakaba <wakaba@suika.fam.cx>
531
532 * HTML.pm.src (parse_char_stream): New method.
533 (parse_char_string): This method is now defined as an invocation
534 of the |parse_char_stream| method.
535
536 2008-05-17 Wakaba <wakaba@suika.fam.cx>
537
538 * HTML.pm.src (parse_byte_string): Report various status
539 of the sniffing as info-level errors. Support for new
540 decoding framework in parser resestting.
541 (new): Various default error levels were not set.
542
543 2008-05-17 Wakaba <wakaba@suika.fam.cx>
544
545 * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
546 algorithm, except for the actual sniffing, is implemented
547 with new framework with Message::Charset::Info.
548
549 2008-05-16 Wakaba <wakaba@suika.fam.cx>
550
551 * CacheManifest.pm (_parse): Drop fragment identifiers from
552 URIs in fallback section (HTML5 revision 1596).
553
554 2008-05-10 Wakaba <wakaba@suika.fam.cx>
555
556 * Makefile (Entities.html): URI has changed.
557
558 2008-05-10 Wakaba <wakaba@suika.fam.cx>
559
560 * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
561 1553).
562
563 2008-05-06 Wakaba <wakaba@suika.fam.cx>
564
565 * ContentChecker.pm: Noted that those returned in |table| are
566 no longer table elements, but table objects returned
567 by Whatpm::HTMLTable.
568
569 * HTMLTable.pm (form_table): Return table element node
570 as |$table->{element}|.
571 (assign_header): Support for the |headers=""| attribute.
572
573 2008-05-06 Wakaba <wakaba@suika.fam.cx>
574
575 * HTMLTable.pm (assign_header): New function; first version
576 with no support for headers="".
577 (form_table): Include table width and height to the returned
578 table object for covenience. Indexing in column assignement was wrong.
579 Set whether a data cell is empty or not for convenience.
580
581 2008-05-05 Wakaba <wakaba@suika.fam.cx>
582
583 * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
584
585 2008-05-05 Wakaba <wakaba@suika.fam.cx>
586
587 * HTMLTable.pm: How table model errors are detected is
588 changed (HTML5 revision 1387).
589
590 2008-05-05 Wakaba <wakaba@suika.fam.cx>
591
592 * HTMLTable.pm: The algorithm now moves |tfoot| elements
593 to the end of the table (HTML5 revision 1380).
594
595 2008-05-05 Wakaba <wakaba@suika.fam.cx>
596
597 * HTMLTable.pm: The algorithm is now 0-based indexing, instead
598 of 1-based (HTML5 revision 1376).
599
600 2008-05-05 Wakaba <wakaba@suika.fam.cx>
601
602 * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
603 and "Content-Encoding" no longer prevent sniffing (HTML5
604 revision 1288).
605
606 2008-05-05 Wakaba <wakaba@suika.fam.cx>
607
608 * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
609 1282).
610
611 2008-05-03 Wakaba <wakaba@suika.fam.cx>
612
613 * ContentChecker.pm: Support for global attributes.
614 Status of XML specs are added.
615
616 2008-05-03 Wakaba <wakaba@suika.fam.cx>
617
618 * ContentChecker.pm (check_element): Support for |template=""|
619 and |ref=""| attribute (referent element type checking).
620
621 2008-04-29 Wakaba <wakaba@suika.fam.cx>
622
623 * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
624 1500) is implemented (except for IDNA part and URI-scheme-specific
625 knowledge). Line number counting was wrong for LF-only
626 documents.
627
628 2008-04-13 Wakaba <wakaba@suika.fam.cx>
629
630 * HTML.pm.src: Raise an parse error for any disallowed
631 character (HTML5 revision 1263).
632
633 2008-04-13 Wakaba <wakaba@suika.fam.cx>
634
635 * mkentitylist.pl: Support for new HTML5 entity table format (the
636 definition for |AElig;| was missing).
637
638 2008-04-13 Wakaba <wakaba@suika.fam.cx>
639
640 * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
641 name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
642
643 2008-04-13 Wakaba <wakaba@suika.fam.cx>
644
645 * HTML.pm.src: List of element names that close foreign content
646 insertion mode is added (HTML5 revisions 1412 and 1418).
647
648 2008-04-13 Wakaba <wakaba@suika.fam.cx>
649
650 * HTML.pm.src: Support for |mglyph| and |malignmark|
651 elements (HTML5 revision 1410).
652
653 2008-04-13 Wakaba <wakaba@suika.fam.cx>
654
655 * HTML.pm.src: Support for new long MathML entities (HTML5
656 revision 1406).
657
658 2008-04-13 Wakaba <wakaba@suika.fam.cx>
659
660 * HTML.pm.src: CDATA section support for MathML and SVG
661 elements (HTML5 revision 1404 and 1420).
662
663 2008-04-12 Wakaba <wakaba@suika.fam.cx>
664
665 * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
666 elements (HTML5 revision 1404). Unused !!!macro definitions
667 are removed.
668
669 2008-04-12 Wakaba <wakaba@suika.fam.cx>
670
671 * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
672 are raised is changed (HTML5 revision 1404).
673
674 2008-04-06 Wakaba <wakaba@suika.fam.cx>
675
676 * HTML.pm.src: Category sets that are no longer used are removed.
677
678 2008-04-06 Wakaba <wakaba@suika.fam.cx>
679
680 * HTML.pm.src: The ->[1] property of stack entries are now
681 replaced by constants representing element category.
682
683 2008-04-06 Wakaba <wakaba@suika.fam.cx>
684
685 * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
686 for error reporting. (This is a preparation for using constant
687 value for ->[1].)
688
689 2008-03-22 Wakaba <wakaba@suika.fam.cx>
690
691 * RDFXML.pm: Typo fixed.
692
693 2008-03-22 Wakaba <wakaba@suika.fam.cx>
694
695 * ContentChecker.pm: |fact_level| is now treated
696 as same as |must_level|, i.e. level = |m|.
697 (check_element): Make list of URIs in the DOM.
698
699 2008-03-21 Wakaba <wakaba@suika.fam.cx>
700
701 * RDFXML.pm: Language accessor implemented.
702 Local (null-namespace) attribute support.
703
704 2008-03-21 Wakaba <wakaba@suika.fam.cx>
705
706 * RDFXML.pm: Factored out ID checking code.
707
708 2008-03-21 Wakaba <wakaba@suika.fam.cx>
709
710 * RDFXML.pm: TODO items noted. Validation of ID
711 and URI attributes is implemented. Warn if unknown
712 value is used in rdf:parseType="" attribute.
713
714 * URIChecker.pm (check_rdf_uri_reference): New function.
715
716 2008-03-21 Wakaba <wakaba@suika.fam.cx>
717
718 * RDFXML.pm: bnodeid implemented. Relative references
719 are now resolved.
720
721 2008-03-21 Wakaba <wakaba@suika.fam.cx>
722
723 * ContentChecker.pm: RDF reification implemented.
724
725 * RDFXML.pm: undef vs false bug fixed.
726 Reification implemented.
727
728 2008-03-21 Wakaba <wakaba@suika.fam.cx>
729
730 * RDFXML.pm: s/id/ID/ for attribute name.
731 The |node| arguments are added for |ontriple| calls.
732 Too many "attribute not allowed" errors were raised.
733
734 * ContentChecker.pm: Initial experimental support for rdf:RDF
735 element.
736
737 2008-03-21 Wakaba <wakaba@suika.fam.cx>
738
739 * RDFXML.pm: New module.
740
741 2008-03-20 Wakaba <wakaba@suika.fam.cx>
742
743 * HTML.pm.src (set_inner_html): Line/column number
744 code was old one yet.
745
746 2008-03-20 Wakaba <wakaba@suika.fam.cx>
747
748 * HTML.pm.src: Better line/column reporting for "duplicate attribute"
749 errors. Line/column markings for DOCTYPE, comment, and
750 character tokens are reintroduced; otherwise,
751 error location for "not HTML5" error and errors
752 for implied elements are not attached.
753
754 2008-03-20 Wakaba <wakaba@suika.fam.cx>
755
756 * HTML.pm.src: Set line/column numbers to attributes.
757
758 * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
759 (value): Setter implemented.
760
761 * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
762
763 2008-03-20 Wakaba <wakaba@suika.fam.cx>
764
765 * HTML.pm.src: Unused line/column markings are removed.
766
767 2008-03-20 Wakaba <wakaba@suika.fam.cx>
768
769 * HTML.pm.src (_get_next_token): Remove |first_start_tag|
770 flag, which is no longer used.
771
772 2008-03-17 Wakaba <wakaba@suika.fam.cx>
773
774 * HTML.pm.src: Set line/column information to element nodes.
775
776 * mkhtmlparser.pl (!!!create-element, !!!insert-element,
777 and !!!insert-element-t): Set line/column information to
778 element nodes.
779
780 2008-03-17 Wakaba <wakaba@suika.fam.cx>
781
782 * HTML.pm.src (_get_next_token): The first "<" character
783 in "<?", "<>", or "</>" should be the error point.
784
785 2008-03-16 Wakaba <wakaba@suika.fam.cx>
786
787 * HTML.pm.src: Some more fixes on error position reporting.
788
789 2008-03-16 Wakaba <wakaba@suika.fam.cx>
790
791 * HTML.pm.src: Token-level precious error reporting.
792
793 2008-03-16 Wakaba <wakaba@suika.fam.cx>
794
795 * HTML.pm.src: Preparation for more precious error point
796 reporting.
797
798 2008-03-11 Wakaba <wakaba@suika.fam.cx>
799
800 * HTML.pm.src: Error type revised.
801
802 2008-03-11 Wakaba <wakaba@suika.fam.cx>
803
804 * HTML.pm.src: Similar codes are merged together, again.
805
806 2008-03-11 Wakaba <wakaba@suika.fam.cx>
807
808 * HTML.pm.src: Similar codes are merged together.
809
810 2008-03-10 Wakaba <wakaba@suika.fam.cx>
811
812 * mkhtmlparser.pl: Set "level" parameter to parse errors.
813
814 * HTML.pm.src: Code refined.
815
816 2008-03-09 Wakaba <wakaba@suika.fam.cx>
817
818 * HTML.pm.src: |</body>| treatement has been changed (HTML5
819 revision 1348). Note that I really don't know this makes
820 any difference in the black-box behavior of the parser.
821
822 2008-03-09 Wakaba <wakaba@suika.fam.cx>
823
824 * HTML.pm.src: New end-of-file token implementation (HTML5
825 revision 1348).
826
827 2008-03-09 Wakaba <wakaba@suika.fam.cx>
828
829 * HTML.pm.src: |applet| support (HTML5 revision 1347).
830
831 2008-03-09 Wakaba <wakaba@suika.fam.cx>
832
833 * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
834
835 2008-03-09 Wakaba <wakaba@suika.fam.cx>
836
837 * HTML.pm.src: Support for |<input>| in the "in select" insertion
838 mode and support for the "in select in table" insertion mode (HTML5
839 revision 1342).
840
841 2008-03-09 Wakaba <wakaba@suika.fam.cx>
842
843 * HTML.pm.src: No foster parenting for <script> and <script>
844 in non-tainted <table>s (HTML5 revision 1336).
845
846 2008-03-09 Wakaba <wakaba@suika.fam.cx>
847
848 * HTML.pm.src: Ignore white space characters between <html>
849 and <head> (HTML5 revision 1332).
850
851 2008-03-09 Wakaba <wakaba@suika.fam.cx>
852
853 * HTML.pm.src: Treat <input type=hidden> as if it were a
854 white space (HTML5 revision 1331).
855
856 2008-03-08 Wakaba <wakaba@suika.fam.cx>
857
858 * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
859 element (HTML5 revision 1330).
860
861 2008-03-08 Wakaba <wakaba@suika.fam.cx>
862
863 * HTML.pm.src: <title> is always appended to the current
864 element (HTML5 revision 1328).
865
866 2008-03-08 Wakaba <wakaba@suika.fam.cx>
867
868 * HTML.pm.src: White space in tainted tables are moved
869 into foster parents (HTML5 revision 1326).
870
871 2008-03-08 Wakaba <wakaba@suika.fam.cx>
872
873 * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
874 revision 1321).
875
876 2008-03-08 Wakaba <wakaba@suika.fam.cx>
877
878 * HTML.pm.src: |</h/n/>| case code rearranged to align with
879 the spec (HTML5 revision 1320). Note that we finally complete
880 all of HTML5 revision 1320 changes.
881
882 2008-03-08 Wakaba <wakaba@suika.fam.cx>
883
884 * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
885 tags (HTML5 revision 1320).
886
887 2008-03-08 Wakaba <wakaba@suika.fam.cx>
888
889 * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
890
891 2008-03-08 Wakaba <wakaba@suika.fam.cx>
892
893 * HTML.pm.src: A "generate implied end tags" code (t409.1)
894 could not be reached so that it is now removed (HTML5 revision
895 1320).
896
897 2008-03-08 Wakaba <wakaba@suika.fam.cx>
898
899 * HTML.pm.src: Code for the case of |</div>| and so on are revised
900 to align with new spec text (HTML5 revision 1320).
901
902 2008-03-08 Wakaba <wakaba@suika.fam.cx>
903
904 * HTML.pm.src: Remove strange |if| condition; however, it should
905 have had no harm in theory.
906
907 2008-03-08 Wakaba <wakaba@suika.fam.cx>
908
909 * HTML.pm.src (_tree_construction_main): '</p>' in body
910 case is split from other end tags for the preparation
911 of implementing HTML revision 1320.
912
913 2008-03-07 Wakaba <wakaba@suika.fam.cx>
914
915 * HTML.pm.src: Simplified "generate implied end tag" (HTML5
916 revision 1320).
917
918 2008-03-07 Wakaba <wakaba@suika.fam.cx>
919
920 * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
921 and "div" (HTML5 revision 1318). Add comments to where
922 |form| pointer association codes should be inserted (HTML5 revision
923 1319).
924
925 2008-03-06 Wakaba <wakaba@suika.fam.cx>
926
927 * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
928
929 2008-03-05 Wakaba <wakaba@suika.fam.cx>
930
931 * HTML.pm.src: Since the case t268 should never be reached (no
932 other token type, there are), it is replaced by a |die| statement.
933
934 2008-03-05 Wakaba <wakaba@suika.fam.cx>
935
936 * HTML.pm.src: Typo fixed.
937
938 2008-03-04 Wakaba <wakaba@suika.fam.cx>
939
940 * HTML.pm.src (_tree_construction_initial): Some limited quirks
941 doctypes were not uppercased for comparison.
942
943 2008-03-03 Wakaba <wakaba@suika.fam.cx>
944
945 * HTML.pm.src (tree construction and set_inner_html): Checkpoints
946 are added.
947
948 2008-03-03 Wakaba <wakaba@suika.fam.cx>
949
950 * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
951 are set. Cases that are unlikely reached are noted as so.
952
953 2008-03-03 Wakaba <wakaba@suika.fam.cx>
954
955 * HTML.pm.src: Checkpoints for debugging are added.
956
957 * mkhtmlparser.pl: Support for |!!!cp| syntax.
958
959 2008-03-03 Wakaba <wakaba@suika.fam.cx>
960
961 * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
962 for simplicity.
963
964 2008-03-03 Wakaba <wakaba@suika.fam.cx>
965
966 * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
967 revision 1307).
968
969 2008-03-03 Wakaba <wakaba@suika.fam.cx>
970
971 * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
972 is set are changed (HTML5 revision 1305).
973
974 2008-03-02 Wakaba <wakaba@suika.fam.cx>
975
976 * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
977 1292). Entities are not parsed in comment-like part in RCDATA
978 elements (HTML5 revision 1294). Allow bare & at the end
979 of attribute value literals (HTML5 revision 1296). More
980 quirks mode doctypes (HTML5 revision 1302). Requires spaces
981 between attributes and ban attribute names or unquoted
982 attribute values containing single or double quotes (HTML5
983 revision 1303).
984
985 2008-03-02 Wakaba <wakaba@suika.fam.cx>
986
987 * ContentChecker.pm: Typo fixed. Don't raise "character encoding"
988 and related errors unless it is an HTML document (though the spec
989 is unclear on whether it is applied to XHTML document).
990
991 * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
992 are added.
993
994 2008-03-02 Wakaba <wakaba@suika.fam.cx>
995
996 * HTML.pm.src: s/local_name/manakai_local_name/g.
997
998 2008-03-01 Wakaba <wakaba@suika.fam.cx>
999
1000 * _NamedEntityList.pm: Updated (HTML5 revision 1286).
1001
1002 * HTML.pm.src: |charset| in |content| attribute is
1003 case-insensitive (HTML5 revision 1270).
1004
1005 2008-02-26 Wakaba <wakaba@suika.fam.cx>
1006
1007 * ContentChecker.pm: New status constants are added.
1008 ($ElementDefault): |status| added.
1009 (check_element): Err for non-standard or deprecated elements.
1010 (_attr_status_info): For non-standard or deprecated attributes.
1011
1012 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1013
1014 * ContentChecker.pm (_attr_status_info): New internal method.
1015
1016 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1017
1018 * ContentChecker.pm (check_element): Element standardized
1019 status information is now dispatched.
1020
1021 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1022
1023 * ContentChecker.pm (check_element): Fix |del|-and-significant
1024 problem by adding some more arguments.
1025
1026 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1027
1028 * ContentChecker.pm (check_element): Use context of
1029 container-for-the-purpose-of-content-model element (not transparent
1030 element) for |check_child_element| calling and significant
1031 text flag marking. This reintroduces |<del>|-and-significant
1032 problem again.
1033
1034 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1035
1036 * ContentChecker.pm (check_element): Make semi-transparent
1037 elements ignored for the purpose of phase changes in
1038 content model checking.
1039
1040 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1041
1042 * ContentChecker.pm (check_element): In-element state
1043 was not properly managed for transparent cases.
1044
1045 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1046
1047 * ContentChecker.pm (check_element): Support for |video|
1048 and |audio| as semi-transparent elements.
1049
1050 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1051
1052 * ContentChecker.pm ($HTMLSemiTransparentElements): New.
1053 (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
1054 Support for |html:object| as a semi-transparent element.
1055
1056 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1057
1058 * ContentChecker.pm (check_element): The way to traverse
1059 the tree is entirely revised to make it easier to track
1060 the state of ancestors/descendants. As a result of this
1061 revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1062 support for content model checking for HTML elements |figure|,
1063 |object|, |video|, and |audio| and checking for XML elements (and
1064 some XMLNS checkings) are dropped for now. They will be
1065 reimplemented in due cource.
1066
1067 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1068
1069 * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1070 token (HTML5 revision 1225).
1071
1072 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1073
1074 * ContentChecker.pm ({unsupported_level}): New value.
1075
1076 * HTML.pm.src: Save whether |meta| |content| attribute
1077 contains character references or not.
1078
1079 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1080
1081 * ContentChecker.pm (_get_children): (Incompleted) attempt to
1082 imlement significant content checking for contents
1083 with |del| elements.
1084
1085 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1086
1087 * ContenteChecker.pm ($HTMLTransparentElements): More
1088 elements are added.
1089 (_get_children): HTML |object| elements are now semi-transparent.
1090
1091 * NanoDOM.pm (manakai_html, manakai_head): New methods.
1092
1093 2008-02-16 Wakaba <wakaba@suika.fam.cx>
1094
1095 * CacheManifest.pm: HTML5 revision 1211 implemented.
1096
1097 * CacheManifest.pod: Updated.
1098
1099 2008-02-10 Wakaba <wakaba@suika.fam.cx>
1100
1101 * ContentChecker.pm (check_document, check_element): Support
1102 for second argument ($onsubdoc).
1103 (_get_css_parser): Removed (now it is part of WDCC).
1104
1105 2008-02-09 Wakaba <wakaba@suika.fam.cx>
1106
1107 * ContentChecker.pm (_get_css_parser): New.
1108
1109 2007-11-25 Wakaba <wakaba@suika.fam.cx>
1110
1111 * ContentChecker.pm ($AnyChecker): Old way to add child elements
1112 for checking had been used.
1113
1114 2007-11-25 Wakaba <wakaba@suika.fam.cx>
1115
1116 * ContentChecker.pm (check_element): New todo item type |descendant|.
1117
1118 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1119
1120 * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1121 specced in RFC 2046 and RFC 4288.
1122 (application/atom+xml): Definition added.
1123
1124 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1125
1126 * URIChecker.pm: Make RFC 3986 should-level errors
1127 warnings (rather than SHOULD-level errors).
1128
1129 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1130
1131 * NanoDOM.pm (get_user_data, set_user_data): New methods.
1132
1133 * HTML.pm.src: A flag for character references in attribute
1134 values are added. Set |manakai_has_reference| user data
1135 to |charset| attribute.
1136
1137 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1138
1139 * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1140 attributes.
1141
1142 * ContentChecker.pm (check_document): Warn if charset requirements
1143 cannot be tested.
1144
1145 2007-11-19 Wakaba <wakaba@suika.fam.cx>
1146
1147 * HTML.pm.src (parse_byte_string): Detect charset
1148 by universalchardet if charset parameter is not specified.
1149
1150 * Makefile (Charset-all, Charset-clean): New rules.
1151
1152 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1153
1154 * ContentChecker.pm (check_document): Check the existence
1155 of character encoding declaration and selection of encoding
1156 for HTML document.
1157
1158 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1159
1160 * ContentType.pm (get_sniffed_type): Return also the official
1161 type in list context.
1162
1163 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1164
1165 * ContentType.pm: Sniffing with leading white space
1166 ignoring (HTML5 revisions 1013 and 1016).
1167
1168 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1169
1170 * ContentType.pm: HTML5 revision 1013 changes, except for leading
1171 white spaces, are implemented.
1172
1173 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1174
1175 * HTML.pm.src (parse_char_string): Set |inner_encoding|
1176 attribute if possible.
1177
1178 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1179
1180 * HTML.pm.src (parse_byte_string): New method.
1181 (parse_char_string): New alias for |parse_string|.
1182 (main phase): Invoking "change the encoding" algorithm if desired.
1183
1184 * HTML.pod: Updated.
1185
1186 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1187
1188 * HTML.pod (get_inner_html): Removed.
1189
1190 * Makefile (HTML-all, HTML-clean): New.
1191
1192 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1193
1194 * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1195
1196 2007-11-08 Wakaba <wakaba@suika.fam.cx>
1197
1198 * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1199
1200 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1201
1202 * Makefile: |CacheManifest.html| is added.
1203
1204 * CacheManifest.pod: New file.
1205
1206 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1207
1208 * CacheManifest.pm: New module.
1209
1210 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1211
1212 * HTML.pm.src: Support for application cache selection algorithm
1213 callback.
1214
1215 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1216
1217 * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1218
1219 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1220
1221 * Makefile (clean): New rule.
1222
1223 * NanoDOM.pm (public_id, system_id): New attributes.
1224
1225 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1226
1227 * Makefile (CSS-all, CSS-clean, clean): New rules.
1228
1229 2007-10-14 Wakaba <wakaba@suika.fam.cx>
1230
1231 * ContentChecker.pm (check_document): Support for
1232 new |is_xml_root| flag.
1233 (check_element): Support for new |pluses| state.
1234 (_add_pluses): New method.
1235 (_remove_minuses): Support for new |minus| item.
1236
1237 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1238
1239 * ContentChecker.pm: Raise specific error for invalid
1240 root element.
1241
1242 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1243
1244 * ContentChecker.pm: Set level values for later uses.
1245
1246 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1247
1248 * ContentChecker.pm: Support for language tag validation.
1249
1250 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1251
1252 * LangTag.pm (check_rfc3066_language_tag): New method.
1253
1254 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1255
1256 * LangTag.pm: New module.
1257
1258 2007-09-04 Wakaba <wakaba@suika.fam.cx>
1259
1260 * HTML.pm.src: Some error types were wrong.
1261
1262 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1263
1264 * CSS/: New directory.
1265
1266 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1267
1268 * ContentChecker.pm (_check_get_children): Support
1269 for |noscript| in |head|.
1270
1271 2007-08-12 Wakaba <wakaba@suika.fam.cx>
1272
1273 * URI/: New directory.
1274
1275 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1276
1277 * HTML.pm.src: Tokenizer's states are now represented in
1278 number.
1279
1280 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1281
1282 * HTML.pm.src: |or|s for insertion modes are replaced
1283 by |&|s.
1284
1285 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1286
1287 * HTML.pm.src: Token types are now represented in number.
1288
1289 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1290
1291 * ContentType.pm (SEE ALSO): Updated.
1292
1293 * HTML.pm.src: Insertion modes are now represented in number.
1294
1295 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1296
1297 * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1298 999) is implemented.
1299
1300 2007-08-08 Wakaba <wakaba@suika.fam.cx>
1301
1302 * ContentType.pm: Sniffing for |<script| (HTML5 revision
1303 983) is implemented.
1304
1305 2007-08-06 Wakaba <wakaba@suika.fam.cx>
1306
1307 * ContentChecker.pod: New documentation.
1308
1309 * Makefile: A rule for |ContentChecker.html| is added.
1310
1311 * ContentChecker.pm: A pod "LICENSE" section is added.
1312
1313 * NanoDOM.pm ($VERSION): New variable.
1314
1315 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1316
1317 * H2H.pm: |b|, |i|, and |sub| are added to the
1318 list of allowed HTML elements.
1319
1320 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1321
1322 * H2H.pm: |samp| is added to the list of allowed
1323 HTML elements.
1324
1325 * URIChecker.pm (check_iri): New.
1326 (check_iri_reference): Error type for IRI reference
1327 syntax error is changed.
1328
1329 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1330
1331 * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1332 is added.
1333 (check_document): Load appropriate module before validation.
1334
1335 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1336
1337 * ContentChecker/: New directory.
1338
1339 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1340
1341 * ContentChecker.pm: HTML |time| element is implemented.
1342
1343 * HTMLTable.pm: Comments are updated as HTML5 is revised.
1344
1345 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1346
1347 * ContentChecker.pm (check_document): Return value
1348 even if no document element is found.
1349
1350 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1351
1352 * HTML.pm.src: |$in_body| is no longer a function.
1353
1354 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1355
1356 * HTML.pm.src: The |$in_body| code has been moved down.
1357
1358 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1359
1360 * HTML.pm.src: The "trailing end" insertion mode
1361 is split into "after html body" and "after html frameset"
1362 insertion modes. Their codes are merged with "after body"
1363 and "after frameset" codes. |$previous_insertion_mode|
1364 has been removed. "after frameset" code is
1365 merged with "in frameset" code.
1366
1367 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1368
1369 * HTML.pm.src: The "before head" insertion mode is
1370 merged with the "in head" insertion mode.
1371
1372 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1373
1374 * HTML.pm.src: Cases in "in head" insertion mode are
1375 reorganized.
1376
1377 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1378
1379 * HTML.pm.src: Some cases in "in table" insertion mode
1380 are merged.
1381
1382 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1383
1384 * HTML.pm.src: The "in row" insertion mode is merged
1385 with "in table" insertion mode.
1386
1387 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1388
1389 * HTML.pm.src: The "in table" and "in table body" insertion
1390 modes are merged.
1391
1392 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1393
1394 * HTML.pm.src: There is no "in table head" or "in table foot"
1395 insertion mode!
1396
1397 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1398
1399 * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
1400 now directly invoke the handler.
1401
1402 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1403
1404 * HTML.pm.src: Codes for "in cell" insertion mode
1405 is merged to the "in body" insertion mode code.
1406
1407 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1408
1409 * HTML.pm.src: Codes for "in body" and "in caption"
1410 insertion modes are merged.
1411
1412 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1413
1414 * HTML.pm.src: Two |!!!next-token|s were missing.
1415
1416 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1417
1418 * HTML.pm.src: Use numeric constant for |{content_mode}|
1419 instead of string constant for |{content_model_flag}|.
1420
1421 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1422
1423 * HTML.pm.src: Add the name of the attribute
1424 to the "duplicate attribute" error.
1425
1426 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1427
1428 * ContentChecker.pm: Return the |class| node list.
1429
1430 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1431
1432 * ContentChecker.pm: Return the |id| node list.
1433
1434 * HTML.pm.src: A typo is fixed.
1435
1436 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1437
1438 * ContentChecker.pm: Drop wrong |level => 'error'| specification
1439 from "in HTML:xml:lang" error. Character position
1440 is now the last part of the error type in the URI error
1441 description. Report "unsupported" status for language
1442 tags, media queries, script codes, and style sheets.
1443
1444 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1445
1446 * ContentChecker.pm: Report error if |xml:lang|
1447 in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
1448 in XML.
1449
1450 * NanoDOM.pm (Attr.owner_document): New attribute.
1451
1452 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1453
1454 * HTML.pm.src: The character immediately following
1455 a bare |hcro| was discarded. Fix handling of
1456 entity references in attribute values.
1457
1458 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1459
1460 * HTML.pm.src (main and trailin end phases): Token
1461 types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
1462 factored out. Error types |in html:#DOCTYPE|
1463 and |after html:#DOCTYPE| are merged
1464 into |DOCTYPE in the middle|. |</frameset>|
1465 in fragment parsing mode changed the
1466 insertion mode.
1467
1468 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1469
1470 * HTML.pm.src: |$phase| has been removed; The |trailing end|
1471 phase is now an insertion mode. Treatments for white
1472 space character tokens were incorrect for some
1473 insertion modes. An old |meta| case was not removed.
1474
1475 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1476
1477 * HTML.pm.src: |meta| charset declaration extraction
1478 implemented (but changing the encoding is not yet:-).
1479
1480 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1481
1482 * Charset/: New directory.
1483
1484 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1485
1486 * H2H.pm: New Perl module (created from
1487 manakai's H2H.dis).
1488
1489 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1490
1491 * XMLSerializer.pm: New Perl module (created from
1492 manakai's SimpleLS.dis).
1493
1494 2007-07-07 Wakaba <wakaba@suika.fam.cx>
1495
1496 * HTML.pm.src: |<!---x-->| was not processed correctly.
1497
1498 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1499
1500 * HTML.pm.src: Report correct error message
1501 for |<body></div></body>|.
1502
1503 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1504
1505 * HTMLTable.pm: An error description was incorrect.
1506
1507 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1508
1509 * ContentChecker.pm: Return |{term}| list.
1510
1511 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1512
1513 * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
1514 nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
1515 and <title> outside of head).
1516
1517 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1518
1519 * IMTChecker.pm: Report warning for unregistered
1520 and private types/subtypes.
1521
1522 * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
1523 URIChecker.pm, HTMLTable.pm: Error messages are now
1524 consistent; they are all listed in
1525 <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
1526
1527 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1528
1529 * ContentChecker.pm: |<img ismap>| not in |<a></a>|
1530 is now erred. |<datalist>| is implemented.
1531 Attribute checker for |<command>| and |<menu>| are
1532 added. Support for |contextmenu| global attribute
1533 is added.
1534
1535 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1536
1537 * HTML.pm.src (_reset_insertion_mode): Interpretation
1538 of Step 3 has been changed.
1539
1540 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1541
1542 * HTML.pm.src: Late |<html>| parse error is implemented.
1543
1544 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1545
1546 * URIChecker.pm (check_iri_reference): A |decode| method name was
1547 incorrect.
1548
1549 * ContentChecker.pm: Support for the |footer| element.
1550 Check URI syntax for space-separated URI attributes.
1551 Support for the |tabindex| attribute. Support
1552 for |datetime| attribute.
1553
1554 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1555
1556 * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
1557 character references). HTML5 revision 1.146 (white space
1558 characters before root start tag). HTML5 revision
1559 1.148 (named character references in attribute values).
1560 HTML5 revision 1.152 (<plaintext>.innerHTML get).
1561
1562 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1563
1564 * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
1565 in <head>).
1566
1567 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1568
1569 * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>
1570 in body).
1571
1572 * ContentChecker.pm: HTML5 revision 938 (scoped="").
1573
1574 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1575
1576 * HTML.pm.src: HTML5 revision 923 (matching end tag in CDATA
1577 or RCDATA in fragment parsing mode). HTML5 revision
1578 924 (<!--> and <!--->). HTML5 revision 926 (hn in hn).
1579
1580 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1581
1582 * HTML.pm.src (get_inner_html): HTML5 revision 922 (inner_html
1583 for <pre> and <textarea>).
1584
1585 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1586
1587 * HTML.pm.src: HTML5 revision 920 (<isindex>).
1588
1589 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1590
1591 * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).
1592
1593 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1594
1595 * HTML.pm.src: HTML5 revision 916 (</body>).
1596 HTML5 revision 917 (conforming bare &).
1597
1598 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1599
1600 * NanoDOM.pm (manakai_is_html): Setting to false did not work.
1601
1602 * HTML.pm.src: HTML5 revision 914 (</ in CDATA, RCDATA).
1603 HTML5 revision 915 (<nobr>).
1604
1605 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1606
1607 * HTML.pm.src: HTML5 revisions 908, 909, 912, and 913 (quirks mode).
1608
1609 * NanoDOM.pm (manakai_is_html, manakai_compat_mode, compat_mode):
1610 New attributes.
1611
1612 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1613
1614 * HTML.pm.src: HTML5 revisions 900, 901, 902, and 911 (<
1615 in tags).
1616
1617 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1618
1619 * .cvsignore: |Entities.html| is added.
1620
1621 * HTML.pm.src: |$entity_char| is removed and
1622 requires |Whatpm::_NamedEntityList| instead.
1623 HTML5 revision 898 (refc), except that lack of refc
1624 is parse error.
1625
1626 * mkentitylist.pl: New script.
1627
1628 * Makefile (all): |_NamedEntityList.pm| is added.
1629 (_NamedEntityList.pm, Entities.html): New rules.
1630
1631 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1632
1633 * HTML.pm.src: Parse errors immediately after U+000D
1634 were ignored and U+000D immediately following another
1635 U+000D was not converted to U+000A.
1636
1637 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1638
1639 * HTML.pm.src (set_inner_html): HTML5 revision 892 (adopt
1640 nodes before appended). Parser was not ready for NULL
1641 parse error and escape flag.
1642
1643 * NanoDOM.pm (adopt_node): New.
1644
1645 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1646
1647 * HTML.pm.src: HTML5 revision 886 (insane comment in
1648 CDATA and RCDATA). Note that current implementation
1649 is simply repeating what the spec says and it is maybe not
1650 a best way to do it.
1651
1652 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1653
1654 * HTML.pm.src: HTML5 revision 884 (</form> don't close
1655 the form element if a descendant element without implied end tag has
1656 still been open).
1657
1658 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1659
1660 * ContentChecker.pm: HTML5 revision 881 (Make |id| attribute with
1661 space characters non-conforming).
1662
1663 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1664
1665 * HTML.pm.src: An error message was incorrect.
1666 HTML5 revision 869 (C1 character references).
1667
1668 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1669
1670 * HTML.pm.src: HTML5 revision 867 (a LF at the beginning of
1671 a |textarea| is removed).
1672
1673 2007-06-05 Wakaba <wakaba@suika.fam.cx>
1674
1675 * NanoDOM.pm (get_attribute_node_ns): New method.
1676
1677 * ContentChecker.pm: |script| |async| and |defer|
1678 no longer require |src|. |async| MUST NOT be
1679 specified if |defer|. (HTML5 revision 858).
1680
1681 2007-05-30 Wakaba <wakaba@suika.fam.cx>
1682
1683 * HTML.pm.src: |<form><form>| went to inifinite loop.
1684
1685 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1686
1687 * ContentChecker.pm (html): Set |is_root| (allowed
1688 as a document element) flag on.
1689 (new): Removed.
1690 (check_document): New method.
1691
1692 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1693
1694 * ContentChecker.pm (thead, tfoot): Checker specifications
1695 were incorrect.
1696
1697 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1698
1699 * ContentChecker.pm ($HTMLURIAttrChecker): Include
1700 error position in the |type| option of the error.
1701
1702 * HTMLTable.pm (form_table): The |$onerror| parameter
1703 is now optional. Some bugs are fixed.
1704
1705 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1706
1707 * HTMLTable.pm: New module.
1708
1709 * ContentChecker.pm (table): Invoke table model error checker.
1710
1711 * NanoDOM.pm (first_child, get_attribute_ns): New.
1712
1713 2007-05-26 Wakaba <wakaba@suika.fam.cx>
1714
1715 * ContentChecker.pm ($HTMLLinkTypesAttrChecker): New checker.
1716 (link/@rel, a/@rel, area/@rel): Use new checker.
1717
1718 * Makefile (_LinkTypeList.pm, RelExtensions.html): New rules.
1719
1720 * _LinkTypeList.pm: New file.
1721
1722 * mklinktypelist.pl: New file.
1723
1724 * .cvsignore: |RelExtensions.html| added.
1725
1726 * NanoDOM.pm (child_nodes): Returns an empty array
1727 for non-child-containing node types.
1728 (text_content): New attribute.
1729
1730 2007-05-26 Wakaba <wakaba@suika.fam.cx>
1731
1732 * IMTChecker.pm: New module.
1733
1734 * ContentChecker.pm ($HTMLIMTAttrChecker): Call IMTChecker
1735 to test parameter value validity.
1736
1737 * HTML.pm.src ($style_start_tag): Attributes were
1738 discarded.
1739
1740 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1741
1742 * ContentChecker.pm ($HTMLURIAttrChecker): Implemented.
1743
1744 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1745
1746 * URIChecker.pm: All recommendations from RFC 3986
1747 and RFC 3987 are listed (not all testable items are checked yet).
1748
1749 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1750
1751 * URIChecker.pm: New module.
1752
1753 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1754
1755 * ContentChecker.pm: Now most attributes are implemented
1756 or associated to some placeholder.
1757 ($ElementDefault): Warn unknown attributes
1758 for unknown elements as "attribute not supported".
1759 ($HTMLLanguageTagAttrChecker, $HTMLMQAttrChecker): New placeholders.
1760 ($HTMLUsemapAttrChecker, $HTMLTargetAttrChecker): New checkers.
1761 (|a| attribute checker): Reimplemented.
1762
1763 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1764
1765 * ContentChecker.pm ($HTMLEventHandlerAttrChecker): New placeholder.
1766 ($HTMLAttrChecker): Event handler content attributes
1767 are added.
1768 (link, embed): Required attribute is now checked.
1769 (embed): Unknown local attributes are no longer warned.
1770
1771 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1772
1773 * ContentChecker.pm ($HTMLSpaceURIsAttrChecker): New placeholder.
1774 ($HTMLIMTAttrChecker): New checker.
1775 (link@rel, link@href, link@type, style@type,
1776 a@href, a@ping, a@ping, a@type, embed@src, embed@type,
1777 object@data, object@type, source@src, source@type, area@alt,
1778 area@shape, area@coords, area@href,
1779 area@ping, area@rel, area@type, script@src,
1780 script@defer, script@async, script@type): Checkers added.
1781
1782 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1783
1784 * ContentChecker.pm: Descendant checking was incorrect.
1785
1786 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1787
1788 * ContentChecker.pm: Support |xml:*| and |xmlns:*|
1789 attributes. Report an error if |Element.prefix|
1790 is |xmlns|.
1791
1792 * NanoDOM.pm (prefix): New attribute.
1793
1794 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1795
1796 * HTML.pm.src: In |main| phase, |in body| insertion
1797 mode, action for |<iframe>| was missing.
1798
1799 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1800
1801 * ContentChecker.pm: Support for many of HTML5 elements.
1802 ($GetHTMLNonNegativeIntegerAttrChecker): New.
1803
1804 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1805
1806 * ContentChecker.pm: Support for most elements up to |progress|.
1807 ($HTMLURIAttrChecker): Placeholder.
1808 ($HTMLIntegerAttrChecker, $GetHTMLFloatingPointNumberAttrChecker):
1809 New.
1810
1811 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1812
1813 * ContentChecker.pm: Attribute checkers
1814 for global attributes, |html|, |base|, |style|, and |meta|.
1815
1816 * NanoDOM.pm (insert_before): Weaken reference
1817 to the parent node.
1818 (Attr::new): Set |owner_element| attribute.
1819 (namespace_uri, manakai_local_name): New attribute implementations.
1820 (owner_element): New attribute.
1821
1822 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1823
1824 * ContentChecker.pm ($AttrChecker, $HTMLAttrChecker,
1825 $AnyChecker->{attr_checker}, $HTMLAttrsChecker,
1826 $Element->{$HTML_NS}->{''}): New.
1827 (check_element): Invoke attrs_checker for each element.
1828
1829 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1830
1831 * ContentChecker.pm: Don't use |manakai_element_type_match|.
1832
1833 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1834
1835 * ContentChecker.pm: Use hashs rather than lists for
1836 element type testings.
1837
1838 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1839
1840 * ContentChecker.pm: Don't generate duplicate
1841 error when an element type is put in the "minus" list
1842 and the element type is not allowed explicitly in the particular
1843 element content model.
1844 (html:a checker): New checker.
1845 (html:details, html:datagrid): New checkers.
1846 (html:legend): New checker.
1847
1848 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1849
1850 * ContentChecker.pm (html:li checker): Implemented.
1851
1852 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1853
1854 * ContentChecker.pm ($HTMLInlineOrStriclyInlineChecker): New
1855 checker.
1856 (html:dd checker): New checker.
1857 (html:q, html:em, html:strong, html:small,
1858 html:m, html:dfn, html:code, html:samp, html:span): New checkers.
1859
1860 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1861
1862 * ContentChecker.pm ($AnyChecker): Renamed
1863 from |$ElementDefault->{checker}|.
1864 ($ElementDefault->{checker}): Throw an error that
1865 the element type is not supported by the checker.
1866 ($HTMLMetadataElement): |html:base| was missing.
1867 ($HTMLEmptyChecker): Don't throw an error
1868 for inter-element whitespace nodes.
1869 (html:html checker): Errors were not
1870 thrown even if |html:head| and/or |html:body|
1871 children were missing.
1872 (html:head checker): An error was not
1873 thrown if <meta charset> appered after other
1874 elements.
1875
1876 2007-05-05 Wakaba <wakaba@suika.fam.cx>
1877
1878 * ContentChecker.pm: |footer|, |video|, |audio|, |script|,
1879 and |noscript| elements are implemented.
1880 (new): New method.
1881
1882 2007-05-04 Wakaba <wakaba@suika.fam.cx>
1883
1884 * ContentChecker.pm: New module.
1885
1886 2007-05-04 Wakaba <wakaba@suika.fam.cx>
1887
1888 * NanoDOM.pm (manakai_parent_element,
1889 document_element, manakai_local_name,
1890 manakai_element_type_match): New method.
1891
1892 2007-05-03 Wakaba <wakaba@suika.fam.cx>
1893
1894 * HTML.pm.src: Replace decimal and hexadecimal numeric
1895 entities in C1 range using Windows-1252 mapping. Bare LF
1896 did not count as new line for error reporting.
1897
1898 2007-05-02 Wakaba <wakaba@suika.fam.cx>
1899
1900 * NanoDOM.pm (DOMImplementation): New class.
1901 (append_child): Weaken the |parent_node| reference.
1902 (create_element_ns, Element new): Set the |owner_document|
1903 reference.
1904 (implementation): New attribute.
1905 (owner_document, local_name, namespace_uri): New attributes.
1906
1907 * HTML.pm.src (parse_string): Line and column numbers
1908 are now provided to error handler.
1909 (!!!parse-error): Short descriptions are added.
1910 (_construct_tree): Split into three methods; support
1911 for innerHTML mode.
1912 (set_inner_html): New method.
1913
1914 2007-05-01 Wakaba <wakaba@suika.fam.cx>
1915
1916 * NanoDOM.html: Documentation is added.
1917
1918 * HTML.pod, ContentType.html: Documentation is revised.
1919
1920 * .cvsignore: Pod2html temporary files are added.
1921
1922 * Makefile: Make |NanoDOM.html|.
1923
1924 2007-05-01 Wakaba <wakaba@suika.fam.cx>
1925
1926 * HTML.pm.src (parse_string): New method.
1927 (get_inner_html): Renamed from |inner_html|.
1928
1929 * Makefile: A rule for |HTML.html| is added.
1930
1931 * HTML.pod: New documentation.
1932
1933 2007-05-01 Wakaba <wakaba@suika.fam.cx>
1934
1935 * NanoDOM.pm (last_child, previous_sibling): New attributes.
1936 (clone_node): Attribute nodes were not completely copied.
1937
1938 * HTML.pm.src: Many bugs are fixed.
1939
1940 2007-04-30 Wakaba <wakaba@suika.fam.cx>
1941
1942 * HTML.pm.src: Some typos are fixed.
1943
1944 2007-04-30 Wakaba <wakaba@suika.fam.cx>
1945
1946 * mkhtmlparser.pl, Makefile: References to the |HTML-consume-entity.src|
1947 are removed.
1948
1949 * HTML.pm.src: Tokenizer's handling on named entities are rewritten.
1950
1951 * HTML-consume-entity.src: Removed.
1952
1953 2007-04-30 Wakaba <wakaba@suika.fam.cx>
1954
1955 * HTML.pm.src: Tokenizer's handling on hexadecimal
1956 numeric entities are rewritten.
1957
1958 2007-04-30 Wakaba <wakaba@suika.fam.cx>
1959
1960 * HTML.pm.src: Some tokenizer bugs are fixed.
1961
1962 2007-04-30 Wakaba <wakaba@suika.fam.cx>
1963
1964 * HTML.pm.src: The tree construction stage is implemented.
1965
1966 * mkhtmlparser.pl: New macros are added.
1967
1968 2007-04-28 Wakaba <wakaba@suika.fam.cx>
1969
1970 * ContentType.pm: A note on bug in the specification
1971 is removed since it's been now fixed.
1972
1973 * .cvsignore: New file.
1974
1975 2007-04-28 Wakaba <wakaba@suika.fam.cx>
1976
1977 * HTML.pm.src, HTML-consume-entity.src: New files.
1978
1979 * Makefile (HTML.pm): New rule.
1980
1981 * mkhtmlparser.pl: New script.
1982
1983 2007-04-25 Wakaba <wakaba@suika.fam.cx>
1984
1985 * Makefile: New file.
1986
1987 2007-04-24 Wakaba <wakaba@suika.fam.cx>
1988
1989 * ContentType.pm: An error in pod is fixed.
1990
1991 2007-04-24 Wakaba <wakaba@suika.fam.cx>
1992
1993 * ContentType.pm: Documentation is added.
1994
1995 2007-04-24 Wakaba <wakaba@suika.fam.cx>
1996
1997 * ContentType.pm: New Perl module.
1998
1999 * ChangeLog: New file.
2000
2001

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24