Contents of /markup/html/whatpm/Whatpm/ChangeLog

Revision 1.350
Mon Oct 13 08:22:29 2008 UTC by wakaba
Branch: MAIN
Changes since 1.349: +4 -0 lines

1 2008-10-13 Wakaba <wakaba@suika.fam.cx>
2
3 * HTML.pm.src: Element category constants redefined.
4
5 2008-10-13 Wakaba <wakaba@suika.fam.cx>
6
7 * HTML.pm.src: Steps for CDATA/RCDATA elements in tree
8 construction stage synced with the spec (HTML5 revisions 2139 and
9 2302).
10
11 2008-10-07 Wakaba <wakaba@suika.fam.cx>
12
13 * ContentChecker.pm: New error level "html5_fact" added, which
14 should be tentatively used until all of the requirements are properly
15 specced as RFC 2119 "MUST" in HTML5.
16
17 2008-10-05 Wakaba <wakaba@suika.fam.cx>
18
19 * ContentChecker.pod: Note on internal flags is added.
20
21 2008-10-05 Wakaba <wakaba@suika.fam.cx>
22
23 * HTML.pm.src: An AAA bug fixed.
24
25 2008-10-04 Wakaba <wakaba@suika.fam.cx>
26
27 * HTML.pm.src: If another node is inserted by the parser, don't
28 reuse existing Text node to append a character (HTML5 revision
29 2124).
30
31 2008-10-04 Wakaba <wakaba@suika.fam.cx>
32
33 * HTML.pm.src: Support for <option> and <optgroup> in body (HTML5
34 revisions 1731 and 2128).
35
36 2008-10-04 Wakaba <wakaba@suika.fam.cx>
37
38 * HTML.pm.src: Make <foreignObject> scoping (HTML5 revision 1837).
39 Support for end tags of camelCase SVG elements was broken. A
40 wrong error type text fixed.
41
42 2008-10-04 Wakaba <wakaba@suika.fam.cx>
43
44 * HTML.pm.src: Drop redundant code (HTML5 revision 1731).
45
46 2008-10-04 Wakaba <wakaba@suika.fam.cx>
47
48 * HTML.pm.src: Support for new definition of |param| and |source|
49 start tag parsing (HTML5 revision 1731).
50
51 2008-10-04 Wakaba <wakaba@suika.fam.cx>
52
53 * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
54
55 2008-10-04 Wakaba <wakaba@suika.fam.cx>
56
57 * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
58 revisions 1731 and 1831).
59
60 2008-10-04 Wakaba <wakaba@suika.fam.cx>
61
62 * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
63 revisions 1731 and 1778). Support for the </sarcasm> end tag (HTML5
64 revision 1731).
65
66 2008-10-04 Wakaba <wakaba@suika.fam.cx>
67
68 * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
69 revision 1731). End tags of |option| and |optgroup| elements are
70 now optional (HTML5 revision 1731).
71
72 2008-10-04 Wakaba <wakaba@suika.fam.cx>
73
74 * HTML.pm.src: New "special" elements added to the list (HTML5
75 revision 1778). "strile" -> "strike".
76
77 2008-10-02 Wakaba <wakaba@suika.fam.cx>
78
79 * ContentType.pm (get_sniffed_type): Support for the "better"
80 content sniffing (HTML5 revision 1927). In some cases the official
81 type was not returned when the method was invoked in list
82 context.
83
84 2008-09-22 Wakaba <wakaba@suika.fam.cx>
85
86 * HTML.pm.src: Character references for non-space C0 characters,
87 including U+000B VT, DEL character, noncharacter code points, are
88 now converted to the U+FFFD character (cf. HTML5 revision 2138).
89
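The character reference change above can be sketched as follows. This is only a minimal illustration of one reading of the entry, not code taken from HTML.pm.src: the function name |charref_code_point| is hypothetical, the set of "space" C0 characters (U+0009, U+000A, U+000C, U+000D) is an assumption, and other remappings the parser may perform are left out.

```perl
use strict;
use warnings;

# Hypothetical helper: map the code point named by a numeric character
# reference to the code point that is actually emitted.
sub charref_code_point {
  my $cp = shift;
  # Assumed set of C0 characters that count as space characters.
  my %space = map { $_ => 1 } 0x09, 0x0A, 0x0C, 0x0D;
  return 0xFFFD if $cp <= 0x1F and not $space{$cp};   # non-space C0, incl. U+000B
  return 0xFFFD if $cp == 0x7F;                       # DEL
  return 0xFFFD if $cp >= 0xFDD0 and $cp <= 0xFDEF;   # noncharacters
  return 0xFFFD if ($cp & 0xFFFE) == 0xFFFE and $cp <= 0x10FFFF; # U+xxFFFE/F
  return $cp;
}

printf "U+%04X\n", charref_code_point(0x0B);   # U+FFFD
printf "U+%04X\n", charref_code_point(0x0A);   # U+000A
printf "U+%04X\n", charref_code_point(0x41);   # U+0041
```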
90 2008-09-21 Wakaba <wakaba@suika.fam.cx>
91
92 * ContentChecker.pm: |form=""| check support added.
93
94 2008-09-21 Wakaba <wakaba@suika.fam.cx>
95
96 * ContentChecker.pm: |contextmenu| validness is now checked using
97 |id| and |id_type| properties, and |menu| property is removed.
98
99 2008-09-21 Wakaba <wakaba@suika.fam.cx>
100
101 * ContentChecker.pm: Prepare for |form| |name| attribute's
102 duplication checking.
103
104 2008-09-21 Wakaba <wakaba@suika.fam.cx>
105
106 * HTML.pm.src (parse_byte_stream): Support (or non-support) for
107 unsupported charset="" parameter value (HTML5 revision 2131).
108
109 2008-09-20 Wakaba <wakaba@suika.fam.cx>
110
111 * HTML.pm.src: Remaining places where U+000B was allowed as a space
112 character are fixed (cf. HTML5 revision 1738).
113
114 * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
115 space characters (HTML5 revision 1738).
116
117 2008-09-20 Wakaba <wakaba@suika.fam.cx>
118
119 * HTML.pm.src: The "anything else" case for the "after after body"
120 insertion mode was not updated to switch to the "in body"
121 insertion mode. U+000B is no longer a space character for the
122 purpose of tree construction phase (HTML5 revision 1738).
123
124 2008-09-20 Wakaba <wakaba@suika.fam.cx>
125
126 * HTML.pm.src: U+000B is no longer a space character (HTML5
127 revision 1738).
128
129 2008-09-20 Wakaba <wakaba@suika.fam.cx>
130
131 * ContentType.pm: 0x0B is no longer a space character (HTML5
132 revision 1738).
133
134 * HTML.pm.src: U+000B is no longer a space character for the
135 algorithm for extracting an encoding from a Content-Type (HTML5
136 revision 1738).
137
138 2008-09-20 Wakaba <wakaba@suika.fam.cx>
139
140 * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
141
142 2008-09-18 Wakaba <wakaba@suika.fam.cx>
143
144 * LangTag.pm: Add checks for remaining requirements from RFC 4646.
145
146 * mklangreg.pl: Sort 'Prefix' values by their length, to ease
147 matching.
148
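One plausible reason that sorting 'Prefix' values by length eases matching is that the most specific prefix can then be found with a simple first-match loop. The sketch below assumes a longest-first order (the entry does not state the direction), and both |sort_prefixes| and |longest_prefix| are hypothetical names rather than functions from mklangreg.pl or LangTag.pm.

```perl
use strict;
use warnings;

# Sort prefixes longest first; ties are broken alphabetically so the
# order is stable across runs.
sub sort_prefixes {
  return sort { length($b) <=> length($a) or $a cmp $b } @_;
}

# Return the first (that is, longest) prefix that matches the tag on a
# subtag boundary, or undef if none matches.  Comparison is
# case-insensitive.
sub longest_prefix {
  my ($tag, @sorted) = @_;
  my $lc_tag = lc $tag;
  for my $prefix (@sorted) {
    my $lc_prefix = lc $prefix;
    return $prefix
        if $lc_tag eq $lc_prefix or index($lc_tag, "$lc_prefix-") == 0;
  }
  return undef;
}

my @sorted = sort_prefixes('zh', 'zh-Hant', 'zh-Hans-CN', 'sr-Latn');
print longest_prefix('zh-Hant-TW', @sorted), "\n";   # zh-Hant
```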
149 2008-09-18 Wakaba <wakaba@suika.fam.cx>
150
151 * LangTag.pm: Warn for private use language subtags. Error level
152 typos fixed. Support for Suppress-Script field.
153
154 * mklangreg.pl: Support for dumping of nested structure.
155
156 2008-09-18 Wakaba <wakaba@suika.fam.cx>
157
158 * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
159 recommended case as per RFC 4646.
160
161 2008-09-18 Wakaba <wakaba@suika.fam.cx>
162
163 * LangTag.pm (check_rfc4646_langtag): New method.
164
165 2008-09-18 Wakaba <wakaba@suika.fam.cx>
166
167 * mklangreg.pl: New script.
168
169 * Makefile: Updated for creation of the module for language subtag
170 registry.
171
172 2008-09-16 Wakaba <wakaba@suika.fam.cx>
173
174 * Makefile: WebIDL.html added.
175
176 * WebIDL.pod: New documentation.
177
178 2008-09-16 Wakaba <wakaba@suika.fam.cx>
179
180 * WebIDL.pm: Checker's error types are redefined.
181
182 2008-09-16 Wakaba <wakaba@suika.fam.cx>
183
184 * WebIDL.pm: Parser's error types are redefined. Some forward
185 compatible parsing bugs are fixed. Some unreachable code is
186 commented out.
187
188 2008-09-16 Wakaba <wakaba@suika.fam.cx>
189
190 * WebIDL.pm: Support for the remaining extended attributes is
191 added. It does not satisfy the definition that a forward
192 interface declaration has an extended attribute. It seems that,
193 unless explicitly allowed, multiple extended attributes with the
194 same name are not allowed, though it is not explicitly mentioned in
195 the spec.
196
197 2008-09-16 Wakaba <wakaba@suika.fam.cx>
198
199 * WebIDL.pm: Unescapes extended attribute names and extended
200 attribute identifiers. Preserve whether an extended attribute has
201 an argument list or not. Support for extended attributes:
202 Constructor, ExceptionConsts, IndexGetter, IndexSetter,
203 NameGetter, NameSetter, and Null.
204 (has_argument_list): New attribute.
205 (idl_text): Stringifies argument lists, if any, even if it is
206 empty.
207
208 2008-09-15 Wakaba <wakaba@suika.fam.cx>
209
210 * HTML.pm.src: New state |PCDATA_STATE|. Use an empty string for
211 |{s_kwd}| in DATA_STATE as default.
212
213 2008-09-15 Wakaba <wakaba@suika.fam.cx>
214
215 * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
216 by |{s_kwd}| in DATA_STATE.
217
218 2008-09-15 Wakaba <wakaba@suika.fam.cx>
219
220 * HTML.pm.src: Shorten keys.
221
222 2008-09-15 Wakaba <wakaba@suika.fam.cx>
223
224 * HTML.pm.src: Remove checking for control character, surrogate
225 pair, or noncharacter code points and non-Unicode code
226 points (they should be handled by Whatpm::Charset::UnicodeChecker).
227 (parse_char_stream): Support for the |$get_wrapper| argument and
228 character stream error handlers.
229
230 2008-09-15 Wakaba <wakaba@suika.fam.cx>
231
232 * ContentChecker.pm: Don't call |load_ns_module|
233 for null-namespace elements/attributes.
234
235 * HTML.pm.src: Factor out $disallowed_control_chars
236 as a hash.
237
238 2008-09-14 Wakaba <wakaba@suika.fam.cx>
239
240 * HTML.pm.src: Regexp typo fixed. |{prev_char}|
241 and |{next_char}| initializations are moved to initialization
242 method. |{read_until}| now supports buffering. Sync |set_inner_html|
243 with |parse_char_stream|.
244
245 2008-09-14 Wakaba <wakaba@suika.fam.cx>
246
247 * HTML.pm.src (parse_char_stream): Make |set_next_char|
248 invoke |manakai_read_until|, not only |read|, where
249 possible, to decrease the number of |read| method calls.
250
251 * mkhtmlparser.pl: Related changes to the aforementioned
252 modification.
253
254 2008-09-14 Wakaba <wakaba@suika.fam.cx>
255
256 * HTML.pm.src: Use |read| instead of |getc|. |set_inner_html|
257 now reports character errors.
258
259 2008-09-14 Wakaba <wakaba@suika.fam.cx>
260
261 * HTML.pm.src: Non-white-space character tokens with leading
262 white space in the "before head" insertion mode were not
263 correctly handled.
264 (set_inner_html): Reimplemented using CharString decodehandle
265 class. Support for $get_wrapper argument. Support
266 for |{read_until}| feature.
267
268 2008-09-14 Wakaba <wakaba@suika.fam.cx>
269
270 * HTML.pm.src: Make a "bare ero" error for unknown
271 entities point the "&" character.
272
273 2008-09-14 Wakaba <wakaba@suika.fam.cx>
274
275 * HTML.pm.src: It turns out that U+FFFD doesn't have to
276 be added to the list of excluded characters.
277
278 2008-09-14 Wakaba <wakaba@suika.fam.cx>
279
280 * HTML.pm.src ($char_onerror): Give the character decoder's |line|
281 and |column| a higher priority than those set by the
282 tokenizer's input handler.
283 ($self->{read_until}): Exclude U+FFFD (but this might
284 not be necessary, since now we do line/column fixup in
285 the character decode handle).
286
287 2008-09-14 Wakaba <wakaba@suika.fam.cx>
288
289 * HTML.pm.src: Use |{read_until}| where possible.
290
291 2008-09-14 Wakaba <wakaba@suika.fam.cx>
292
293 * HTML.pm.src: Change |{getc_until}| to |{read_until}|
294 and |manakai_getc_until| to |manakai_read_until| to
295 reduce the number of string copies.
296
297 2008-09-14 Wakaba <wakaba@suika.fam.cx>
298
299 * HTML.pm.src (parse_char_string): Use newly created
300 |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
301 standard feature to |open| a string as a filehandle,
302 since Perl's string filehandle seems not to support the |ungetc|
303 method correctly.
304 (parse_char_stream): Define |{getc_until}| method.
305 (DATA_STATE): Experimental support for |getc_until| feature.
306
307 2008-09-13 Wakaba <wakaba@suika.fam.cx>
308
309 * HTML.pm.src: Check points added to newly added branches.
310
311 2008-09-13 Wakaba <wakaba@suika.fam.cx>
312
313 * HTML.pm.src: Remove |{char}|, which is no longer used.
314 Remove |{entity_in_attr}| and |{last_attribute_value_state}|,
315 which are replaced by |{prev_state}|.
316
317 * mkhtmlparser.pl: Remove |{char}| feature.
318 Remove |!!!back-next-input-character;| macro.
319
320 2008-09-13 Wakaba <wakaba@suika.fam.cx>
321
322 * HTML.pm.src: Finally we get rid of all the inner loops. Remove
323 entity related tokenizer states in favor of new states
324 implementing the consume character reference algorithm.
325
326 2008-09-13 Wakaba <wakaba@suika.fam.cx>
327
328 * HTML.pm.src: "Consume a character reference" algorithm is
329 now implemented as a tokenizer's state, rather than
330 a method, with minimum changes (more changes will
331 be made, in due course). "Bogus comment state"'s inner
332 loop gets removed.
333
334 2008-09-13 Wakaba <wakaba@suika.fam.cx>
335
336 * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
337 into their own tokenizer states.
338
339 2008-09-13 Wakaba <wakaba@suika.fam.cx>
340
341 * HTML.pm.src: |CDATA_SECTION_STATE| (formerly |CDATA_BLOCK_STATE|)
342 is split into three states.
343
344 2008-09-13 Wakaba <wakaba@suika.fam.cx>
345
346 * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
347 itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
348 no longer does the tokenizer have to push back next input
349 characters in those states.
350
351 2008-09-13 Wakaba <wakaba@suika.fam.cx>
352
353 * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
354 into four states so that no longer does the tokenizer have to push
355 back next input characters in that state.
356
357 2008-09-11 Wakaba <wakaba@suika.fam.cx>
358
359 * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
360 which can be used to insert some wrapper between the character
361 stream handle and the tokenizer. (It is currently not supported
362 for |set_inner_html| for |Element|s).
363
364 2008-09-10 Wakaba <wakaba@suika.fam.cx>
365
366 * HTML.pm.src: Ignore punctuation in charset names.
367
368 2008-09-10 Wakaba <wakaba@suika.fam.cx>
369
370 * ContentChecker.pm: Support for charset-layer error levels.
371
372 * HTML.pm.src: Don't specify |text| argument for the
373 |chardecode:fallback| error, since it is not the encoding
374 being used alternatively.
375
376 2008-09-06 Wakaba <wakaba@suika.fam.cx>
377
378 * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
379
380 2008-08-31 Wakaba <wakaba@suika.fam.cx>
381
382 * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
383
384 2008-08-31 Wakaba <wakaba@suika.fam.cx>
385
386 * HTML.pm.src: Bug fix and sync with the spec with regard
387 to after after frameset insertion mode processing (HTML5
388 revision 1909). Note that the implementation was wrong
389 per the old spec before the r1909 changes.
390
391 2008-08-30 Wakaba <wakaba@suika.fam.cx>
392
393 * HTMLTable.pm: scope=auto algorithm fix synced with the
394 spec (HTML5 revision 2093).
395 ($process_row): Algorithm step numbers synced with the
396 spec (HTML5 revision 2092).
397
398 2008-08-30 Wakaba <wakaba@suika.fam.cx>
399
400 * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
401 revision 2094).
402
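The difference between the two Unicode properties named in the entry above can be checked directly with Perl's regular expressions. This is only an illustration of the properties themselves; it does not show how HTMLTable.pm uses them, and the helper names are hypothetical.

```perl
use strict;
use warnings;

# Zs (Space_Separator) is a general category; White_Space is a
# broader binary property that also covers TAB, LF, CR and so on.
sub is_zs          { return $_[0] =~ /\A\p{Zs}\z/ ? 1 : 0 }
sub is_white_space { return $_[0] =~ /\A\p{White_Space}\z/ ? 1 : 0 }

for my $char ("\x{0020}", "\x{0009}", "\x{000A}", "\x{3000}") {
  printf "U+%04X Zs=%d White_Space=%d\n",
      ord $char, is_zs($char), is_white_space($char);
}
```

U+0009 and U+000A are reported as White_Space but not Zs, while U+0020 and U+3000 match both.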
403 2008-08-30 Wakaba <wakaba@suika.fam.cx>
404
405 * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
406
407 2008-08-30 Wakaba <wakaba@suika.fam.cx>
408
409 * HTML.pm.src: '"' and "'" at the end of attribute
410 name (after another attribute) now raise parse error (HTML5
411 revision 2123). Empty unquoted attribute values are no
412 longer allowed (HTML5 revision 2122).
413
414 2008-08-30 Wakaba <wakaba@suika.fam.cx>
415
416 * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
417 revision 2130).
418
419 2008-08-30 Wakaba <wakaba@suika.fam.cx>
420
421 * ContentChecker.pm: |xml:lang| attribute value must be the same
422 as |lang| attribute value for HTML elements (HTML5 revision 2062
423 and so on).
424
425 2008-08-30 Wakaba <wakaba@suika.fam.cx>
426
427 * ContentChecker.pm: Error level definition for |xml_id_error|
428 was missing.
429
430 * URIChecker.pm: The end of the URL should be marked as the
431 error location for an empty path error. The position
432 between the userinfo and the port components should be
433 marked as the error location for an empty host error.
434
435 2008-08-30 Wakaba <wakaba@suika.fam.cx>
436
437 * URIChecker.pm: Set parameters representing where in the
438 value the error occurs for errors. Report unknown
439 address format error in warning level, since address
440 formats are rarely added. Path segments starting with "/.."
441 were misinterpreted as a dot-segment.
442
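The dot-segment fix above concerns paths such as "/a/..b/c", where a segment merely begins with "..". A minimal sketch of a check that avoids the misinterpretation is to compare whole segments rather than match a "/.." substring. The function name |has_dot_segment| is hypothetical and is not taken from URIChecker.pm.

```perl
use strict;
use warnings;

# Count path segments that are exactly "." or "..".  Splitting on "/"
# first means "..b" or ".hidden" are never mistaken for dot-segments.
sub has_dot_segment {
  my $path = shift;
  return scalar grep { $_ eq '.' or $_ eq '..' } split m{/}, $path, -1;
}

print has_dot_segment('/a/../b')  ? "yes\n" : "no\n";   # yes
print has_dot_segment('/a/..b/c') ? "yes\n" : "no\n";   # no
```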
443 2008-08-30 Wakaba <wakaba@suika.fam.cx>
444
445 * URIChecker.pm (check_iri_reference): Requires
446 |Message::DOM::DOMImplementation|.
447
448 2008-08-29 Wakaba <wakaba@suika.fam.cx>
449
450 * IMTChecker.pm: Updated for the new error reporting architecture.
451
452 * ContentChecker.pm: Error levels for IMTs are added.
453
454 2008-08-17 Wakaba <wakaba@suika.fam.cx>
455
456 * H2H.pm (_shift_token): Support for unquoted HTML attribute
457 values.
458
459 2008-08-16 Wakaba <wakaba@suika.fam.cx>
460
461 * CacheManifest.pm: Support for new style of error
462 reports.
463
464 * HTML.pm.src: Set line=1, column=1 to the document node.
465
466 2008-08-16 Wakaba <wakaba@suika.fam.cx>
467
468 * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
469 and URL checkers. Support for more error levels for bogus
470 language tag and URL "standards".
471
472 * LangTag.pm, URIChecker.pm: Support for new style error
473 level reporting.
474
475 2008-08-15 Wakaba <wakaba@suika.fam.cx>
476
477 * ContentChecker.pm: Support for RDF/XML error levels.
478
479 * HTMLTable.pm, RDFXML.pm: Support for new style of error level
480 specifying. Error types are revised.
481
482 2008-08-15 Wakaba <wakaba@suika.fam.cx>
483
484 * ContentChecker.pm: All error reporting method calls are
485 renewed.
486
487 2008-08-15 Wakaba <wakaba@suika.fam.cx>
488
489 * HTML.pm.src: All error type names and "text" parameters
490 are revised. Use new style for "level" specification.
491
492 * mkhtmlparser.pl: Use new style for "level" specification.
493
494 2008-08-03 Wakaba <wakaba@suika.fam.cx>
495
496 * WebIDL.pm (parse_char_string): Simplified error
497 reporting process for broken ignored valuetype definition.
498 (Valuetype idl_text): Support for special "DOMString" name.
499
500 2008-08-03 Wakaba <wakaba@suika.fam.cx>
501
502 * WebIDL.pm ($get_scoped_name): Append "::::" if the last
503 terminal of the ScopedName is "DOMString", so that whether
504 the last part of the scoped name is "DOMString" or "_DOMString"
505 can be determined later. It is necessary to determine whether a |typedef|
506 definition should be ignored or not.
507 (parse_char_string): Unescape the identifier of
508 exception members.
509 ($resolve): Return undef for builtin types and sequence<T>
510 types (we might not have to do this, however...).
511 (check): Support checking for Exceptions, Valuetypes,
512 and Typedefs.
513 ($serialize_type): Support for "DOMString::::" syntax.
514 (Typedef idl_text): Output Type as "DOMString" if it
515 is really "DOMString" (i.e. its internal representation
516 is "::DOMString::").
517
518 2008-08-03 Wakaba <wakaba@suika.fam.cx>
519
520 * WebIDL.pm ($resolve): New code, based on resolve code
521 for constant types in the |check| method.
522 (check): Support for checking of attributes, operations, and
523 arguments.
524 (Attribute/Operation idl_text): Exception names in getraises,
525 setraises, and raises clauses are serialized by |$serialize_type|
526 code.
527
528 2008-08-02 Wakaba <wakaba@suika.fam.cx>
529
530 * WebIDL.pm ($integer): Order of alternatives is changed to match
531 hexadecimal numbers (the original pattern, taken from the spec,
532 did not work for hexadecimal numbers, because the "0" prefix
533 matches the [0-7]* part (as an empty string) and therefore
534 it does not match the remaining "x..." part of a "0x..." integer
535 literal).
536 ($get_type): It now returns a string, not an array reference,
537 for regular types and |sequence| types (i.e. it in any case
538 returns a string).
539 ($get_next_token): The second item in the array that represents
540 an integer or float token is now a Perl number value, not the
541 original string representation of the number.
542 (check): Support for const value consistency checking.
543 No extended attribute is defined for constants.
544 (Node subclasses): Use simple strings rather than array references
545 for default data type values.
546 ($serialize_type): Type values are now simple strings.
547 (value): If the new attribute value is a false value, then
548 a FALSE value is set to the attribute.
549
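The $integer ordering problem described above comes from Perl trying alternatives left to right and accepting the first one that succeeds. The two patterns below are reconstructed for illustration from the description in the entry; they are assumptions, not the patterns in WebIDL.pm or the spec text, and |first_token| is a hypothetical helper.

```perl
use strict;
use warnings;

# Octal alternative first: on "0x1F" the "0" matches, [0-7]* matches
# the empty string, and the token ends before the "x".
my $octal_first = qr/-?(?:0(?:[0-7]*|[Xx][0-9A-Fa-f]+)|[1-9][0-9]*)/;

# Hexadecimal alternative first: "0x..." is tried before plain octal.
my $hex_first = qr/-?(?:0(?:[Xx][0-9A-Fa-f]+|[0-7]*)|[1-9][0-9]*)/;

# Return the integer token found at the start of the input, if any.
sub first_token {
  my ($pattern, $input) = @_;
  return $input =~ /\A($pattern)/ ? $1 : undef;
}

print first_token($octal_first, '0x1F'), "\n";   # 0
print first_token($hex_first,   '0x1F'), "\n";   # 0x1F
print first_token($hex_first,   '017'),  "\n";   # 017
```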
550 2008-08-02 Wakaba <wakaba@suika.fam.cx>
551
552 * WebIDL.pm ($get_scoped_name): Now scoped names are stored
553 in its stringified format ("scoped name" as defined in the
554 spec). Note that future version of this module should not use
555 array references for type values and the |type_text| attribute
556 should be made obsolete.
557 (parse_char_string): Unescape attribute names.
558 (check): Support for checking of whether inherited interfaces
559 are actually defined or not. Support for checking of whether
560 interface member identifiers are duplicated or not.
561 ($serialize_type): Scoped names are returned as is. A future
562 version of this code should escape identifiers other than "DOMString",
563 otherwise the idl_text would be non-conforming.
564
565 2008-08-02 Wakaba <wakaba@suika.fam.cx>
566
567 * WebIDL.pm (parse_char_string): Set line/column numbers
568 to generated nodes. Unescape identifiers. Extended attributes
569 for Definition's were ignored.
570 (append_child): Set |parent_node| attribute.
571 (parent_node): New attribute.
572 (check): Support interface/exception members. Support
573 extended attributes. Support definition identifier uniqueness
574 constraint.
575 (qualified_name): New attribute.
576 (Interface/Exception idl_text): Extended attributes were
577 not prepended to the returned text.
578
579 2008-08-02 Wakaba <wakaba@suika.fam.cx>
580
581 * WebIDL.pm (parse_char_string): Set line/column numbers
582 to interface object experimentally. s/shift/pop/g, shift
583 would make things wrong. Support for interface forward
584 declarations was missing. Broken interface declarations
585 with no block were not ignored entirely.
586 (Whatpm::WebIDL::Node): New abstract class. This class
587 makes things easier.
588 (child_nodes): New attribute. Unlike DOM's attribute with
589 same name, this attribute returns a dead list of nodes for
590 simplicity.
591 (get_user_data, set_user_data): New methods.
592 (Module idl_text): A SPACE character should be inserted
593 before the |{| character.
594 (Interface idl_text): Support for interface forward declarations.
595 (is_forward_declaration): New attribute.
596
597 2008-07-19 Wakaba <wakaba@suika.fam.cx>
598
599 * WebIDL.pm (type_text): Better serializer.
600
601 2008-07-19 Wakaba <wakaba@suika.fam.cx>
602
603 * WebIDL.pm: Revise forward-compatible parsing so that
604 it now can handle broken extended attributes and the like.
605
606 2008-07-19 Wakaba <wakaba@suika.fam.cx>
607
608 * WebIDL.pm: Real support for extended attributes.
609 Support for extended attributes with arguments.
610
611 2008-07-19 Wakaba <wakaba@suika.fam.cx>
612
613 * WebIDL.pm: Support for |exception| syntax.
614 (Interface->idl_text): Tentative support for inheritances.
615
616 2008-07-19 Wakaba <wakaba@suika.fam.cx>
617
618 * WebIDL.pm: Hierarchical scoped name support was broken.
619 Support for raises, setraises, and getraises syntaxes.
620
621 2008-07-18 Wakaba <wakaba@suika.fam.cx>
622
623 * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
624 proper support for types, extended attributes, and exceptions yet).
625 WebIDL parser, version 1 (no support for exceptions yet,
626 no proper support for extended attributes yet).
627
628 2008-07-09 Wakaba <wakaba@suika.fam.cx>
629
630 * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
631
632 2008-06-29 Wakaba <wakaba@suika.fam.cx>
633
634 * WebIDL.pm: Support for valuetype and const.
635
636 2008-06-29 Wakaba <wakaba@suika.fam.cx>
637
638 * WebIDL.pm: New module.
639
640 2008-06-15 Wakaba <wakaba@suika.fam.cx>
641
642 * Makefile (Entities.html): URI changed.
643
644 2008-06-08 Wakaba <wakaba@suika.fam.cx>
645
646 * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
647
648 2008-06-01 Wakaba <wakaba@suika.fam.cx>
649
650 * HTML.pm.src (_get_next_token): A parse error was missing.
651
652 2008-06-01 Wakaba <wakaba@suika.fam.cx>
653
654 * mklinktypelist.pl: rel=contact is no longer part of the HTML5
655 spec (commented out). (HTML5 revision 1711).
656
657 2008-05-25 Wakaba <wakaba@suika.fam.cx>
658
659 * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
660
661 * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
662 as UTF-16 (HTML5 revision 1701).
663
664 2008-05-25 Wakaba <wakaba@suika.fam.cx>
665
666 * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
667 1692).
668
669 2008-05-25 Wakaba <wakaba@suika.fam.cx>
670
671 * HTML.pm.src: The secondary insertion mode used when switching
672 to foreign content is the "in body" insertion mode (HTML5 revision
673 1696).
674
675 2008-05-25 Wakaba <wakaba@suika.fam.cx>
676
677 * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
678 revision 1697).
679
680 2008-05-24 Wakaba <wakaba@suika.fam.cx>
681
682 * HTML.pm.src: Support for end-of-file token in foreign content
683 insertion mode (HTML5 revision 1693). Update SVG camelCase
684 attribute list (HTML5 revision 1700). <textarea> closes
685 </select> (HTML5 revision 1699). More start tags close in
686 foreign content insertion mode (HTML5 revision 1698).
687
688 2008-05-24 Wakaba <wakaba@suika.fam.cx>
689
690 * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
691
692 2008-05-24 Wakaba <wakaba@suika.fam.cx>
693
694 * HTML.pm.src: More robust charset parameter detection (HTML5
695 revision 1674).
696
697 2008-05-24 Wakaba <wakaba@suika.fam.cx>
698
699 * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
700 revision 1676).
701
702 2008-05-24 Wakaba <wakaba@suika.fam.cx>
703
704 * HTML.pm.src: Ignore language part of public identifiers for
705 quirks mode detection (HTML5 revision 1679).
706
707 2008-05-24 Wakaba <wakaba@suika.fam.cx>
708
709 * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
710 revision 1685).
711
712 2008-05-24 Wakaba <wakaba@suika.fam.cx>
713
714 * HTML.pm.src: Support for EOF in new states for tags (HTML5
715 revision 1684).
716
717 2008-05-24 Wakaba <wakaba@suika.fam.cx>
718
719 * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
720 work (HTML5 revision 1690).
721
722 2008-05-24 Wakaba <wakaba@suika.fam.cx>
723
724 * HTML.pm.src (_tree_construction_main): Change handling of
725 end tags in head insertion modes (HTML5 revision 1686).
726 (parse_char_string): Bug fix for non-utf8 character string handling.
727 (parse_char_stream): |ungetc| does not work well for this context.
728
729 2008-05-18 Wakaba <wakaba@suika.fam.cx>
730
731 * HTML.pm.src (parse_byte_string): Redefined to invoke
732 |parse_byte_stream|.
733 (parse_byte_stream): New method.
734
735 2008-05-18 Wakaba <wakaba@suika.fam.cx>
736
737 * HTML.pm.src (parse_byte_string): Fix the column number reported
738 by encoding layer error reporter.
739
740 2008-05-17 Wakaba <wakaba@suika.fam.cx>
741
742 * HTML.pm.src (parse_byte_string): Use streaming decoder
743 rather than converting the whole byte string and then parsing.
744 Propagate errors in character encoding layer.
745 (get_next_token): Precise error reporting for |bare stago| error.
746
747 2008-05-17 Wakaba <wakaba@suika.fam.cx>
748
749 * HTML.pm.src (parse_char_stream): New method.
750 (parse_char_string): This method is now defined as an invocation
751 of the |parse_char_stream| method.
752
753 2008-05-17 Wakaba <wakaba@suika.fam.cx>
754
755 * HTML.pm.src (parse_byte_string): Report various status
756 of the sniffing as info-level errors. Support for new
757 decoding framework in parser resetting.
758 (new): Various default error levels were not set.
759
760 2008-05-17 Wakaba <wakaba@suika.fam.cx>
761
762 * HTML.pm.src (parse_byte_string): HTML5 encoding sniffing
763 algorithm, except for the actual sniffing, is implemented
764 with new framework with Message::Charset::Info.
765
766 2008-05-16 Wakaba <wakaba@suika.fam.cx>
767
768 * CacheManifest.pm (_parse): Drop fragment identifiers from
769 URIs in fallback section (HTML5 revision 1596).
770
771 2008-05-10 Wakaba <wakaba@suika.fam.cx>
772
773 * Makefile (Entities.html): URI has changed.
774
775 2008-05-10 Wakaba <wakaba@suika.fam.cx>
776
777 * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
778 1553).
779
780 2008-05-06 Wakaba <wakaba@suika.fam.cx>
781
782 * ContentChecker.pm: Noted that those returned in |table| are
783 no longer table elements, but table objects returned
784 by Whatpm::HTMLTable.
785
786 * HTMLTable.pm (form_table): Return table element node
787 as |$table->{element}|.
788 (assign_header): Support for the |headers=""| attribute.
789
790 2008-05-06 Wakaba <wakaba@suika.fam.cx>
791
792 * HTMLTable.pm (assign_header): New function; first version
793 with no support for headers="".
794 (form_table): Include table width and height to the returned
795 table object for convenience. Indexing in column assignment was wrong.
796 Set whether a data cell is empty or not for convenience.
797
798 2008-05-05 Wakaba <wakaba@suika.fam.cx>
799
800 * HTMLTable.pm: More robust caption support (HTML5 revision 1393).
801
802 2008-05-05 Wakaba <wakaba@suika.fam.cx>
803
804 * HTMLTable.pm: How table model errors are detected is
805 changed (HTML5 revision 1387).
806
807 2008-05-05 Wakaba <wakaba@suika.fam.cx>
808
809 * HTMLTable.pm: The algorithm now moves |tfoot| elements
810 to the end of the table (HTML5 revision 1380).
811
812 2008-05-05 Wakaba <wakaba@suika.fam.cx>
813
814 * HTMLTable.pm: The algorithm is now 0-based indexing, instead
815 of 1-based (HTML5 revision 1376).
816
817 2008-05-05 Wakaba <wakaba@suika.fam.cx>
818
819 * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
820 and "Content-Encoding" no longer prevent sniffing (HTML5
821 revision 1288).
822
823 2008-05-05 Wakaba <wakaba@suika.fam.cx>
824
825 * ContentType.pm: Skip BOMs in feed or HTML algorithm (HTML5 revision
826 1282).
827
828 2008-05-03 Wakaba <wakaba@suika.fam.cx>
829
830 * ContentChecker.pm: Support for global attributes.
831 Status of XML specs are added.
832
833 2008-05-03 Wakaba <wakaba@suika.fam.cx>
834
835 * ContentChecker.pm (check_element): Support for |template=""|
836 and |ref=""| attribute (referent element type checking).
837
838 2008-04-29 Wakaba <wakaba@suika.fam.cx>
839
840 * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
841 1500) is implemented (except for IDNA part and URI-scheme-specific
842 knowledge). Line number counting was wrong for LF-only
843 documents.
844
845 2008-04-13 Wakaba <wakaba@suika.fam.cx>
846
847 * HTML.pm.src: Raise a parse error for any disallowed
848 character (HTML5 revision 1263).
849
850 2008-04-13 Wakaba <wakaba@suika.fam.cx>
851
852 * mkentitylist.pl: Support for new HTML5 entity table format (the
853 definition for |AElig;| was missing).
854
855 2008-04-13 Wakaba <wakaba@suika.fam.cx>
856
857 * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
858 name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
859
860 2008-04-13 Wakaba <wakaba@suika.fam.cx>
861
862 * HTML.pm.src: List of element names that close foreign content
863 insertion mode is added (HTML5 revisions 1412 and 1418).
864
865 2008-04-13 Wakaba <wakaba@suika.fam.cx>
866
867 * HTML.pm.src: Support for |mglyph| and |malignmark|
868 elements (HTML5 revision 1410).
869
870 2008-04-13 Wakaba <wakaba@suika.fam.cx>
871
872 * HTML.pm.src: Support for new long MathML entities (HTML5
873 revision 1406).
874
875 2008-04-13 Wakaba <wakaba@suika.fam.cx>
876
877 * HTML.pm.src: CDATA section support for MathML and SVG
878 elements (HTML5 revisions 1404 and 1420).
879
880 2008-04-12 Wakaba <wakaba@suika.fam.cx>
881
882 * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
883 elements (HTML5 revision 1404). Unused !!!macro definitions
884 are removed.
885
886 2008-04-12 Wakaba <wakaba@suika.fam.cx>
887
888 * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
889 are raised is changed (HTML5 revision 1404).
890
891 2008-04-06 Wakaba <wakaba@suika.fam.cx>
892
893 * HTML.pm.src: Category sets that are no longer used are removed.
894
895 2008-04-06 Wakaba <wakaba@suika.fam.cx>
896
897 * HTML.pm.src: The ->[1] property of stack entries is now
898 replaced by constants representing element category.
899
900 2008-04-06 Wakaba <wakaba@suika.fam.cx>
901
902 * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
903 for error reporting. (This is a preparation for using constant
904 value for ->[1].)
905
906 2008-03-22 Wakaba <wakaba@suika.fam.cx>
907
908 * RDFXML.pm: Typo fixed.
909
910 2008-03-22 Wakaba <wakaba@suika.fam.cx>
911
912 * ContentChecker.pm: |fact_level| is now treated
913 the same as |must_level|, i.e. level = |m|.
914 (check_element): Make list of URIs in the DOM.
915
916 2008-03-21 Wakaba <wakaba@suika.fam.cx>
917
918 * RDFXML.pm: Language accessor implemented.
919 Local (null-namespace) attribute support.
920
921 2008-03-21 Wakaba <wakaba@suika.fam.cx>
922
923 * RDFXML.pm: Factored out ID checking code.
924
925 2008-03-21 Wakaba <wakaba@suika.fam.cx>
926
927 * RDFXML.pm: TODO items noted. Validation of ID
928 and URI attributes is implemented. Warn if unknown
929 value is used in rdf:parseType="" attribute.
930
931 * URIChecker.pm (check_rdf_uri_reference): New function.
932
933 2008-03-21 Wakaba <wakaba@suika.fam.cx>
934
935 * RDFXML.pm: bnodeid implemented. Relative references
936 are now resolved.
937
938 2008-03-21 Wakaba <wakaba@suika.fam.cx>
939
940 * ContentChecker.pm: RDF reification implemented.
941
942 * RDFXML.pm: undef vs false bug fixed.
943 Reification implemented.
944
945 2008-03-21 Wakaba <wakaba@suika.fam.cx>
946
947 * RDFXML.pm: s/id/ID/ for attribute name.
948 The |node| arguments are added for |ontriple| calls.
949 Too many "attribute not allowed" errors were raised.
950
951 * ContentChecker.pm: Initial experimental support for rdf:RDF
952 element.
953
954 2008-03-21 Wakaba <wakaba@suika.fam.cx>
955
956 * RDFXML.pm: New module.
957
958 2008-03-20 Wakaba <wakaba@suika.fam.cx>
959
960 * HTML.pm.src (set_inner_html): Line/column number
961 code was still the old one.
962
963 2008-03-20 Wakaba <wakaba@suika.fam.cx>
964
965 * HTML.pm.src: Better line/column reporting for "duplicate attribute"
966 errors. Line/column markings for DOCTYPE, comment, and
967 character tokens are reintroduced; otherwise,
968 error location for "not HTML5" error and errors
969 for implied elements are not attached.
970
971 2008-03-20 Wakaba <wakaba@suika.fam.cx>
972
973 * HTML.pm.src: Set line/column numbers to attributes.
974
975 * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
976 (value): Setter implemented.
977
978 * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
979
980 2008-03-20 Wakaba <wakaba@suika.fam.cx>
981
982 * HTML.pm.src: Unused line/column markings are removed.
983
984 2008-03-20 Wakaba <wakaba@suika.fam.cx>
985
986 * HTML.pm.src (_get_next_token): Remove |first_start_tag|
987 flag, which is no longer used.
988
989 2008-03-17 Wakaba <wakaba@suika.fam.cx>
990
991 * HTML.pm.src: Set line/column information to element nodes.
992
993 * mkhtmlparser.pl (!!!create-element, !!!insert-element,
994 and !!!insert-element-t): Set line/column information to
995 element nodes.
996
997 2008-03-17 Wakaba <wakaba@suika.fam.cx>
998
999 * HTML.pm.src (_get_next_token): The first "<" character
1000 in "<?", "<>", or "</>" should be the error point.
1001
1002 2008-03-16 Wakaba <wakaba@suika.fam.cx>
1003
1004 * HTML.pm.src: Some more fixes on error position reporting.
1005
1006 2008-03-16 Wakaba <wakaba@suika.fam.cx>
1007
1008 * HTML.pm.src: Token-level precise error reporting.
1009
1010 2008-03-16 Wakaba <wakaba@suika.fam.cx>
1011
1012 * HTML.pm.src: Preparation for more precise error point
1013 reporting.
1014
1015 2008-03-11 Wakaba <wakaba@suika.fam.cx>
1016
1017 * HTML.pm.src: Error type revised.
1018
1019 2008-03-11 Wakaba <wakaba@suika.fam.cx>
1020
1021 * HTML.pm.src: Similar code is merged together, again.
1022
1023 2008-03-11 Wakaba <wakaba@suika.fam.cx>
1024
1025 * HTML.pm.src: Similar code is merged together.
1026
1027 2008-03-10 Wakaba <wakaba@suika.fam.cx>
1028
1029 * mkhtmlparser.pl: Set "level" parameter to parse errors.
1030
1031 * HTML.pm.src: Code refined.
1032
1033 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1034
1035 * HTML.pm.src: |</body>| treatment has been changed (HTML5
1036 revision 1348). Note that I really don't know whether this makes
1037 any difference in the black-box behavior of the parser.
1038
1039 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1040
1041 * HTML.pm.src: New end-of-file token implementation (HTML5
1042 revision 1348).
1043
1044 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1045
1046 * HTML.pm.src: |applet| support (HTML5 revision 1347).
1047
1048 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1049
1050 * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
1051
1052 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1053
1054 * HTML.pm.src: Support for |<input>| in the "in select" insertion
1055 mode and support for the "in select in table" insertion mode (HTML5
1056 revision 1342).
1057
1058 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1059
1060 * HTML.pm.src: No foster parenting for <style> and <script>
1061 in non-tainted <table>s (HTML5 revision 1336).
1062
1063 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1064
1065 * HTML.pm.src: Ignore white space characters between <html>
1066 and <head> (HTML5 revision 1332).
1067
1068 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1069
1070 * HTML.pm.src: Treat <input type=hidden> as if it were a
1071 white space (HTML5 revision 1331).
1072
1073 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1074
1075 * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
1076 element (HTML5 revision 1330).
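
A minimal sketch of the |listing| rule above, written in Python rather than the module's Perl. It assumes the parser can see the first chunk of character data that follows the start tag; the function name drop_leading_lf is hypothetical and does not appear in HTML.pm.src.

```python
def drop_leading_lf(first_text: str) -> str:
    """Drop a single U+000A if it is the very first character.

    Only one line feed is removed; later line feeds are content.
    """
    if first_text.startswith("\n"):
        return first_text[1:]
    return first_text
```

Only the first line feed is dropped, so a deliberately blank first line can still be written by starting the content with two line feeds.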
1077
1078 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1079
1080 * HTML.pm.src: <title> is always appended to the current
1081 element (HTML5 revision 1328).
1082
1083 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1084
1085 * HTML.pm.src: White space in tainted tables is moved
1086 into foster parents (HTML5 revision 1326).
1087
1088 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1089
1090 * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
1091 revision 1321).
1092
1093 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1094
1095 * HTML.pm.src: |</h/n/>| case code rearranged to align with
1096 the spec (HTML5 revision 1320). Note that this finally completes
1097 all of the HTML5 revision 1320 changes.
1098
1099 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1100
1101 * HTML.pm.src: |</form>| now works similarly to |</div>| for unclosed
1102 tags (HTML5 revision 1320).
1103
1104 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1105
1106 * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
1107
1108 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1109
1110 * HTML.pm.src: A "generate implied end tags" code (t409.1)
1111 could not be reached so that it is now removed (HTML5 revision
1112 1320).
1113
1114 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1115
1116 * HTML.pm.src: Code for the case of |</div>| and so on is revised
1117 to align with new spec text (HTML5 revision 1320).
1118
1119 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1120
1121 * HTML.pm.src: Remove strange |if| condition; however, it should
1122 have done no harm in theory.
1123
1124 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1125
1126 * HTML.pm.src (_tree_construction_main): '</p>' in body
1127 case is split from other end tags for the preparation
1128 of implementing HTML5 revision 1320.
1129
1130 2008-03-07 Wakaba <wakaba@suika.fam.cx>
1131
1132 * HTML.pm.src: Simplified "generate implied end tag" (HTML5
1133 revision 1320).
1134
1135 2008-03-07 Wakaba <wakaba@suika.fam.cx>
1136
1137 * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
1138 and "div" (HTML5 revision 1318). Add comments to where
1139 |form| pointer association codes should be inserted (HTML5 revision
1140 1319).
1141
1142 2008-03-06 Wakaba <wakaba@suika.fam.cx>
1143
1144 * HTML.pm.src: <html> treatment refined (HTML5 revision 1314).
1145
1146 2008-03-05 Wakaba <wakaba@suika.fam.cx>
1147
1148 * HTML.pm.src: Since the case t268 should never be reached (there
1149 are no other token types), it is replaced by a |die| statement.
1150
1151 2008-03-05 Wakaba <wakaba@suika.fam.cx>
1152
1153 * HTML.pm.src: Typo fixed.
1154
1155 2008-03-04 Wakaba <wakaba@suika.fam.cx>
1156
1157 * HTML.pm.src (_tree_construction_initial): Some limited quirks
1158 doctypes were not uppercased for comparison.
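
The fix above can be sketched as follows, in Python rather than the module's Perl. It assumes the known public identifiers are stored in upper case, so the identifier taken from the document must be uppercased before the lookup. The two-entry set and the function name is_limited_quirks are hypothetical and abbreviated; they are not the list used by _tree_construction_initial.

```python
# Hypothetical, abbreviated list of limited-quirks public identifiers,
# stored in upper case so that one uppercasing step is enough to compare.
LIMITED_QUIRKS_PUBLIC_IDS = {
    "-//W3C//DTD XHTML 1.0 FRAMESET//EN",
    "-//W3C//DTD XHTML 1.0 TRANSITIONAL//EN",
}


def is_limited_quirks(public_id):
    """Return True if the DOCTYPE public identifier is in the list.

    The identifier is uppercased first; comparing it as written in the
    document would miss the usual mixed-case spelling.
    """
    if public_id is None:
        return False
    return public_id.upper() in LIMITED_QUIRKS_PUBLIC_IDS
```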
1159
1160 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1161
1162 * HTML.pm.src (tree construction and set_inner_html): Checkpoints
1163 are added.
1164
1165 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1166
1167 * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
1168 are set. Cases that are unlikely to be reached are noted as such.
1169
1170 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1171
1172 * HTML.pm.src: Checkpoints for debugging are added.
1173
1174 * mkhtmlparser.pl: Support for |!!!cp| syntax.
1175
1176 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1177
1178 * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
1179 for simplicity.
1180
1181 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1182
1183 * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
1184 revision 1307).
1185
1186 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1187
1188 * HTML.pm.src (_get_next_token): The places where the /incorrect/
1189 flag is set are changed (HTML5 revision 1305).
1190
1191 2008-03-02 Wakaba <wakaba@suika.fam.cx>
1192
1193 * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
1194 1292). Entities are not parsed in comment-like part in RCDATA
1195 elements (HTML5 revision 1294). Allow bare & at the end
1196 of attribute value literals (HTML5 revision 1296). More
1197 quirks mode doctypes (HTML5 revision 1302). Require spaces
1198 between attributes and ban attribute names or unquoted
1199 attribute values containing single or double quotes (HTML5
1200 revision 1303).
1201
1202 2008-03-02 Wakaba <wakaba@suika.fam.cx>
1203
1204 * ContentChecker.pm: Typo fixed. Don't raise "character encoding"
1205 and related errors unless it is an HTML document (though the spec
1206 is unclear on whether it applies to XHTML documents).
1207
1208 * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
1209 are added.
1210
1211 2008-03-02 Wakaba <wakaba@suika.fam.cx>
1212
1213 * HTML.pm.src: s/local_name/manakai_local_name/g.
1214
1215 2008-03-01 Wakaba <wakaba@suika.fam.cx>
1216
1217 * _NamedEntityList.pm: Updated (HTML5 revision 1286).
1218
1219 * HTML.pm.src: |charset| in |content| attribute is
1220 case-insensitive (HTML5 revision 1270).
1221
1222 2008-02-26 Wakaba <wakaba@suika.fam.cx>
1223
1224 * ContentChecker.pm: New status constants are added.
1225 ($ElementDefault): |status| added.
1226 (check_element): Err for non-standard or deprecated elements.
1227 (_attr_status_info): For non-standard or deprecated attributes.
1228
1229 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1230
1231 * ContentChecker.pm (_attr_status_info): New internal method.
1232
1233 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1234
1235 * ContentChecker.pm (check_element): Element standardized
1236 status information is now dispatched.
1237
1238 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1239
1240 * ContentChecker.pm (check_element): Fix |del|-and-significant
1241 problem by adding some more arguments.
1242
1243 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1244
1245 * ContentChecker.pm (check_element): Use context of
1246 container-for-the-purpose-of-content-model element (not transparent
1247 element) for |check_child_element| calling and significant
1248 text flag marking. This reintroduces the |<del>|-and-significant
1249 problem.
1250
1251 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1252
1253 * ContentChecker.pm (check_element): Make semi-transparent
1254 elements ignored for the purpose of phase changes in
1255 content model checking.
1256
1257 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1258
1259 * ContentChecker.pm (check_element): In-element state
1260 was not properly managed for transparent cases.
1261
1262 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1263
1264 * ContentChecker.pm (check_element): Support for |video|
1265 and |audio| as semi-transparent elements.
1266
1267 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1268
1269 * ContentChecker.pm ($HTMLSemiTransparentElements): New.
1270 (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
1271 Support for |html:object| as a semi-transparent element.
1272
1273 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1274
1275 * ContentChecker.pm (check_element): The way to traverse
1276 the tree is entirely revised to make it easier to track
1277 the state of ancestors/descendants. As a result of this
1278 revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1279 support for content model checking for HTML elements |figure|,
1280 |object|, |video|, and |audio| and checking for XML elements (and
1281 some XMLNS checkings) are dropped for now. They will be
1282 reimplemented in due course.
1283
1284 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1285
1286 * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1287 token (HTML5 revision 1225).
1288
1289 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1290
1291 * ContentChecker.pm ({unsupported_level}): New value.
1292
1293 * HTML.pm.src: Save whether |meta| |content| attribute
1294 contains character references or not.
1295
1296 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1297
1298 * ContentChecker.pm (_get_children): (Incomplete) attempt to
1299 implement significant content checking for contents
1300 with |del| elements.
1301
1302 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1303
1304 * ContentChecker.pm ($HTMLTransparentElements): More
1305 elements are added.
1306 (_get_children): HTML |object| elements are now semi-transparent.
1307
1308 * NanoDOM.pm (manakai_html, manakai_head): New methods.
1309
1310 2008-02-16 Wakaba <wakaba@suika.fam.cx>
1311
1312 * CacheManifest.pm: HTML5 revision 1211 implemented.
1313
1314 * CacheManifest.pod: Updated.
1315
1316 2008-02-10 Wakaba <wakaba@suika.fam.cx>
1317
1318 * ContentChecker.pm (check_document, check_element): Support
1319 for second argument ($onsubdoc).
1320 (_get_css_parser): Removed (now it is part of WDCC).
1321
1322 2008-02-09 Wakaba <wakaba@suika.fam.cx>
1323
1324 * ContentChecker.pm (_get_css_parser): New.
1325
1326 2007-11-25 Wakaba <wakaba@suika.fam.cx>
1327
1328 * ContentChecker.pm ($AnyChecker): Old way to add child elements
1329 for checking had been used.
1330
1331 2007-11-25 Wakaba <wakaba@suika.fam.cx>
1332
1333 * ContentChecker.pm (check_element): New todo item type |descendant|.
1334
1335 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1336
1337 * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1338 specced in RFC 2046 and RFC 4288.
1339 (application/atom+xml): Definition added.
1340
1341 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1342
1343 * URIChecker.pm: Make RFC 3986 should-level errors
1344 warnings (rather than SHOULD-level errors).
1345
1346 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1347
1348 * NanoDOM.pm (get_user_data, set_user_data): New methods.
1349
1350 * HTML.pm.src: A flag for character references in attribute
1351 values is added. Set |manakai_has_reference| user data
1352 to |charset| attribute.
1353
1354 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1355
1356 * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1357 attributes.
1358
1359 * ContentChecker.pm (check_document): Warn if charset requirements
1360 cannot be tested.
1361
1362 2007-11-19 Wakaba <wakaba@suika.fam.cx>
1363
1364 * HTML.pm.src (parse_byte_string): Detect charset
1365 by universalchardet if charset parameter is not specified.
1366
1367 * Makefile (Charset-all, Charset-clean): New rules.
1368
1369 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1370
1371 * ContentChecker.pm (check_document): Check the existence
1372 of character encoding declaration and selection of encoding
1373 for HTML document.
1374
1375 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1376
1377 * ContentType.pm (get_sniffed_type): Return also the official
1378 type in list context.
1379
1380 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1381
1382 * ContentType.pm: Sniffing with leading white space
1383 ignoring (HTML5 revisions 1013 and 1016).
1384
1385 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1386
1387 * ContentType.pm: HTML5 revision 1013 changes, except for leading
1388 white spaces, are implemented.
1389
1390 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1391
1392 * HTML.pm.src (parse_char_string): Set |inner_encoding|
1393 attribute if possible.
1394
1395 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1396
1397 * HTML.pm.src (parse_byte_string): New method.
1398 (parse_char_string): New alias for |parse_string|.
1399 (main phase): Invoking "change the encoding" algorithm if desired.
1400
1401 * HTML.pod: Updated.
1402
1403 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1404
1405 * HTML.pod (get_inner_html): Removed.
1406
1407 * Makefile (HTML-all, HTML-clean): New.
1408
1409 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1410
1411 * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1412
1413 2007-11-08 Wakaba <wakaba@suika.fam.cx>
1414
1415 * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1416
1417 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1418
1419 * Makefile: |CacheManifest.html| is added.
1420
1421 * CacheManifest.pod: New file.
1422
1423 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1424
1425 * CacheManifest.pm: New module.
1426
1427 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1428
1429 * HTML.pm.src: Support for application cache selection algorithm
1430 callback.
1431
1432 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1433
1434 * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1435
1436 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1437
1438 * Makefile (clean): New rule.
1439
1440 * NanoDOM.pm (public_id, system_id): New attributes.
1441
1442 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1443
1444 * Makefile (CSS-all, CSS-clean, clean): New rules.
1445
1446 2007-10-14 Wakaba <wakaba@suika.fam.cx>
1447
1448 * ContentChecker.pm (check_document): Support for
1449 new |is_xml_root| flag.
1450 (check_element): Support for new |pluses| state.
1451 (_add_pluses): New method.
1452 (_remove_minuses): Support for new |minus| item.
1453
1454 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1455
1456 * ContentChecker.pm: Raise specific error for invalid
1457 root element.
1458
1459 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1460
1461 * ContentChecker.pm: Set level values for later uses.
1462
1463 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1464
1465 * ContentChecker.pm: Support for language tag validation.
1466
1467 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1468
1469 * LangTag.pm (check_rfc3066_language_tag): New method.
1470
1471 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1472
1473 * LangTag.pm: New module.
1474
1475 2007-09-04 Wakaba <wakaba@suika.fam.cx>
1476
1477 * HTML.pm.src: Some error types were wrong.
1478
1479 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1480
1481 * CSS/: New directory.
1482
1483 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1484
1485 * ContentChecker.pm (_check_get_children): Support
1486 for |noscript| in |head|.
1487
1488 2007-08-12 Wakaba <wakaba@suika.fam.cx>
1489
1490 * URI/: New directory.
1491
1492 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1493
1494 * HTML.pm.src: Tokenizer's states are now represented as
1495 numbers.
1496
1497 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1498
1499 * HTML.pm.src: |or|s for insertion modes are replaced
1500 by |&|s.
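
A minimal sketch of why numeric insertion modes allow |&| in place of a chain of |or|s, in Python rather than the module's Perl. It assumes each mode is assigned its own bit; the constant names and values below are hypothetical and are not taken from HTML.pm.src.

```python
# Hypothetical bit assignments, one bit per insertion mode.
IN_BODY_IM = 0x01
IN_TABLE_IM = 0x02
IN_ROW_IM = 0x04
IN_SELECT_IM = 0x08

# A group of modes is the bitwise OR of its members.
TABLE_IMS = IN_TABLE_IM | IN_ROW_IM


def is_table_mode(mode: int) -> bool:
    """One bitwise AND replaces `mode == A or mode == B or ...`."""
    return bool(mode & TABLE_IMS)
```

With this layout, adding a mode to a group means changing one constant instead of every comparison chain.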
1501
1502 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1503
1504 * HTML.pm.src: Token types are now represented as numbers.
1505
1506 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1507
1508 * ContentType.pm (SEE ALSO): Updated.
1509
1510 * HTML.pm.src: Insertion modes are now represented as numbers.
1511
1512 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1513
1514 * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1515 999) is implemented.
1516
1517 2007-08-08 Wakaba <wakaba@suika.fam.cx>
1518
1519 * ContentType.pm: Sniffing for |<script| (HTML5 revision
1520 983) is implemented.
1521
1522 2007-08-06 Wakaba <wakaba@suika.fam.cx>
1523
1524 * ContentChecker.pod: New documentation.
1525
1526 * Makefile: A rule for |ContentChecker.html| is added.
1527
1528 * ContentChecker.pm: A pod "LICENSE" section is added.
1529
1530 * NanoDOM.pm ($VERSION): New variable.
1531
1532 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1533
1534 * H2H.pm: |b|, |i|, and |sub| are added to the
1535 list of allowed HTML elements.
1536
1537 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1538
1539 * H2H.pm: |samp| is added to the list of allowed
1540 HTML elements.
1541
1542 * URIChecker.pm (check_iri): New.
1543 (check_iri_reference): Error type for IRI reference
1544 syntax error is changed.
1545
1546 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1547
1548 * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1549 is added.
1550 (check_document): Load appropriate module before validation.
1551
1552 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1553
1554 * ContentChecker/: New directory.
1555
1556 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1557
1558 * ContentChecker.pm: HTML |time| element is implemented.
1559
1560 * HTMLTable.pm: Comments are updated as HTML5 is revised.
1561
1562 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1563
1564 * ContentChecker.pm (check_document): Return value
1565 even if no document element is found.
1566
1567 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1568
1569 * HTML.pm.src: |$in_body| is no longer a function.
1570
1571 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1572
1573 * HTML.pm.src: The |$in_body| code has been moved down.
1574
1575 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1576
1577 * HTML.pm.src: The "trailing end" insertion mode
1578 is split into "after html body" and "after html frameset"
1579 insertion modes. Their codes are merged with "after body"
1580 and "after frameset" codes. |$previous_insertion_mode|
1581 has been removed. "after frameset" code is
1582 merged with "in frameset" code.
1583
1584 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1585
1586 * HTML.pm.src: The "before head" insertion mode is
1587 merged with the "in head" insertion mode.
1588
1589 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1590
1591 * HTML.pm.src: Cases in "in head" insertion mode are
1592 reorganized.
1593
1594 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1595
1596 * HTML.pm.src: Some cases in "in table" insertion mode
1597 are merged.
1598
1599 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1600
1601 * HTML.pm.src: The "in row" insertion mode is merged
1602 with "in table" insertion mode.
1603
1604 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1605
1606 * HTML.pm.src: The "in table" and "in table body" insertion
1607 modes are merged.
1608
1609 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1610
1611 * HTML.pm.src: There is no "in table head" or "in table foot"
1612 insertion mode!
1613
1614 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1615
1616 * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
1617 now directly invoke the handler.
1618
1619 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1620
1621 * HTML.pm.src: Code for the "in cell" insertion mode
1622 is merged into the "in body" insertion mode code.
1623
1624 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1625
1626 * HTML.pm.src: Code for the "in body" and "in caption"
1627 insertion modes is merged.
1628
1629 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1630
1631 * HTML.pm.src: Two |!!!next-token|s were missing.
1632
1633 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1634
1635 * HTML.pm.src: Use numeric constant for |{content_mode}|
1636 instead of string constant for |{content_model_flag}|.
1637
1638 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1639
1640 * HTML.pm.src: Add the name of the attribute
1641 to the "duplicate attribute" error.
1642
1643 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1644
1645 * ContentChecker.pm: Return the |class| node list.
1646
1647 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1648
1649 * ContentChecker.pm: Return the |id| node list.
1650
1651 * HTML.pm.src: A typo is fixed.
1652
1653 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1654
1655 * ContentChecker.pm: Drop wrong |level => 'error'| specification
1656 from "in HTML:xml:lang" error. Character position
1657 is now the last part of the error type in the URI error
1658 description. Report "unsupported" status for language
1659 tags, media queries, script codes, and style sheets.
1660
1661 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1662
1663 * ContentChecker.pm: Report error if |xml:lang|
1664 in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
1665 in XML.
1666
1667 * NanoDOM.pm (Attr.owner_document): New attribute.
1668
1669 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1670
1671 * HTML.pm.src: The character immediately following
1672 a bare |hcro| was discarded. Fix handling of
1673 entity references in attribute values.
1674
1675 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1676
1677 * HTML.pm.src (main and trailing end phases): Token
1678 types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
1679 factored out. Error types |in html:#DOCTYPE|
1680 and |after html:#DOCTYPE| are merged
1681 into |DOCTYPE in the middle|. |</frameset>|
1682 in fragment parsing mode changed the
1683 insertion mode.
1684
1685 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1686
1687 * HTML.pm.src: |$phase| has been removed; The |trailing end|
1688 phase is now an insertion mode. Treatments for white
1689 space character tokens were incorrect for some
1690 insertion modes. An old |meta| case was not removed.
1691
1692 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1693
1694 * HTML.pm.src: |meta| charset declaration extraction
1695 implemented (but changing the encoding is not yet implemented :-).
1696
1697 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1698
1699 * Charset/: New directory.
1700
1701 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1702
1703 * H2H.pm: New Perl module (created from
1704 manakai's H2H.dis).
1705
1706 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1707
1708 * XMLSerializer.pm: New Perl module (created from
1709 manakai's SimpleLS.dis).
1710
1711 2007-07-07 Wakaba <wakaba@suika.fam.cx>
1712
1713 * HTML.pm.src: |<!---x-->| was not processed correctly.
1714
1715 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1716
1717 * HTML.pm.src: Report correct error message
1718 for |<body></div></body>|.
1719
1720 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1721
1722 * HTMLTable.pm: An error description was incorrect.
1723
1724 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1725
1726 * ContentChecker.pm: Return |{term}| list.
1727
1728 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1729
1730 * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
1731 nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
1732 and <title> outside of head).
1733
1734 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1735
1736 * IMTChecker.pm: Report warning for unregistered
1737 and private types/subtypes.
1738
1739 * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
1740 URIChecker.pm, HTMLTable.pm: Error messages are now
1741 consistent; they are all listed in
1742 <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
1743
1744 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1745
1746 * ContentChecker.pm: |<img ismap>| not in |<a></a>|
1747 is now reported as an error. |<datalist>| is implemented.
1748 Attribute checkers for |<command>| and |<menu>| are
1749 added. Support for |contextmenu| global attribute
1750 is added.
1751
1752 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1753
1754 * HTML.pm.src (_reset_insertion_mode): Interpretation
1755 of Step 3 has been changed.
1756
1757 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1758
1759 * HTML.pm.src: Late |<html>| parse error is implemented.
1760
1761 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1762
1763 * URIChecker.pm (check_iri_reference): A |decode| method name was
1764 incorrect.
1765
1766 * ContentChecker.pm: Support for the |footer| element.
1767 Check URI syntax for space-separated URI attributes.
1768 Support for the |tabindex| attribute. Support
1769 for |datetime| attribute.
1770
1771 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1772
1773 * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
1774 character references). HTML5 revision 1.146 (white space
1775 characters before root start tag). HTML5 revision
1776 1.148 (named character references in attribute values).
1777 HTML5 revision 1.152 (<plaintext>.innerHTML get).
1778
1779 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1780
1781 * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
1782 in <head>).
1783
1784 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1785
1786 * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>
1787 in body).
1788
1789 * ContentChecker.pm: HTML5 revision 938 (scoped="").
1790
1791 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1792
1793 * HTML.pm.src: HTML5 revision 923 (matching end tag in CDATA
1794 or RCDATA in fragment parsing mode). HTML5 revision
1795 924 (<!--> and <!--->). HTML5 revision 926 (hn in hn).
1796
1797 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1798
1799 * HTML.pm.src (get_inner_html): HTML5 revision 922 (inner_html
1800 for <pre> and <textarea>).
1801
1802 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1803
1804 * HTML.pm.src: HTML5 revision 920 (<isindex>).
1805
1806 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1807
1808 * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).
1809
1810 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1811
1812 * HTML.pm.src: HTML5 revision 916 (</body>).
1813 HTML5 revision 917 (conforming bare &).
1814
1815 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1816
1817 * NanoDOM.pm (manakai_is_html): Setting to false did not work.
1818
1819 * HTML.pm.src: HTML5 revision 914 (</ in CDATA, RCDATA).
1820 HTML5 revision 915 (<nobr>).
1821
1822 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1823
1824 * HTML.pm.src: HTML5 revisions 908, 909, 912, and 913 (quirks mode).
1825
1826 * NanoDOM.pm (manakai_is_html, manakai_compat_mode, compat_mode):
1827 New attributes.
1828
1829 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1830
1831 * HTML.pm.src: HTML5 revisions 900, 901, 902, and 911 (<
1832 in tags).
1833
1834 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1835
1836 * .cvsignore: |Entities.html| is added.
1837
1838 * HTML.pm.src: |$entity_char| is removed and
1839 requires |Whatpm::_NamedEntityList| instead.
1840 HTML5 revision 898 (refc), except that lack of refc
1841 is a parse error.
1842
1843 * mkentitylist.pl: New script.
1844
1845 * Makefile (all): |_NamedEntityList.pm| is added.
1846 (_NamedEntityList.pm, Entities.html): New rules.
1847
1848 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1849
1850 * HTML.pm.src: Parse errors immediately after U+000D
1851 were ignored and U+000D immediately following another
1852 U+000D was not converted to U+000A.
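
A minimal sketch of the newline handling this fix restores, in Python rather than the module's Perl. It assumes the usual HTML5 input rule: a CR LF pair becomes a single U+000A, and every other U+000D becomes U+000A, including one that directly follows another U+000D. The function name normalize_newlines is hypothetical; the real parser does this per character inside the tokenizer, not on a whole string.

```python
def normalize_newlines(text: str) -> str:
    """Convert CR LF pairs and lone CRs to LF.

    CR LF is replaced first so that a pair yields one LF; any CR left
    afterwards (including the first of two consecutive CRs) yields
    its own LF.
    """
    return text.replace("\r\n", "\n").replace("\r", "\n")
```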
1853
1854 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1855
1856 * HTML.pm.src (set_inner_html): HTML5 revision 892 (adopt
1857 nodes before they are appended). Parser was not ready for NULL
1858 parse error and escape flag.
1859
1860 * NanoDOM.pm (adopt_node): New.
1861
1862 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1863
1864 * HTML.pm.src: HTML5 revision 886 (insane comment in
1865 CDATA and RCDATA). Note that the current implementation
1866 simply repeats what the spec says and it may not be
1867 the best way to do it.
1868
1869 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1870
1871 * HTML.pm.src: HTML5 revision 884 (</form> doesn't close
1872 the form element if a descendant element without an implied end tag is
1873 still open).
1874
1875 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1876
1877 * ContentChecker.pm: HTML5 revision 881 (Make |id| attribute with
1878 space characters non-conforming).
1879
1880 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1881
1882 * HTML.pm.src: An error message was incorrect.
1883 HTML5 revision 869 (C1 character references).
1884
1885 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1886
1887 * HTML.pm.src: HTML5 revision 867 (a LF at the beginning of
1888 a |textarea| is removed).
1889
1890 2007-06-05 Wakaba <wakaba@suika.fam.cx>
1891
1892 * NanoDOM.pm (get_attribute_node_ns): New method.
1893
1894 * ContentChecker.pm: |script| |async| and |defer|
1895 no longer require |src|. |async| MUST NOT be
1896 specified if |defer|. (HTML5 revision 858).
1897
1898 2007-05-30 Wakaba <wakaba@suika.fam.cx>
1899
1900 * HTML.pm.src: |<form><form>| went into an infinite loop.
1901
1902 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1903
1904 * ContentChecker.pm (html): Set |is_root| (allowed
1905 as a document element) flag on.
1906 (new): Removed.
1907 (check_document): New method.
1908
1909 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1910
1911 * ContentChecker.pm (thead, tfoot): Checker specifications
1912 were incorrect.
1913
1914 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1915
1916 * ContentChecker.pm ($HTMLURIAttrChecker): Include
1917 error position in the |type| option of the error.
1918
1919 * HTMLTable.pm (form_table): The |$onerror| parameter
1920 is now optional. Some bugs are fixed.
1921
1922 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1923
1924 * HTMLTable.pm: New module.
1925
1926 * ContentChecker.pm (table): Invoke table model error checker.
1927
1928 * NanoDOM.pm (first_child, get_attribute_ns): New.
1929
1930 2007-05-26 Wakaba <wakaba@suika.fam.cx>
1931
1932 * ContentChecker.pm ($HTMLLinkTypesAttrChecker): New checker.
1933 (link/@rel, a/@rel, area/@rel): Use new checker.
1934
1935 * Makefile (_LinkTypeList.pm, RelExtensions.html): New rules.
1936
1937 * _LinkTypeList.pm: New file.
1938
1939 * mklinktypelist.pl: New file.
1940
1941 * .cvsignore: |RelExtensions.html| added.
1942
1943 * NanoDOM.pm (child_nodes): Returns an empty array
1944 for non-child-containing node types.
1945 (text_content): New attribute.
1946
1947 2007-05-26 Wakaba <wakaba@suika.fam.cx>
1948
1949 * IMTChecker.pm: New module.
1950
1951 * ContentChecker.pm ($HTMLIMTAttrChecker): Call IMTChecker
1952 to test parameter value validity.
1953
1954 * HTML.pm.src ($style_start_tag): Attributes were
1955 discarded.
1956
1957 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1958
1959 * ContentChecker.pm ($HTMLURIAttrChecker): Implemented.
1960
1961 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1962
1963 * URIChecker.pm: All recommendations from RFC 3986
1964 and RFC 3987 are listed (not all testable items are checked yet).
1965
1966 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1967
1968 * URIChecker.pm: New module.
1969
1970 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1971
1972 * ContentChecker.pm: Now most attributes are implemented
1973 or associated with a placeholder.
1974 ($ElementDefault): Warn about unknown attributes
1975 on unknown elements as "attribute not supported".
1976 ($HTMLLanguageTagAttrChecker, $HTMLMQAttrChecker): New placeholders.
1977 ($HTMLUsemapAttrChecker, $HTMLTargetAttrChecker): New checkers.
1978 (|a| attribute checker): Reimplemented.
1979
1980 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1981
1982 * ContentChecker.pm ($HTMLEventHandlerAttrChecker): New placeholder.
1983 ($HTMLAttrChecker): Event handler content attributes
1984 are added.
1985 (link, embed): Required attribute is now checked.
1986 (embed): Unknown local attributes no longer trigger a warning.
1987
1988 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1989
1990 * ContentChecker.pm ($HTMLSpaceURIsAttrChecker): New placeholder.
1991 ($HTMLIMTAttrChecker): New checker.
1992 (link@rel, link@href, link@type, style@type,
1993 a@href, a@ping, a@type, embed@src, embed@type,
1994 object@data, object@type, source@src, source@type, area@alt,
1995 area@shape, area@coords, area@href,
1996 area@ping, area@rel, area@type, script@src,
1997 script@defer, script@async, script@type): Checkers added.
1998
1999 2007-05-20 Wakaba <wakaba@suika.fam.cx>
2000
2001 * ContentChecker.pm: Descendant checking was incorrect.
2002
2003 2007-05-19 Wakaba <wakaba@suika.fam.cx>
2004
2005 * ContentChecker.pm: Support |xml:*| and |xmlns:*|
2006 attributes. Report an error if |Element.prefix|
2007 is |xmlns|.
2008
2009 * NanoDOM.pm (prefix): New attribute.
2010
2011 2007-05-19 Wakaba <wakaba@suika.fam.cx>
2012
2013 * HTML.pm.src: In |main| phase, |in body| insertion
2014 mode, action for |<iframe>| was missing.
2015
2016 2007-05-19 Wakaba <wakaba@suika.fam.cx>
2017
2018 * ContentChecker.pm: Support for many HTML5 elements.
2019 ($GetHTMLNonNegativeIntegerAttrChecker): New.
2020
2021 2007-05-19 Wakaba <wakaba@suika.fam.cx>
2022
2023 * ContentChecker.pm: Support for most elements up to |progress|.
2024 ($HTMLURIAttrChecker): Placeholder.
2025 ($HTMLIntegerAttrChecker, $GetHTMLFloatingPointNumberAttrChecker):
2026 New.
2027
2028 2007-05-19 Wakaba <wakaba@suika.fam.cx>
2029
2030 * ContentChecker.pm: Attribute checkers
2031 for global attributes, |html|, |base|, |style|, and |meta|.
2032
2033 * NanoDOM.pm (insert_before): Weaken reference
2034 to the parent node.
2035 (Attr::new): Set |owner_element| attribute.
2036 (namespace_uri, manakai_local_name): New attribute implementations.
2037 (owner_element): New attribute.
2038
2039 2007-05-19 Wakaba <wakaba@suika.fam.cx>
2040
2041 * ContentChecker.pm ($AttrChecker, $HTMLAttrChecker,
2042 $AnyChecker->{attr_checker}, $HTMLAttrsChecker,
2043 $Element->{$HTML_NS}->{''}): New.
2044 (check_element): Invoke attrs_checker for each element.
2045
2046 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2047
2048 * ContentChecker.pm: Don't use |manakai_element_type_match|.
2049
2050 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2051
2052 * ContentChecker.pm: Use hashes rather than lists for
2053 element type testing.
2054
2055 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2056
2057 * ContentChecker.pm: Don't generate a duplicate
2058 error when an element type is put in the "minus" list
2059 and the element type is not allowed explicitly in the particular
2060 element content model.
2061 (html:a checker): New checker.
2062 (html:details, html:datagrid): New checkers.
2063 (html:legend): New checker.
2064
2065 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2066
2067 * ContentChecker.pm (html:li checker): Implemented.
2068
2069 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2070
2071 * ContentChecker.pm ($HTMLInlineOrStriclyInlineChecker): New
2072 checker.
2073 (html:dd checker): New checker.
2074 (html:q, html:em, html:strong, html:small,
2075 html:m, html:dfn, html:code, html:samp, html:span): New checkers.
2076
2077 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2078
2079 * ContentChecker.pm ($AnyChecker): Renamed
2080 from |$ElementDefault->{checker}|.
2081 ($ElementDefault->{checker}): Throw an error that
2082 the element type is not supported by the checker.
2083 ($HTMLMetadataElement): |html:base| was missing.
2084 ($HTMLEmptyChecker): Don't throw an error
2085 for inter-element whitespace nodes.
2086 (html:html checker): Errors were not
2087 thrown even if |html:head| and/or |html:body|
2088 children were missing.
2089 (html:head checker): An error was not
2090 thrown if <meta charset> appeared after other
2091 elements.
2092
2093 2007-05-05 Wakaba <wakaba@suika.fam.cx>
2094
2095 * ContentChecker.pm: |footer|, |video|, |audio|, |script|,
2096 and |noscript| elements are implemented.
2097 (new): New method.
2098
2099 2007-05-04 Wakaba <wakaba@suika.fam.cx>
2100
2101 * ContentChecker.pm: New module.
2102
2103 2007-05-04 Wakaba <wakaba@suika.fam.cx>
2104
2105 * NanoDOM.pm (manakai_parent_element,
2106 document_element, manakai_local_name,
2107 manakai_element_type_match): New methods.
2108
2109 2007-05-03 Wakaba <wakaba@suika.fam.cx>
2110
2111 * HTML.pm.src: Replace decimal and hexadecimal numeric
2112 entities in C1 range using Windows-1252 mapping. Bare LF
2113 did not count as new line for error reporting.
2114
2115 2007-05-02 Wakaba <wakaba@suika.fam.cx>
2116
2117 * NanoDOM.pm (DOMImplementation): New class.
2118 (append_child): Weaken the |parent_node| reference.
2119 (create_element_ns, Element new): Set the |owner_document|
2120 reference.
2121 (implementation): New attribute.
2122 (owner_document, local_name, namespace_uri): New attributes.
2123
2124 * HTML.pm.src (parse_string): Line and column numbers
2125 are now provided to the error handler.
2126 (!!!parse-error): Short descriptions are added.
2127 (_construct_tree): Split into three methods; support
2128 for innerHTML mode.
2129 (set_inner_html): New method.
2130
2131 2007-05-01 Wakaba <wakaba@suika.fam.cx>
2132
2133 * NanoDOM.html: Documentation is added.
2134
2135 * HTML.pod, ContentType.html: Documentation is revised.
2136
2137 * .cvsignore: Pod2html temporary files are added.
2138
2139 * Makefile: Make |NanoDOM.html|.
2140
2141 2007-05-01 Wakaba <wakaba@suika.fam.cx>
2142
2143 * HTML.pm.src (parse_string): New method.
2144 (get_inner_html): Renamed from |inner_html|.
2145
2146 * Makefile: A rule for |HTML.html| is added.
2147
2148 * HTML.pod: New documentation.
2149
2150 2007-05-01 Wakaba <wakaba@suika.fam.cx>
2151
2152 * NanoDOM.pm (last_child, previous_sibling): New attributes.
2153 (clone_node): Attribute nodes were not completely copied.
2154
2155 * HTML.pm.src: Many bugs are fixed.
2156
2157 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2158
2159 * HTML.pm.src: Some typos are fixed.
2160
2161 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2162
2163 * mkhtmlparser.pl, Makefile: References to the |HTML-consume-entity.src|
2164 are removed.
2165
2166 * HTML.pm.src: Tokenizer's handling of named entities is rewritten.
2167
2168 * HTML-consume-entity.src: Removed.
2169
2170 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2171
2172 * HTML.pm.src: Tokenizer's handling of hexadecimal
2173 numeric entities is rewritten.
2174
2175 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2176
2177 * HTML.pm.src: Some tokenizer bugs are fixed.
2178
2179 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2180
2181 * HTML.pm.src: The tree construction stage is implemented.
2182
2183 * mkhtmlparser.pl: New macros are added.
2184
2185 2007-04-28 Wakaba <wakaba@suika.fam.cx>
2186
2187 * ContentType.pm: A note on a bug in the specification
2188 is removed since it has now been fixed.
2189
2190 * .cvsignore: New file.
2191
2192 2007-04-28 Wakaba <wakaba@suika.fam.cx>
2193
2194 * HTML.pm.src, HTML-consume-entity.src: New files.
2195
2196 * Makefile (HTML.pm): New rule.
2197
2198 * mkhtmlparser.pl: New script.
2199
2200 2007-04-25 Wakaba <wakaba@suika.fam.cx>
2201
2202 * Makefile: New file.
2203
2204 2007-04-24 Wakaba <wakaba@suika.fam.cx>
2205
2206 * ContentType.pm: An error in pod is fixed.
2207
2208 2007-04-24 Wakaba <wakaba@suika.fam.cx>
2209
2210 * ContentType.pm: Documentation is added.
2211
2212 2007-04-24 Wakaba <wakaba@suika.fam.cx>
2213
2214 * ContentType.pm: New Perl module.
2215
2216 * ChangeLog: New file.
2217
2218
