/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Contents of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.345 - (show annotations) (download)
Sat Oct 4 14:31:27 2008 UTC (16 years, 1 month ago) by wakaba
Branch: MAIN
Changes since 1.344: +6 -0 lines
++ whatpm/t/ChangeLog	4 Oct 2008 14:31:16 -0000
	* tree-test-1.dat: Test results updated and new tests added on the
	boundaries of Text nodes (HTML5 revision 2124).

2008-10-04  Wakaba  <wakaba@suika.fam.cx>

++ whatpm/Whatpm/ChangeLog	4 Oct 2008 14:30:21 -0000
	* HTML.pm.src: If another node is inserted by the parser, don't
	reuse existing Text node to append a character (HTML5 revision
	2124).

2008-10-04  Wakaba  <wakaba@suika.fam.cx>

1 2008-10-04 Wakaba <wakaba@suika.fam.cx>
2
3 * HTML.pm.src: If another node is inserted by the parser, don't
4 reuse existing Text node to append a character (HTML5 revision
5 2124).
6
7 2008-10-04 Wakaba <wakaba@suika.fam.cx>
8
9 * HTML.pm.src: Support for <option> and <optgroup> in body (HTML5
10 revisions 1731 and 2128).
11
12 2008-10-04 Wakaba <wakaba@suika.fam.cx>
13
14 * HTML.pm.src: Make <foreignObject> scoping (HTML5 revision 1837).
15 Support for end tags of camelCase SVG elements were broken. A
16 wrong error type text fixed.
17
18 2008-10-04 Wakaba <wakaba@suika.fam.cx>
19
20 * HTML.pm.src: Drop redundant code (HTML5 revision 1731).
21
22 2008-10-04 Wakaba <wakaba@suika.fam.cx>
23
24 * HTML.pm.src: Support for new definition of |param| and |source|
25 start tag parsing (HTML5 revision 1731).
26
27 2008-10-04 Wakaba <wakaba@suika.fam.cx>
28
29 * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
30
31 2008-10-04 Wakaba <wakaba@suika.fam.cx>
32
33 * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
34 revisions 1731 and 1831).
35
36 2008-10-04 Wakaba <wakaba@suika.fam.cx>
37
38 * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
39 revisions 1731 and 1778). Support for the </sarcasm> end tag (HTML5
40 revision 1731).
41
42 2008-10-04 Wakaba <wakaba@suika.fam.cx>
43
44 * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
45 revision 1731). End tags of |option| and |optgroup| elements are
46 now optional (HTML5 revision 1731).
47
48 2008-10-04 Wakaba <wakaba@suika.fam.cx>
49
50 * HTML.pm.src: New "special" elements added to the list (HTML5
51 revision 1778). "strile" -> "strike".
52
53 2008-10-02 Wakaba <wakaba@suika.fam.cx>
54
55 * ContentType.pm (get_sniffed_type): Support for the "better"
56 content sniffing (HTML5 revision 1927). In a case the official
57 type was not returned when the method is invoked in the list
58 context.
59
60 2008-09-22 Wakaba <wakaba@suika.fam.cx>
61
62 * HTML.pm.src: Character references for non-space C0 characters,
63 including U+000B VT, DEL character, noncharacter code points, are
64 now converted to the U+FFFD character (cf. HTML5 revision 2138).
65
66 2008-09-21 Wakaba <wakaba@suika.fam.cx>
67
68 * ContentChecker.pm: |form=""| check support added.
69
70 2008-09-21 Wakaba <wakaba@suika.fam.cx>
71
72 * ContentChecker.pm: |contextmenu| validness is now checked using
73 |id| and |id_type| properties, and |menu| property is removed.
74
75 2008-09-21 Wakaba <wakaba@suika.fam.cx>
76
77 * ContentChecker.pm: Prepare for |form| |name| attribute's
78 duplication checking.
79
80 2008-09-21 Wakaba <wakaba@suika.fam.cx>
81
82 * HTML.pm.src (parse_byte_stream): Support (or non-support) for
83 unsupported charset="" parameter value (HTML5 revision 2131).
84
85 2008-09-20 Wakaba <wakaba@suika.fam.cx>
86
87 * HTML.pm.src: Reminding places where U+000B is allowed as a space
88 character is fixed (cf. HTML5 revision 1738).
89
90 * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
91 space characters (HTML5 revision 1738).
92
93 2008-09-20 Wakaba <wakaba@suika.fam.cx>
94
95 * HTML.pm.src: The "anything else" case for the "after after body"
96 insertion mode was not updated to swtich to the "in body"
97 insertion mode. U+000B is no longer a space character for the
98 purpose of tree construction phase (HTML5 revision 1738).
99
100 2008-09-20 Wakaba <wakaba@suika.fam.cx>
101
102 * HTML.pm.src: U+000B is no longer a space character (HTML5
103 revision 1738).
104
105 2008-09-20 Wakaba <wakaba@suika.fam.cx>
106
107 * ContentType.pm: 0x0B is no longer a space character (HTML5
108 revision 1738).
109
110 * HTML.pm.src: U+000B is no longer a space character for the
111 algorithm for extracting an encoding from a Content-Type (HTML5
112 revision 1738).
113
114 2008-09-20 Wakaba <wakaba@suika.fam.cx>
115
116 * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
117
118 2008-09-18 Wakaba <wakaba@suika.fam.cx>
119
120 * LangTag.pm: Add checks for remaining requirements from RFC 4646.
121
122 * mklangreg.pl: Sort 'Prefix' values by their length, to ease
123 matching.
124
125 2008-09-18 Wakaba <wakaba@suika.fam.cx>
126
127 * LangTag.pm: Warn for private use language subtags. Error level
128 typos fixed. Support for Suppress-Script field.
129
130 * mklangreg.pl: Support for dumping of nested structure.
131
132 2008-09-18 Wakaba <wakaba@suika.fam.cx>
133
134 * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
135 recommended case as per RFC 4646.
136
137 2008-09-18 Wakaba <wakaba@suika.fam.cx>
138
139 * LangTag.pm (check_rfc4646_langtag): New method.
140
141 2008-09-18 Wakaba <wakaba@suika.fam.cx>
142
143 * mklangreg.pl: New script.
144
145 * Makefile: Updated for creation of the module for language subtag
146 registry.
147
148 2008-09-16 Wakaba <wakaba@suika.fam.cx>
149
150 * Makefile: WebIDL.html added.
151
152 * WebIDL.pod: New documentation.
153
154 2008-09-16 Wakaba <wakaba@suika.fam.cx>
155
156 * WebIDL.pm: Checker's error types are redefined.
157
158 2008-09-16 Wakaba <wakaba@suika.fam.cx>
159
160 * WebIDL.pm: Parser's error types are redefined. Some forward
161 compatible parsing bugs are fixed. Some unreachable codes are
162 commented out.
163
164 2008-09-16 Wakaba <wakaba@suika.fam.cx>
165
166 * WebIDL.pm: Support for the reminding extended attributes are
167 added. It does not satisfy the definition that a forward
168 interface declaration has an extended attribute. It seems that
169 unless explicitly allowed multiple extended attributes with the
170 same name is not allowed, though it is not explicitly mentioned in
171 the spec.
172
173 2008-09-16 Wakaba <wakaba@suika.fam.cx>
174
175 * WebIDL.pm: Unescapes extended attribute names and extended
176 attribute identifiers. Preserve whether an extended attribute has
177 an argument list of not. Support for extended attributes:
178 Constructor, ExceptionConsts, IndexGetter, IndexSetter,
179 NameGetter, NameSetter, and Null.
180 (has_argument_list): New attribute.
181 (idl_text): Stringifies argument lists, if any, even if it is
182 empty.
183
184 2008-09-15 Wakaba <wakaba@suika.fam.cx>
185
186 * HTML.pm.src: New state |PCDATA_STATE|. Use an empty string for
187 |{s_kwd}| in DATA_STATE as default.
188
189 2008-09-15 Wakaba <wakaba@suika.fam.cx>
190
191 * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
192 by |{s_kwd}| in DATA_STATE.
193
194 2008-09-15 Wakaba <wakaba@suika.fam.cx>
195
196 * HTML.pm.src: Shorten keys.
197
198 2008-09-15 Wakaba <wakaba@suika.fam.cx>
199
200 * HTML.pm.src: Remove checking for control character, surrogate
201 pair, or noncharacter code points and non-Unicode code
202 points (they should be handled by Whatpm::Charset::UnicodeChecker).
203 (parse_char_stream): Support for the |$get_wrapper| argument and
204 character stream error handlers.
205
206 2008-09-15 Wakaba <wakaba@suika.fam.cx>
207
208 * ContentChecker.pm: Don't call |loda_ns_module|
209 for null-namespace elements/attributes.
210
211 * HTML.pm.src: Fact out $disallowed_control_chars
212 as a hash.
213
214 2008-09-14 Wakaba <wakaba@suika.fam.cx>
215
216 * HTML.pm.src: Regexp typo fixed. |{prev_char}|
217 and |{next_char}| initializations are moved to initialization
218 method. |{read_until}| now supports buffering. Sync |set_inner_html|
219 with |parse_char_stream|.
220
221 2008-09-14 Wakaba <wakaba@suika.fam.cx>
222
223 * HTML.pm.src (parse_char_stream): Make |set_next_char|
224 invoke |manakai_read_until|, not only |read|, where
225 possible, to decrease the number of |read| method calls.
226
227 * mkhtmlparser.pl: Related changes to the aforementioned
228 modification.
229
230 2008-09-14 Wakaba <wakaba@suika.fam.cx>
231
232 * HTML.pm.src: Use |read| instead of |getc|. |set_inner_html|
233 would report character error from now.
234
235 2008-09-14 Wakaba <wakaba@suika.fam.cx>
236
237 * HTML.pm.src: White-space-leaded non-white-space character
238 tokens in "before head insertion mode" was not
239 correctly handled.
240 (set_inner_html): Reimplemented using CharString decodehandle
241 class. Support for $get_wrapper argument. Support
242 for |{read_until}| feature.
243
244 2008-09-14 Wakaba <wakaba@suika.fam.cx>
245
246 * HTML.pm.src: Make a "bare ero" error for unknown
247 entities point the "&" character.
248
249 2008-09-14 Wakaba <wakaba@suika.fam.cx>
250
251 * HTML.pm.src: It turns out that U+FFFD don't have to
252 be added to the list of excluded characters.
253
254 2008-09-14 Wakaba <wakaba@suika.fam.cx>
255
256 * HTML.pm.src ($char_onerror): Have character decoder's |line|
257 and |column| a higher priority than the one set by the
258 tokenizer's input handler.
259 ($self->{read_until}): Exclude U+FFFD (but this might
260 not be necessary, since now we do line/column fixup in
261 the character decode handle).
262
263 2008-09-14 Wakaba <wakaba@suika.fam.cx>
264
265 * HTML.pm.src: Use |{read_until}| where possible.
266
267 2008-09-14 Wakaba <wakaba@suika.fam.cx>
268
269 * HTML.pm.src: Change |{getc_until}| to |{read_until}|
270 and |manakai_getc_until| to |manakai_read_until| to
271 reduce the number of string copies.
272
273 2008-09-14 Wakaba <wakaba@suika.fam.cx>
274
275 * HTML.pm.src (parse_char_string): Use newly created
276 |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
277 standard feature to |open| a string as a filehandle,
278 since Perl's string filehandle seems not supporting |ungetc|
279 method correctly.
280 (parse_char_stream): Define |{getc_until}| method.
281 (DATA_STATE): Experimental support for |getc_until| feature.
282
283 2008-09-13 Wakaba <wakaba@suika.fam.cx>
284
285 * HTML.pm.src: Check points added to newly added branches.
286
287 2008-09-13 Wakaba <wakaba@suika.fam.cx>
288
289 * HTML.pm.src: Remove |{char}|, which is no longer used.
290 Remove |{entity_in_attr}| and |{last_attribute_value_state}|
291 and replaced by |{prev_state}|.
292
293 * mkhtmlparser.pl: Remove |{char}| feature.
294 Remove |!!!back-next-input-character;| macro.
295
296 2008-09-13 Wakaba <wakaba@suika.fam.cx>
297
298 * HTML.pm.src: Finally we get rid of all the inner loops. Remove
299 entity related tokenizer states in favor of new states
300 implementing the consume character reference algorithm.
301
302 2008-09-13 Wakaba <wakaba@suika.fam.cx>
303
304 * HTML.pm.src: "Consume a character reference" algorithm is
305 now implemented as a tokenizer's state, rather than
306 a method, with minimum changes (more changes will
307 be made, in due course). "Bogus comment state"'s inner
308 loop gets removed.
309
310 2008-09-13 Wakaba <wakaba@suika.fam.cx>
311
312 * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
313 into their own tokenizer states.
314
315 2008-09-13 Wakaba <wakaba@suika.fam.cx>
316
317 * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
318 is split into three states.
319
320 2008-09-13 Wakaba <wakaba@suika.fam.cx>
321
322 * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
323 itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
324 no longer does the tokenizer have to push back next input
325 characters in those states.
326
327 2008-09-13 Wakaba <wakaba@suika.fam.cx>
328
329 * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
330 into four states so that no longer does the tokenizer have to push
331 back next input characters in that state.
332
333 2008-09-11 Wakaba <wakaba@suika.fam.cx>
334
335 * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
336 which can be used to insert some wrapper between the character
337 stream handle and the tokenizer. (It is currently not supported
338 for |set_inner_html| for |Element|s).
339
340 2008-09-10 Wakaba <wakaba@suika.fam.cx>
341
342 * HTML.pm.src: Ignore punctuations in charset names.
343
344 2008-09-10 Wakaba <wakaba@suika.fam.cx>
345
346 * ContentChecker.pm: Support for charset-layer error levels.
347
348 * HTML.pm.src: Don't specify |text| argument for the
349 |chardecode:fallback| error, since it is not the encoding
350 being used alternatively.
351
352 2008-09-06 Wakaba <wakaba@suika.fam.cx>
353
354 * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
355
356 2008-08-31 Wakaba <wakaba@suika.fam.cx>
357
358 * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
359
360 2008-08-31 Wakaba <wakaba@suika.fam.cx>
361
362 * HTML.pm.src: Bug fix and sync with the spec with regard
363 to after after frameset insertion mode processing (HTML5
364 revision 1909). Note that the implementation was wrong
365 per the old spec before the r1909 changes.
366
367 2008-08-30 Wakaba <wakaba@suika.fam.cx>
368
369 * HTMLTable.pm: scope=auto algorithm fix synced with the
370 spec (HTML5 revision 2093).
371 ($process_row): Algorithm step numbers synced with the
372 spec (HTML5 revision 2092).
373
374 2008-08-30 Wakaba <wakaba@suika.fam.cx>
375
376 * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
377 revision 2094).
378
379 2008-08-30 Wakaba <wakaba@suika.fam.cx>
380
381 * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
382
383 2008-08-30 Wakaba <wakaba@suika.fam.cx>
384
385 * HTML.pm.src: '"' and "'" at the end of attribute
386 name (after another attribute) now raise parse error (HTML5
387 revision 2123). Empty unquoted attribute values are no
388 longer allowed (HTML5 revision 2122).
389
390 2008-08-30 Wakaba <wakaba@suika.fam.cx>
391
392 * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
393 revision 2130).
394
395 2008-08-30 Wakaba <wakaba@suika.fam.cx>
396
397 * ContentChecker.pm: |xml:lang| attribute value must be same
398 as |lang| attribute value for HTML elements (HTML5 revision 2062
399 and so on).
400
401 2008-08-30 Wakaba <wakaba@suika.fam.cx>
402
403 * ContentChecker.pm: Error level definition for |xml_id_error|
404 was missing.
405
406 * URIChecker.pm: The end of the URL should be marked as the
407 error location for an empty path error. The position
408 between the userinfo and the port components should be
409 marked as the error location for an empty host error.
410
411 2008-08-30 Wakaba <wakaba@suika.fam.cx>
412
413 * URIChecker.pm: Set parameters representing where in the
414 value the error occurs for errors. Report unknown
415 address format error in warning level, since address
416 formats are rarely added. Path segments starting with "/.."
417 were misinterpreted as a dot-segment.
418
419 2008-08-30 Wakaba <wakaba@suika.fam.cx>
420
421 * URIChecker.pm (check_iri_reference): Requires
422 |Message::DOM::DOMImplementation|.
423
424 2008-08-29 Wakaba <wakaba@suika.fam.cx>
425
426 * IMTChecker.pm: Updated for the new error reporting architecture.
427
428 * ContentChecker.pm: Error levels for IMTs are added.
429
430 2008-08-17 Wakaba <wakaba@suika.fam.cx>
431
432 * H2H.pm (_shift_token): Support for unquoted HTML attribute
433 values.
434
435 2008-08-16 Wakaba <wakaba@suika.fam.cx>
436
437 * CacheManifest.pm: Support for new style of error
438 reports.
439
440 * HTML.pm.src: Set line=1, column=1 to the document node.
441
442 2008-08-16 Wakaba <wakaba@suika.fam.cx>
443
444 * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
445 and URL checkers. Support for more error levels for bogus
446 langauge tag and URL "standards".
447
448 * LangTag.pm, URIChecker.pm: Support for new style error
449 level reporting.
450
451 2008-08-15 Wakaba <wakaba@suika.fam.cx>
452
453 * ContentChecker.pm: Support for RDF/XML error levels.
454
455 * HTMLTable.pm, RDFXML.pm: Support for new style of error level
456 specifying. Error types are revised.
457
458 2008-08-15 Wakaba <wakaba@suika.fam.cx>
459
460 * ContentChecker.pm: All error reporting method calls are
461 renewed.
462
463 2008-08-15 Wakaba <wakaba@suika.fam.cx>
464
465 * HTML.pm.src: All error type names and "text" parameters
466 are revised. Use new style for "level" specification.
467
468 * mkhtmlparser.pl: Use new style for "level" specification.
469
470 2008-08-03 Wakaba <wakaba@suika.fam.cx>
471
472 * WebIDL.pm (parse_char_string): Simplified error
473 reporting process for broken ignored valuetype definition.
474 (Valuetype idl_text): Support for special "DOMString" name.
475
476 2008-08-03 Wakaba <wakaba@suika.fam.cx>
477
478 * WebIDL.pm ($get_scoped_name): Append "::::" if the last
479 terminal of the ScopedName is "DOMString", such that whether
480 the last part of the scoped name is "DOMString" or "_DOMString"
481 later. It is necessary to determine whether a |typedef|
482 definition should be ignored or not.
483 (parse_char_string): Unescape the identifier of
484 exception members.
485 ($resolve): Return undef for builtin types and sequence<T>
486 types (we might not have to do this, however...).
487 (check): Support checking for Exceptions, Valuetypes,
488 and Typedefs.
489 ($serialize_type): Support for "DOMString::::" syntax.
490 (Typedef idl_text): Output Type as "DOMString" if it
491 is really "DOMString" (i.e. its internal representation
492 is "::DOMString::").
493
494 2008-08-03 Wakaba <wakaba@suika.fam.cx>
495
496 * WebIDL.pm ($resolve): New code, based on resolve code
497 for constant types in the |check| method.
498 (check): Support for checking of attributes, operations, and
499 arguments.
500 (Attribute/Operation idl_text): Exception names in getraises,
501 setraises, and raises clauses is serizlied by |$serialize_type|
502 code.
503
504 2008-08-02 Wakaba <wakaba@suika.fam.cx>
505
506 * WebIDL.pm ($integer): Order of selections are changed to match
507 hexadecimal numbers (the original pattern, taken from the spec,
508 was not work for hexadecimal numbers, because the "0" prefix
509 matches to the [0-7]* part (as an empty string) and therefore
510 it does not match with remaining "x..." part of a "0x..." integer
511 literal.
512 ($get_type): It now returns a string, not an array reference,
513 for regular types and |sequence| types (i.e. it in any case
514 returns a string).
515 ($get_next_token): The second item in the array that represents
516 a integer or float token is now a Perl number value, not the
517 original string representation of the number.
518 (check): Support for const value consistency checking.
519 No extended attribute is defined for constants.
520 (Node subclasses): Use simple strings rather than array references
521 for default data type values.
522 ($serialize_type): Type values are now simple strings.
523 (value): If the new attribute value is a false value, then
524 a FALSE value is set to the attribute.
525
526 2008-08-02 Wakaba <wakaba@suika.fam.cx>
527
528 * WebIDL.pm ($get_scoped_name): Now scoped names are stored
529 in its stringified format ("scoped name" as defined in the
530 spec). Note that future version of this module should not use
531 array references for type values and the |type_text| attribute
532 should be made obsolete.
533 (parse_char_string): Unescape attribute names.
534 (check): Support for checking of whether inherited interfaces
535 are actually defined or not. Support for checking of whether
536 interface member identifiers are duplicated or not.
537 ($serialize_type): Scoped names are returned as is. A future
538 version of this code should escape identifiers other than "DOMString",
539 otherwise the idl_text would be non-conforming.
540
541 2008-08-02 Wakaba <wakaba@suika.fam.cx>
542
543 * WebIDL.pm (parse_char_string): Set line/column numbers
544 to generated nodes. Unescape identifiers. Extended attributes
545 for Definition's were ignored.
546 (append_child): Set |parent_node| attribute.
547 (parent_node): New attribute.
548 (check): Support interface/exception members. Support
549 extended attributes. Support definition identifier uniqueness
550 constraint.
551 (qualified_name): New attribute.
552 (Interface/Exception idl_text): Extended attributes were
553 not prepended to the returned text.
554
555 2008-08-02 Wakaba <wakaba@suika.fam.cx>
556
557 * WebIDL.pm (parse_char_string): Set line/column numbers
558 to interface object experimentally. s/shift/pop/g, shift
559 would make things wrong. Support for interface forward
560 declarations was missing. Broken interface declarations
561 with no block were not ignored entirely.
562 (Whatpm::WebIDL::Node): New abstract class. This class
563 makes things easier.
564 (child_nodes): New attribute. Unlike DOM's attribute with
565 same name, this attribute returns a dead list of nodes for
566 simplicity.
567 (get_user_data, set_user_data): New methods.
568 (Module idl_text): A SPACE character should be inserted
569 before the |{| character.
570 (Interface idl_text): Support for interface forward declarations.
571 (is_forward_declaration): New attribute.
572
573 2008-07-19 Wakaba <wakaba@suika.fam.cx>
574
575 * WebIDL.pm (type_text): Better serializer.
576
577 2008-07-19 Wakaba <wakaba@suika.fam.cx>
578
579 * WebIDL.pm: Revise forward-compatible parsing so that
580 it now can handle broken extended attributes and as such.
581
582 2008-07-19 Wakaba <wakaba@suika.fam.cx>
583
584 * WebIDL.pm: Real support for extended attributes.
585 Support for extended attributes with arguments.
586
587 2008-07-19 Wakaba <wakaba@suika.fam.cx>
588
589 * WebIDL.pm: Support for |exception| syntax.
590 (Interface->idl_text): Tentative support for inheritances.
591
592 2008-07-19 Wakaba <wakaba@suika.fam.cx>
593
594 * WebIDL.pm: Hierarchical scoped name support was broken.
595 Support for raises, setraises, and getraises syntaxes.
596
597 2008-07-18 Wakaba <wakaba@suika.fam.cx>
598
599 * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
600 proper support for types, extended attributes, and exceptions yet).
601 WebIDL parser, version 1 (no support for exceptions yet,
602 no proper support for extended attributes yet).
603
604 2008-07-09 Wakaba <wakaba@suika.fam.cx>
605
606 * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
607
608 2008-06-29 Wakaba <wakaba@suika.fam.cx>
609
610 * WebIDL.pm: Support for valuetype and const.
611
612 2008-06-29 Wakaba <wakaba@suika.fam.cx>
613
614 * WebIDL.pm: New module.
615
616 2008-06-15 Wakaba <wakaba@suika.fam.cx>
617
618 * Makefile (Entities.html): URI changed.
619
620 2008-06-08 Wakaba <wakaba@suika.fam.cx>
621
622 * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
623
624 2008-06-01 Wakaba <wakaba@suika.fam.cx>
625
626 * HTML.pm.src (_get_next_token): A parse error was missing.
627
628 2008-06-01 Wakaba <wakaba@suika.fam.cx>
629
630 * mklinktypelist.pl: rel=contact is no longer part of the HTML5
631 spec (commented out). (HTML5 revision 1711).
632
633 2008-05-25 Wakaba <wakaba@suika.fam.cx>
634
635 * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
636
637 * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
638 as UTF-16 (HTML5 revision 1701).
639
640 2008-05-25 Wakaba <wakaba@suika.fam.cx>
641
642 * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
643 1692).
644
645 2008-05-25 Wakaba <wakaba@suika.fam.cx>
646
647 * HTML.pm.src: The secondary insertion mode used when switching
648 to foreign content is the "in body" insertion mode (HTML5 revision
649 1696).
650
651 2008-05-25 Wakaba <wakaba@suika.fam.cx>
652
653 * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
654 revision 1697).
655
656 2008-05-24 Wakaba <wakaba@suika.fam.cx>
657
658 * HTML.pm.src: Support for end-of-file token in foreign content
659 insertion mode (HTML5 revision 1693). Update SVG camelCase
660 attribute list (HTML5 revision 1700). <textarea> closes
661 </select> (HTML5 revision 1699). More start tags close in
662 foreign content insertion mode (HTML5 revision 1698).
663
664 2008-05-24 Wakaba <wakaba@suika.fam.cx>
665
666 * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
667
668 2008-05-24 Wakaba <wakaba@suika.fam.cx>
669
670 * HTML.pm.src: More robust charset parameter detection (HTML5
671 revision 1674).
672
673 2008-05-24 Wakaba <wakaba@suika.fam.cx>
674
675 * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
676 revision 1676).
677
678 2008-05-24 Wakaba <wakaba@suika.fam.cx>
679
680 * HTML.pm.src: Ignore language part of public identifiers for
681 quriks mode detection (HTML5 revision 1679).
682
683 2008-05-24 Wakaba <wakaba@suika.fam.cx>
684
685 * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
686 revision 1685).
687
688 2008-05-24 Wakaba <wakaba@suika.fam.cx>
689
690 * HTML.pm.src: Support for EOF in new states for tags (HTML5
691 revision 1684).
692
693 2008-05-24 Wakaba <wakaba@suika.fam.cx>
694
695 * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
696 work (HTML5 revision 1690).
697
698 2008-05-24 Wakaba <wakaba@suika.fam.cx>
699
700 * HTML.pm.src (_tree_construction_main): Change handling of
701 end tags in head insertion modes (HTML5 revision 1686).
702 (parse_char_string): Bug fix for non-utf8 character string handlings.
703 (parse_char_stream): |ungetc| does not work well for this context.
704
705 2008-05-18 Wakaba <wakaba@suika.fam.cx>
706
707 * HTML.pm.src (parse_byte_string): Redefined to invoke
708 |parse_byte_stream|.
709 (parse_byte_stream): New method.
710
711 2008-05-18 Wakaba <wakaba@suika.fam.cx>
712
713 * HTML.pm.src (parse_byte_string): Fix the column number reported
714 by encoding layer error reporter.
715
716 2008-05-17 Wakaba <wakaba@suika.fam.cx>
717
718 * HTML.pm.src (parse_byte_string): Use streaming decoder
719 rather than converting the whole byte string and then parsing.
720 Propagate errors in character encoding layer.
721 (get_next_token): Precise error reporting for |bare stago| error.
722
723 2008-05-17 Wakaba <wakaba@suika.fam.cx>
724
725 * HTML.pm.src (parse_char_stream): New method.
726 (parse_char_string): This method is now defined as an invocation
727 of the |parse_char_stream| method.
728
729 2008-05-17 Wakaba <wakaba@suika.fam.cx>
730
731 * HTML.pm.src (parse_byte_string): Report various status
732 of the sniffing as info-level errors. Support for new
733 decoding framework in parser resestting.
734 (new): Various default error levels were not set.
735
736 2008-05-17 Wakaba <wakaba@suika.fam.cx>
737
738 * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
739 algorithm, except for the actual sniffing, is implemented
740 with new framework with Message::Charset::Info.
741
742 2008-05-16 Wakaba <wakaba@suika.fam.cx>
743
744 * CacheManifest.pm (_parse): Drop fragment identifiers from
745 URIs in fallback section (HTML5 revision 1596).
746
747 2008-05-10 Wakaba <wakaba@suika.fam.cx>
748
749 * Makefile (Entities.html): URI has changed.
750
751 2008-05-10 Wakaba <wakaba@suika.fam.cx>
752
753 * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
754 1553).
755
756 2008-05-06 Wakaba <wakaba@suika.fam.cx>
757
758 * ContentChecker.pm: Noted that those returned in |table| are
759 no longer table elements, but table objects returned
760 by Whatpm::HTMLTable.
761
762 * HTMLTable.pm (form_table): Return table element node
763 as |$table->{element}|.
764 (assign_header): Support for the |headers=""| attribute.
765
766 2008-05-06 Wakaba <wakaba@suika.fam.cx>
767
768 * HTMLTable.pm (assign_header): New function; first version
769 with no support for headers="".
770 (form_table): Include table width and height to the returned
771 table object for covenience. Indexing in column assignement was wrong.
772 Set whether a data cell is empty or not for convenience.
773
774 2008-05-05 Wakaba <wakaba@suika.fam.cx>
775
776 * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
777
778 2008-05-05 Wakaba <wakaba@suika.fam.cx>
779
780 * HTMLTable.pm: How table model errors are detected is
781 changed (HTML5 revision 1387).
782
783 2008-05-05 Wakaba <wakaba@suika.fam.cx>
784
785 * HTMLTable.pm: The algorithm now moves |tfoot| elements
786 to the end of the table (HTML5 revision 1380).
787
788 2008-05-05 Wakaba <wakaba@suika.fam.cx>
789
790 * HTMLTable.pm: The algorithm is now 0-based indexing, instead
791 of 1-based (HTML5 revision 1376).
792
793 2008-05-05 Wakaba <wakaba@suika.fam.cx>
794
795 * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
796 and "Content-Encoding" no longer prevent sniffing (HTML5
797 revision 1288).
798
799 2008-05-05 Wakaba <wakaba@suika.fam.cx>
800
801 * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
802 1282).
803
804 2008-05-03 Wakaba <wakaba@suika.fam.cx>
805
806 * ContentChecker.pm: Support for global attributes.
807 Status of XML specs are added.
808
809 2008-05-03 Wakaba <wakaba@suika.fam.cx>
810
811 * ContentChecker.pm (check_element): Support for |template=""|
812 and |ref=""| attribute (referent element type checking).
813
814 2008-04-29 Wakaba <wakaba@suika.fam.cx>
815
816 * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
817 1500) is implemented (except for IDNA part and URI-scheme-specific
818 knowledge). Line number counting was wrong for LF-only
819 documents.
820
821 2008-04-13 Wakaba <wakaba@suika.fam.cx>
822
823 * HTML.pm.src: Raise an parse error for any disallowed
824 character (HTML5 revision 1263).
825
826 2008-04-13 Wakaba <wakaba@suika.fam.cx>
827
828 * mkentitylist.pl: Support for new HTML5 entity table format (the
829 definition for |AElig;| was missing).
830
831 2008-04-13 Wakaba <wakaba@suika.fam.cx>
832
833 * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
834 name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
835
836 2008-04-13 Wakaba <wakaba@suika.fam.cx>
837
838 * HTML.pm.src: List of element names that close foreign content
839 insertion mode is added (HTML5 revisions 1412 and 1418).
840
841 2008-04-13 Wakaba <wakaba@suika.fam.cx>
842
843 * HTML.pm.src: Support for |mglyph| and |malignmark|
844 elements (HTML5 revision 1410).
845
846 2008-04-13 Wakaba <wakaba@suika.fam.cx>
847
848 * HTML.pm.src: Support for new long MathML entities (HTML5
849 revision 1406).
850
851 2008-04-13 Wakaba <wakaba@suika.fam.cx>
852
853 * HTML.pm.src: CDATA section support for MathML and SVG
854 elements (HTML5 revision 1404 and 1420).
855
856 2008-04-12 Wakaba <wakaba@suika.fam.cx>
857
858 * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
859 elements (HTML5 revision 1404). Unused !!!macro definitions
860 are removed.
861
862 2008-04-12 Wakaba <wakaba@suika.fam.cx>
863
864 * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
865 are raised is changed (HTML5 revision 1404).
866
867 2008-04-06 Wakaba <wakaba@suika.fam.cx>
868
869 * HTML.pm.src: Category sets that are no longer used are removed.
870
871 2008-04-06 Wakaba <wakaba@suika.fam.cx>
872
873 * HTML.pm.src: The ->[1] property of stack entries are now
874 replaced by constants representing element category.
875
876 2008-04-06 Wakaba <wakaba@suika.fam.cx>
877
878 * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
879 for error reporting. (This is a preparation for using constant
880 value for ->[1].)
881
882 2008-03-22 Wakaba <wakaba@suika.fam.cx>
883
884 * RDFXML.pm: Typo fixed.
885
886 2008-03-22 Wakaba <wakaba@suika.fam.cx>
887
888 * ContentChecker.pm: |fact_level| is now treated
889 as same as |must_level|, i.e. level = |m|.
890 (check_element): Make list of URIs in the DOM.
891
892 2008-03-21 Wakaba <wakaba@suika.fam.cx>
893
894 * RDFXML.pm: Language accessor implemented.
895 Local (null-namespace) attribute support.
896
897 2008-03-21 Wakaba <wakaba@suika.fam.cx>
898
899 * RDFXML.pm: Factored out ID checking code.
900
901 2008-03-21 Wakaba <wakaba@suika.fam.cx>
902
903 * RDFXML.pm: TODO items noted. Validation of ID
904 and URI attributes is implemented. Warn if unknown
905 value is used in rdf:parseType="" attribute.
906
907 * URIChecker.pm (check_rdf_uri_reference): New function.
908
909 2008-03-21 Wakaba <wakaba@suika.fam.cx>
910
911 * RDFXML.pm: bnodeid implemented. Relative references
912 are now resolved.
913
914 2008-03-21 Wakaba <wakaba@suika.fam.cx>
915
916 * ContentChecker.pm: RDF reification implemented.
917
918 * RDFXML.pm: undef vs false bug fixed.
919 Reification implemented.
920
921 2008-03-21 Wakaba <wakaba@suika.fam.cx>
922
923 * RDFXML.pm: s/id/ID/ for attribute name.
924 The |node| arguments are added for |ontriple| calls.
925 Too many "attribute not allowed" errors were raised.
926
927 * ContentChecker.pm: Initial experimental support for rdf:RDF
928 element.
929
930 2008-03-21 Wakaba <wakaba@suika.fam.cx>
931
932 * RDFXML.pm: New module.
933
934 2008-03-20 Wakaba <wakaba@suika.fam.cx>
935
936 * HTML.pm.src (set_inner_html): Line/column number
937 code was old one yet.
938
939 2008-03-20 Wakaba <wakaba@suika.fam.cx>
940
941 * HTML.pm.src: Better line/column reporting for "duplicate attribute"
942 errors. Line/column markings for DOCTYPE, comment, and
943 character tokens are reintroduced; otherwise,
944 error location for "not HTML5" error and errors
945 for implied elements are not attached.
946
947 2008-03-20 Wakaba <wakaba@suika.fam.cx>
948
949 * HTML.pm.src: Set line/column numbers to attributes.
950
951 * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
952 (value): Setter implemented.
953
954 * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
955
956 2008-03-20 Wakaba <wakaba@suika.fam.cx>
957
958 * HTML.pm.src: Unused line/column markings are removed.
959
960 2008-03-20 Wakaba <wakaba@suika.fam.cx>
961
962 * HTML.pm.src (_get_next_token): Remove |first_start_tag|
963 flag, which is no longer used.
964
965 2008-03-17 Wakaba <wakaba@suika.fam.cx>
966
967 * HTML.pm.src: Set line/column information to element nodes.
968
969 * mkhtmlparser.pl (!!!create-element, !!!insert-element,
970 and !!!insert-element-t): Set line/column information to
971 element nodes.
972
973 2008-03-17 Wakaba <wakaba@suika.fam.cx>
974
975 * HTML.pm.src (_get_next_token): The first "<" character
976 in "<?", "<>", or "</>" should be the error point.
977
978 2008-03-16 Wakaba <wakaba@suika.fam.cx>
979
980 * HTML.pm.src: Some more fixes on error position reporting.
981
982 2008-03-16 Wakaba <wakaba@suika.fam.cx>
983
984 * HTML.pm.src: Token-level precious error reporting.
985
986 2008-03-16 Wakaba <wakaba@suika.fam.cx>
987
988 * HTML.pm.src: Preparation for more precious error point
989 reporting.
990
991 2008-03-11 Wakaba <wakaba@suika.fam.cx>
992
993 * HTML.pm.src: Error type revised.
994
995 2008-03-11 Wakaba <wakaba@suika.fam.cx>
996
997 * HTML.pm.src: Similar codes are merged together, again.
998
999 2008-03-11 Wakaba <wakaba@suika.fam.cx>
1000
1001 * HTML.pm.src: Similar codes are merged together.
1002
1003 2008-03-10 Wakaba <wakaba@suika.fam.cx>
1004
1005 * mkhtmlparser.pl: Set "level" parameter to parse errors.
1006
1007 * HTML.pm.src: Code refined.
1008
1009 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1010
1011 * HTML.pm.src: |</body>| treatement has been changed (HTML5
1012 revision 1348). Note that I really don't know this makes
1013 any difference in the black-box behavior of the parser.
1014
1015 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1016
1017 * HTML.pm.src: New end-of-file token implementation (HTML5
1018 revision 1348).
1019
1020 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1021
1022 * HTML.pm.src: |applet| support (HTML5 revision 1347).
1023
1024 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1025
1026 * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
1027
1028 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1029
1030 * HTML.pm.src: Support for |<input>| in the "in select" insertion
1031 mode and support for the "in select in table" insertion mode (HTML5
1032 revision 1342).
1033
1034 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1035
1036 * HTML.pm.src: No foster parenting for <script> and <script>
1037 in non-tainted <table>s (HTML5 revision 1336).
1038
1039 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1040
1041 * HTML.pm.src: Ignore white space characters between <html>
1042 and <head> (HTML5 revision 1332).
1043
1044 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1045
1046 * HTML.pm.src: Treat <input type=hidden> as if it were a
1047 white space (HTML5 revision 1331).
1048
1049 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1050
1051 * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
1052 element (HTML5 revision 1330).
1053
1054 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1055
1056 * HTML.pm.src: <title> is always appended to the current
1057 element (HTML5 revision 1328).
1058
1059 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1060
1061 * HTML.pm.src: White space in tainted tables are moved
1062 into foster parents (HTML5 revision 1326).
1063
1064 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1065
1066 * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
1067 revision 1321).
1068
1069 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1070
1071 * HTML.pm.src: |</h/n/>| case code rearranged to align with
1072 the spec (HTML5 revision 1320). Note that we finally complete
1073 all of HTML5 revision 1320 changes.
1074
1075 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1076
1077 * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
1078 tags (HTML5 revision 1320).
1079
1080 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1081
1082 * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
1083
1084 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1085
1086 * HTML.pm.src: A "generate implied end tags" code (t409.1)
1087 could not be reached so that it is now removed (HTML5 revision
1088 1320).
1089
1090 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1091
1092 * HTML.pm.src: Code for the case of |</div>| and so on are revised
1093 to align with new spec text (HTML5 revision 1320).
1094
1095 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1096
1097 * HTML.pm.src: Remove strange |if| condition; however, it should
1098 have had no harm in theory.
1099
1100 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1101
1102 * HTML.pm.src (_tree_construction_main): '</p>' in body
1103 case is split from other end tags for the preparation
1104 of implementing HTML revision 1320.
1105
1106 2008-03-07 Wakaba <wakaba@suika.fam.cx>
1107
1108 * HTML.pm.src: Simplified "generate implied end tag" (HTML5
1109 revision 1320).
1110
1111 2008-03-07 Wakaba <wakaba@suika.fam.cx>
1112
1113 * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
1114 and "div" (HTML5 revision 1318). Add comments to where
1115 |form| pointer association codes should be inserted (HTML5 revision
1116 1319).
1117
1118 2008-03-06 Wakaba <wakaba@suika.fam.cx>
1119
1120 * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
1121
1122 2008-03-05 Wakaba <wakaba@suika.fam.cx>
1123
1124 * HTML.pm.src: Since the case t268 should never be reached (no
1125 other token type, there are), it is replaced by a |die| statement.
1126
1127 2008-03-05 Wakaba <wakaba@suika.fam.cx>
1128
1129 * HTML.pm.src: Typo fixed.
1130
1131 2008-03-04 Wakaba <wakaba@suika.fam.cx>
1132
1133 * HTML.pm.src (_tree_construction_initial): Some limited quirks
1134 doctypes were not uppercased for comparison.
1135
1136 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1137
1138 * HTML.pm.src (tree construction and set_inner_html): Checkpoints
1139 are added.
1140
1141 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1142
1143 * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
1144 are set. Cases that are unlikely reached are noted as so.
1145
1146 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1147
1148 * HTML.pm.src: Checkpoints for debugging are added.
1149
1150 * mkhtmlparser.pl: Support for |!!!cp| syntax.
1151
1152 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1153
1154 * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
1155 for simplicity.
1156
1157 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1158
1159 * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
1160 revision 1307).
1161
1162 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1163
1164 * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
1165 is set are changed (HTML5 revision 1305).
1166
1167 2008-03-02 Wakaba <wakaba@suika.fam.cx>
1168
1169 * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
1170 1292). Entities are not parsed in comment-like part in RCDATA
1171 elements (HTML5 revision 1294). Allow bare & at the end
1172 of attribute value literals (HTML5 revision 1296). More
1173 quirks mode doctypes (HTML5 revision 1302). Requires spaces
1174 between attributes and ban attribute names or unquoted
1175 attribute values containing single or double quotes (HTML5
1176 revision 1303).
1177
1178 2008-03-02 Wakaba <wakaba@suika.fam.cx>
1179
1180 * ContentChecker.pm: Typo fixed. Don't raise "character encoding"
1181 and related errors unless it is an HTML document (though the spec
1182 is unclear on whether it is applied to XHTML document).
1183
1184 * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
1185 are added.
1186
1187 2008-03-02 Wakaba <wakaba@suika.fam.cx>
1188
1189 * HTML.pm.src: s/local_name/manakai_local_name/g.
1190
1191 2008-03-01 Wakaba <wakaba@suika.fam.cx>
1192
1193 * _NamedEntityList.pm: Updated (HTML5 revision 1286).
1194
1195 * HTML.pm.src: |charset| in |content| attribute is
1196 case-insensitive (HTML5 revision 1270).
1197
1198 2008-02-26 Wakaba <wakaba@suika.fam.cx>
1199
1200 * ContentChecker.pm: New status constants are added.
1201 ($ElementDefault): |status| added.
1202 (check_element): Err for non-standard or deprecated elements.
1203 (_attr_status_info): For non-standard or deprecated attributes.
1204
1205 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1206
1207 * ContentChecker.pm (_attr_status_info): New internal method.
1208
1209 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1210
1211 * ContentChecker.pm (check_element): Element standardized
1212 status information is now dispatched.
1213
1214 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1215
1216 * ContentChecker.pm (check_element): Fix |del|-and-significant
1217 problem by adding some more arguments.
1218
1219 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1220
1221 * ContentChecker.pm (check_element): Use context of
1222 container-for-the-purpose-of-content-model element (not transparent
1223 element) for |check_child_element| calling and significant
1224 text flag marking. This reintroduces |<del>|-and-significant
1225 problem again.
1226
1227 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1228
1229 * ContentChecker.pm (check_element): Make semi-transparent
1230 elements ignored for the purpose of phase changes in
1231 content model checking.
1232
1233 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1234
1235 * ContentChecker.pm (check_element): In-element state
1236 was not properly managed for transparent cases.
1237
1238 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1239
1240 * ContentChecker.pm (check_element): Support for |video|
1241 and |audio| as semi-transparent elements.
1242
1243 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1244
1245 * ContentChecker.pm ($HTMLSemiTransparentElements): New.
1246 (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
1247 Support for |html:object| as a semi-transparent element.
1248
1249 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1250
1251 * ContentChecker.pm (check_element): The way to traverse
1252 the tree is entirely revised to make it easier to track
1253 the state of ancestors/descendants. As a result of this
1254 revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1255 support for content model checking for HTML elements |figure|,
1256 |object|, |video|, and |audio| and checking for XML elements (and
1257 some XMLNS checkings) are dropped for now. They will be
1258 reimplemented in due cource.
1259
1260 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1261
1262 * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1263 token (HTML5 revision 1225).
1264
1265 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1266
1267 * ContentChecker.pm ({unsupported_level}): New value.
1268
1269 * HTML.pm.src: Save whether |meta| |content| attribute
1270 contains character references or not.
1271
1272 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1273
1274 * ContentChecker.pm (_get_children): (Incompleted) attempt to
1275 imlement significant content checking for contents
1276 with |del| elements.
1277
1278 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1279
1280 * ContenteChecker.pm ($HTMLTransparentElements): More
1281 elements are added.
1282 (_get_children): HTML |object| elements are now semi-transparent.
1283
1284 * NanoDOM.pm (manakai_html, manakai_head): New methods.
1285
1286 2008-02-16 Wakaba <wakaba@suika.fam.cx>
1287
1288 * CacheManifest.pm: HTML5 revision 1211 implemented.
1289
1290 * CacheManifest.pod: Updated.
1291
1292 2008-02-10 Wakaba <wakaba@suika.fam.cx>
1293
1294 * ContentChecker.pm (check_document, check_element): Support
1295 for second argument ($onsubdoc).
1296 (_get_css_parser): Removed (now it is part of WDCC).
1297
1298 2008-02-09 Wakaba <wakaba@suika.fam.cx>
1299
1300 * ContentChecker.pm (_get_css_parser): New.
1301
1302 2007-11-25 Wakaba <wakaba@suika.fam.cx>
1303
1304 * ContentChecker.pm ($AnyChecker): Old way to add child elements
1305 for checking had been used.
1306
1307 2007-11-25 Wakaba <wakaba@suika.fam.cx>
1308
1309 * ContentChecker.pm (check_element): New todo item type |descendant|.
1310
1311 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1312
1313 * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1314 specced in RFC 2046 and RFC 4288.
1315 (application/atom+xml): Definition added.
1316
1317 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1318
1319 * URIChecker.pm: Make RFC 3986 should-level errors
1320 warnings (rather than SHOULD-level errors).
1321
1322 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1323
1324 * NanoDOM.pm (get_user_data, set_user_data): New methods.
1325
1326 * HTML.pm.src: A flag for character references in attribute
1327 values are added. Set |manakai_has_reference| user data
1328 to |charset| attribute.
1329
1330 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1331
1332 * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1333 attributes.
1334
1335 * ContentChecker.pm (check_document): Warn if charset requirements
1336 cannot be tested.
1337
1338 2007-11-19 Wakaba <wakaba@suika.fam.cx>
1339
1340 * HTML.pm.src (parse_byte_string): Detect charset
1341 by universalchardet if charset parameter is not specified.
1342
1343 * Makefile (Charset-all, Charset-clean): New rules.
1344
1345 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1346
1347 * ContentChecker.pm (check_document): Check the existence
1348 of character encoding declaration and selection of encoding
1349 for HTML document.
1350
1351 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1352
1353 * ContentType.pm (get_sniffed_type): Return also the official
1354 type in list context.
1355
1356 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1357
1358 * ContentType.pm: Sniffing with leading white space
1359 ignoring (HTML5 revisions 1013 and 1016).
1360
1361 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1362
1363 * ContentType.pm: HTML5 revision 1013 changes, except for leading
1364 white spaces, are implemented.
1365
1366 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1367
1368 * HTML.pm.src (parse_char_string): Set |inner_encoding|
1369 attribute if possible.
1370
1371 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1372
1373 * HTML.pm.src (parse_byte_string): New method.
1374 (parse_char_string): New alias for |parse_string|.
1375 (main phase): Invoking "change the encoding" algorithm if desired.
1376
1377 * HTML.pod: Updated.
1378
1379 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1380
1381 * HTML.pod (get_inner_html): Removed.
1382
1383 * Makefile (HTML-all, HTML-clean): New.
1384
1385 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1386
1387 * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1388
1389 2007-11-08 Wakaba <wakaba@suika.fam.cx>
1390
1391 * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1392
1393 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1394
1395 * Makefile: |CacheManifest.html| is added.
1396
1397 * CacheManifest.pod: New file.
1398
1399 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1400
1401 * CacheManifest.pm: New module.
1402
1403 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1404
1405 * HTML.pm.src: Support for application cache selection algorithm
1406 callback.
1407
1408 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1409
1410 * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1411
1412 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1413
1414 * Makefile (clean): New rule.
1415
1416 * NanoDOM.pm (public_id, system_id): New attributes.
1417
1418 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1419
1420 * Makefile (CSS-all, CSS-clean, clean): New rules.
1421
1422 2007-10-14 Wakaba <wakaba@suika.fam.cx>
1423
1424 * ContentChecker.pm (check_document): Support for
1425 new |is_xml_root| flag.
1426 (check_element): Support for new |pluses| state.
1427 (_add_pluses): New method.
1428 (_remove_minuses): Support for new |minus| item.
1429
1430 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1431
1432 * ContentChecker.pm: Raise specific error for invalid
1433 root element.
1434
1435 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1436
1437 * ContentChecker.pm: Set level values for later uses.
1438
1439 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1440
1441 * ContentChecker.pm: Support for language tag validation.
1442
1443 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1444
1445 * LangTag.pm (check_rfc3066_language_tag): New method.
1446
1447 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1448
1449 * LangTag.pm: New module.
1450
1451 2007-09-04 Wakaba <wakaba@suika.fam.cx>
1452
1453 * HTML.pm.src: Some error types were wrong.
1454
1455 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1456
1457 * CSS/: New directory.
1458
1459 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1460
1461 * ContentChecker.pm (_check_get_children): Support
1462 for |noscript| in |head|.
1463
1464 2007-08-12 Wakaba <wakaba@suika.fam.cx>
1465
1466 * URI/: New directory.
1467
1468 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1469
1470 * HTML.pm.src: Tokenizer's states are now represented in
1471 number.
1472
1473 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1474
1475 * HTML.pm.src: |or|s for insertion modes are replaced
1476 by |&|s.
1477
1478 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1479
1480 * HTML.pm.src: Token types are now represented in number.
1481
1482 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1483
1484 * ContentType.pm (SEE ALSO): Updated.
1485
1486 * HTML.pm.src: Insertion modes are now represented in number.
1487
1488 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1489
1490 * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1491 999) is implemented.
1492
1493 2007-08-08 Wakaba <wakaba@suika.fam.cx>
1494
1495 * ContentType.pm: Sniffing for |<script| (HTML5 revision
1496 983) is implemented.
1497
1498 2007-08-06 Wakaba <wakaba@suika.fam.cx>
1499
1500 * ContentChecker.pod: New documentation.
1501
1502 * Makefile: A rule for |ContentChecker.html| is added.
1503
1504 * ContentChecker.pm: A pod "LICENSE" section is added.
1505
1506 * NanoDOM.pm ($VERSION): New variable.
1507
1508 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1509
1510 * H2H.pm: |b|, |i|, and |sub| are added to the
1511 list of allowed HTML elements.
1512
1513 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1514
1515 * H2H.pm: |samp| is added to the list of allowed
1516 HTML elements.
1517
1518 * URIChecker.pm (check_iri): New.
1519 (check_iri_reference): Error type for IRI reference
1520 syntax error is changed.
1521
1522 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1523
1524 * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1525 is added.
1526 (check_document): Load appropriate module before validation.
1527
1528 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1529
1530 * ContentChecker/: New directory.
1531
1532 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1533
1534 * ContentChecker.pm: HTML |time| element is implemented.
1535
1536 * HTMLTable.pm: Comments are updated as HTML5 is revised.
1537
1538 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1539
1540 * ContentChecker.pm (check_document): Return value
1541 even if no document element is found.
1542
1543 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1544
1545 * HTML.pm.src: |$in_body| is no longer a function.
1546
1547 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1548
1549 * HTML.pm.src: The |$in_body| code has been moved down.
1550
1551 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1552
1553 * HTML.pm.src: The "trailing end" insertion mode
1554 is split into "after html body" and "after html frameset"
1555 insertion modes. Their codes are merged with "after body"
1556 and "after frameset" codes. |$previous_insertion_mode|
1557 has been removed. "after frameset" code is
1558 merged with "in frameset" code.
1559
1560 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1561
1562 * HTML.pm.src: The "before head" insertion mode is
1563 merged with the "in head" insertion mode.
1564
1565 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1566
1567 * HTML.pm.src: Cases in "in head" insertion mode are
1568 reorganized.
1569
1570 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1571
1572 * HTML.pm.src: Some cases in "in table" insertion mode
1573 are merged.
1574
1575 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1576
1577 * HTML.pm.src: The "in row" insertion mode is merged
1578 with "in table" insertion mode.
1579
1580 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1581
1582 * HTML.pm.src: The "in table" and "in table body" insertion
1583 modes are merged.
1584
1585 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1586
1587 * HTML.pm.src: There is no "in table head" or "in table foot"
1588 insertion mode!
1589
1590 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1591
1592 * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
1593 now directly invoke the handler.
1594
1595 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1596
1597 * HTML.pm.src: Codes for "in cell" insertion mode
1598 is merged to the "in body" insertion mode code.
1599
1600 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1601
1602 * HTML.pm.src: Codes for "in body" and "in caption"
1603 insertion modes are merged.
1604
1605 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1606
1607 * HTML.pm.src: Two |!!!next-token|s were missing.
1608
1609 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1610
1611 * HTML.pm.src: Use numeric constant for |{content_mode}|
1612 instead of string constant for |{content_model_flag}|.
1613
1614 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1615
1616 * HTML.pm.src: Add the name of the attribute
1617 to the "duplicate attribute" error.
1618
1619 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1620
1621 * ContentChecker.pm: Return the |class| node list.
1622
1623 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1624
1625 * ContentChecker.pm: Return the |id| node list.
1626
1627 * HTML.pm.src: A typo is fixed.
1628
1629 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1630
1631 * ContentChecker.pm: Drop wrong |level => 'error'| specification
1632 from "in HTML:xml:lang" error. Character position
1633 is now the last part of the error type in the URI error
1634 description. Report "unsupported" status for language
1635 tags, media queries, script codes, and style sheets.
1636
1637 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1638
1639 * ContentChecker.pm: Report error if |xml:lang|
1640 in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
1641 in XML.
1642
1643 * NanoDOM.pm (Attr.owner_document): New attribute.
1644
1645 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1646
1647 * HTML.pm.src: The character immediately following
1648 a bare |hcro| was discarded. Fix handling of
1649 entity references in attribute values.
1650
1651 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1652
1653 * HTML.pm.src (main and trailin end phases): Token
1654 types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
1655 factored out. Error types |in html:#DOCTYPE|
1656 and |after html:#DOCTYPE| are merged
1657 into |DOCTYPE in the middle|. |</frameset>|
1658 in fragment parsing mode changed the
1659 insertion mode.
1660
1661 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1662
1663 * HTML.pm.src: |$phase| has been removed; The |trailing end|
1664 phase is now an insertion mode. Treatments for white
1665 space character tokens were incorrect for some
1666 insertion modes. An old |meta| case was not removed.
1667
1668 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1669
1670 * HTML.pm.src: |meta| charset declaration extraction
1671 implemented (but changing the encoding is not yet:-).
1672
1673 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1674
1675 * Charset/: New directory.
1676
1677 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1678
1679 * H2H.pm: New Perl module (created from
1680 manakai's H2H.dis).
1681
1682 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1683
1684 * XMLSerializer.pm: New Perl module (created from
1685 manakai's SimpleLS.dis).
1686
1687 2007-07-07 Wakaba <wakaba@suika.fam.cx>
1688
1689 * HTML.pm.src: |<!---x-->| was not processed correctly.
1690
1691 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1692
1693 * HTML.pm.src: Report correct error message
1694 for |<body></div></body>|.
1695
1696 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1697
1698 * HTMLTable.pm: An error description was incorrect.
1699
1700 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1701
1702 * ContentChecker.pm: Return |{term}| list.
1703
1704 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1705
1706 * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
1707 nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
1708 and <title> outside of head).
1709
1710 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1711
1712 * IMTChecker.pm: Report warning for unregistered
1713 and private types/subtypes.
1714
1715 * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
1716 URIChecker.pm, HTMLTable.pm: Error messages are now
1717 consistent; they are all listed in
1718 <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
1719
1720 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1721
1722 * ContentChecker.pm: |<img ismap>| not in |<a></a>|
1723 is now erred. |<datalist>| is implemented.
1724 Attribute checker for |<command>| and |<menu>| are
1725 added. Support for |contextmenu| global attribute
1726 is added.
1727
1728 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1729
1730 * HTML.pm.src (_reset_insertion_mode): Interpretation
1731 of Step 3 has been changed.
1732
1733 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1734
1735 * HTML.pm.src: Late |<html>| parse error is implemented.
1736
1737 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1738
1739 * URIChecker.pm (check_iri_reference): A |decode| method name was
1740 incorrect.
1741
1742 * ContentChecker.pm: Support for the |footer| element.
1743 Check URI syntax for space-separated URI attributes.
1744 Support for the |tabindex| attribute. Support
1745 for |datetime| attribute.
1746
1747 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1748
1749 * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
1750 character references). HTML5 revision 1.146 (white space
1751 characters before root start tag). HTML5 revision
1752 1.148 (named character references in attribute values).
1753 HTML5 revision 1.152 (<plaintext>.innerHTML get).
1754
1755 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1756
1757 * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
1758 in <head>).
1759
1760 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1761
1762 * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>
1763 in body).
1764
1765 * ContentChecker.pm: HTML5 revision 938 (scoped="").
1766
1767 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1768
1769 * HTML.pm.src: HTML5 revision 923 (matching end tag in CDATA
1770 or RCDATA in fragment parsing mode). HTML5 revision
1771 924 (<!--> and <!--->). HTML5 revision 926 (hn in hn).
1772
1773 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1774
1775 * HTML.pm.src (get_inner_html): HTML5 revision 922 (inner_html
1776 for <pre> and <textarea>).
1777
1778 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1779
1780 * HTML.pm.src: HTML5 revision 920 (<isindex>).
1781
1782 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1783
1784 * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).
1785
1786 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1787
1788 * HTML.pm.src: HTML5 revision 916 (</body>).
1789 HTML5 revision 917 (conforming bare &).
1790
1791 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1792
1793 * NanoDOM.pm (manakai_is_html): Setting to false did not work.
1794
1795 * HTML.pm.src: HTML5 revision 914 (</ in CDATA, RCDATA).
1796 HTML5 revision 915 (<nobr>).
1797
1798 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1799
1800 * HTML.pm.src: HTML5 revisions 908, 909, 912, and 913 (quirks mode).
1801
1802 * NanoDOM.pm (manakai_is_html, manakai_compat_mode, compat_mode):
1803 New attributes.
1804
1805 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1806
1807 * HTML.pm.src: HTML5 revisions 900, 901, 902, and 911 (<
1808 in tags).
1809
1810 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1811
1812 * .cvsignore: |Entities.html| is added.
1813
1814 * HTML.pm.src: |$entity_char| is removed and
1815 requires |Whatpm::_NamedEntityList| instead.
1816 HTML5 revision 898 (refc), except that lack of refc
1817 is parse error.
1818
1819 * mkentitylist.pl: New script.
1820
1821 * Makefile (all): |_NamedEntityList.pm| is added.
1822 (_NamedEntityList.pm, Entities.html): New rules.
1823
1824 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1825
1826 * HTML.pm.src: Parse errors immediately after U+000D
1827 were ignored and U+000D immediately following another
1828 U+000D was not converted to U+000A.
1829
1830 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1831
1832 * HTML.pm.src (set_inner_html): HTML5 revision 892 (adopt
1833 nodes before appended). Parser was not ready for NULL
1834 parse error and escape flag.
1835
1836 * NanoDOM.pm (adopt_node): New.
1837
1838 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1839
1840 * HTML.pm.src: HTML5 revision 886 (insane comment in
1841 CDATA and RCDATA). Note that current implementation
1842 is simply repeating what the spec says and it is maybe not
1843 a best way to do it.
1844
1845 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1846
1847 * HTML.pm.src: HTML5 revision 884 (</form> don't close
1848 the form element if a descendant element without implied end tag has
1849 still been open).
1850
1851 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1852
1853 * ContentChecker.pm: HTML5 revision 881 (Make |id| attribute with
1854 space characters non-conforming).
1855
1856 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1857
1858 * HTML.pm.src: An error message was incorrect.
1859 HTML5 revision 869 (C1 character references).
1860
1861 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1862
1863 * HTML.pm.src: HTML5 revision 867 (a LF at the beginning of
1864 a |textarea| is removed).
1865
1866 2007-06-05 Wakaba <wakaba@suika.fam.cx>
1867
1868 * NanoDOM.pm (get_attribute_node_ns): New method.
1869
1870 * ContentChecker.pm: |script| |async| and |defer|
1871 no longer require |src|. |async| MUST NOT be
1872 specified if |defer|. (HTML5 revision 858).
1873
1874 2007-05-30 Wakaba <wakaba@suika.fam.cx>
1875
1876 * HTML.pm.src: |<form><form>| went to inifinite loop.
1877
1878 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1879
1880 * ContentChecker.pm (html): Set |is_root| (allowed
1881 as a document element) flag on.
1882 (new): Removed.
1883 (check_document): New method.
1884
1885 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1886
1887 * ContentChecker.pm (thead, tfoot): Checker specifications
1888 were incorrect.
1889
1890 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1891
1892 * ContentChecker.pm ($HTMLURIAttrChecker): Include
1893 error position in the |type| option of the error.
1894
1895 * HTMLTable.pm (form_table): The |$onerror| parameter
1896 is now optional. Some bugs are fixed.
1897
1898 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1899
1900 * HTMLTable.pm: New module.
1901
1902 * ContentChecker.pm (table): Invoke table model error checker.
1903
1904 * NanoDOM.pm (first_child, get_attribute_ns): New.
1905
1906 2007-05-26 Wakaba <wakaba@suika.fam.cx>
1907
1908 * ContentChecker.pm ($HTMLLinkTypesAttrChecker): New checker.
1909 (link/@rel, a/@rel, area/@rel): Use new checker.
1910
1911 * Makefile (_LinkTypeList.pm, RelExtensions.html): New rules.
1912
1913 * _LinkTypeList.pm: New file.
1914
1915 * mklinktypelist.pl: New file.
1916
1917 * .cvsignore: |RelExtensions.html| added.
1918
1919 * NanoDOM.pm (child_nodes): Returns an empty array
1920 for non-child-containing node types.
1921 (text_content): New attribute.
1922
1923 2007-05-26 Wakaba <wakaba@suika.fam.cx>
1924
1925 * IMTChecker.pm: New module.
1926
1927 * ContentChecker.pm ($HTMLIMTAttrChecker): Call IMTChecker
1928 to test parameter value validity.
1929
1930 * HTML.pm.src ($style_start_tag): Attributes were
1931 discarded.
1932
1933 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1934
1935 * ContentChecker.pm ($HTMLURIAttrChecker): Implemented.
1936
1937 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1938
1939 * URIChecker.pm: All recommendations from RFC 3986
1940 and RFC 3987 are listed (not all testable items are checked yet).
1941
1942 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1943
1944 * URIChecker.pm: New module.
1945
1946 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1947
1948 * ContentChecker.pm: Now most attributes are implemented
1949 or associated to some placeholder.
1950 ($ElementDefault): Warn unknown attributes
1951 for unknown elements as "attribute not supported".
1952 ($HTMLLanguageTagAttrChecker, $HTMLMQAttrChecker): New placeholders.
1953 ($HTMLUsemapAttrChecker, $HTMLTargetAttrChecker): New checkers.
1954 (|a| attribute checker): Reimplemented.
1955
1956 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1957
1958 * ContentChecker.pm ($HTMLEventHandlerAttrChecker): New placeholder.
1959 ($HTMLAttrChecker): Event handler content attributes
1960 are added.
1961 (link, embed): Required attribute is now checked.
1962 (embed): Unknown local attributes are no longer warned.
1963
1964 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1965
1966 * ContentChecker.pm ($HTMLSpaceURIsAttrChecker): New placeholder.
1967 ($HTMLIMTAttrChecker): New checker.
1968 (link@rel, link@href, link@type, style@type,
1969 a@href, a@ping, a@ping, a@type, embed@src, embed@type,
1970 object@data, object@type, source@src, source@type, area@alt,
1971 area@shape, area@coords, area@href,
1972 area@ping, area@rel, area@type, script@src,
1973 script@defer, script@async, script@type): Checkers added.
1974
1975 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1976
1977 * ContentChecker.pm: Descendant checking was incorrect.
1978
1979 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1980
1981 * ContentChecker.pm: Support |xml:*| and |xmlns:*|
1982 attributes. Report an error if |Element.prefix|
1983 is |xmlns|.
1984
1985 * NanoDOM.pm (prefix): New attribute.
1986
1987 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1988
1989 * HTML.pm.src: In |main| phase, |in body| insertion
1990 mode, action for |<iframe>| was missing.
1991
1992 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1993
1994 * ContentChecker.pm: Support for many of HTML5 elements.
1995 ($GetHTMLNonNegativeIntegerAttrChecker): New.
1996
1997 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1998
1999 * ContentChecker.pm: Support for most elements up to |progress|.
2000 ($HTMLURIAttrChecker): Placeholder.
2001 ($HTMLIntegerAttrChecker, $GetHTMLFloatingPointNumberAttrChecker):
2002 New.
2003
2004 2007-05-19 Wakaba <wakaba@suika.fam.cx>
2005
2006 * ContentChecker.pm: Attribute checkers
2007 for global attributes, |html|, |base|, |style|, and |meta|.
2008
2009 * NanoDOM.pm (insert_before): Weaken reference
2010 to the parent node.
2011 (Attr::new): Set |owner_element| attribute.
2012 (namespace_uri, manakai_local_name): New attribute implementations.
2013 (owner_element): New attribute.
2014
2015 2007-05-19 Wakaba <wakaba@suika.fam.cx>
2016
2017 * ContentChecker.pm ($AttrChecker, $HTMLAttrChecker,
2018 $AnyChecker->{attr_checker}, $HTMLAttrsChecker,
2019 $Element->{$HTML_NS}->{''}): New.
2020 (check_element): Invoke attrs_checker for each element.
2021
2022 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2023
2024 * ContentChecker.pm: Don't use |manakai_element_type_match|.
2025
2026 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2027
2028 * ContentChecker.pm: Use hashs rather than lists for
2029 element type testings.
2030
2031 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2032
2033 * ContentChecker.pm: Don't generate duplicate
2034 error when an element type is put in the "minus" list
2035 and the element type is not allowed explicitly in the particular
2036 element content model.
2037 (html:a checker): New checker.
2038 (html:details, html:datagrid): New checkers.
2039 (html:legend): New checker.
2040
2041 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2042
2043 * ContentChecker.pm (html:li checker): Implemented.
2044
2045 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2046
2047 * ContentChecker.pm ($HTMLInlineOrStriclyInlineChecker): New
2048 checker.
2049 (html:dd checker): New checker.
2050 (html:q, html:em, html:strong, html:small,
2051 html:m, html:dfn, html:code, html:samp, html:span): New checkers.
2052
2053 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2054
2055 * ContentChecker.pm ($AnyChecker): Renamed
2056 from |$ElementDefault->{checker}|.
2057 ($ElementDefault->{checker}): Throw an error that
2058 the element type is not supported by the checker.
2059 ($HTMLMetadataElement): |html:base| was missing.
2060 ($HTMLEmptyChecker): Don't throw an error
2061 for inter-element whitespace nodes.
2062 (html:html checker): Errors were not
2063 thrown even if |html:head| and/or |html:body|
2064 children were missing.
2065 (html:head checker): An error was not
2066 thrown if <meta charset> appered after other
2067 elements.
2068
2069 2007-05-05 Wakaba <wakaba@suika.fam.cx>
2070
2071 * ContentChecker.pm: |footer|, |video|, |audio|, |script|,
2072 and |noscript| elements are implemented.
2073 (new): New method.
2074
2075 2007-05-04 Wakaba <wakaba@suika.fam.cx>
2076
2077 * ContentChecker.pm: New module.
2078
2079 2007-05-04 Wakaba <wakaba@suika.fam.cx>
2080
2081 * NanoDOM.pm (manakai_parent_element,
2082 document_element, manakai_local_name,
2083 manakai_element_type_match): New method.
2084
2085 2007-05-03 Wakaba <wakaba@suika.fam.cx>
2086
2087 * HTML.pm.src: Replace decimal and hexadecimal numeric
2088 entities in C1 range using Windows-1252 mapping. Bare LF
2089 did not count as new line for error reporting.
2090
2091 2007-05-02 Wakaba <wakaba@suika.fam.cx>
2092
2093 * NanoDOM.pm (DOMImplementation): New class.
2094 (append_child): Weaken the |parent_node| reference.
2095 (create_element_ns, Element new): Set the |owner_document|
2096 reference.
2097 (implementation): New attribute.
2098 (owner_document, local_name, namespace_uri): New attributes.
2099
2100 * HTML.pm.src (parse_string): Line and column numbers
2101 are now provided to error handler.
2102 (!!!parse-error): Short descriptions are added.
2103 (_construct_tree): Split into three methods; support
2104 for innerHTML mode.
2105 (set_inner_html): New method.
2106
2107 2007-05-01 Wakaba <wakaba@suika.fam.cx>
2108
2109 * NanoDOM.html: Documentation is added.
2110
2111 * HTML.pod, ContentType.html: Documentation is revised.
2112
2113 * .cvsignore: Pod2html temporary files are added.
2114
2115 * Makefile: Make |NanoDOM.html|.
2116
2117 2007-05-01 Wakaba <wakaba@suika.fam.cx>
2118
2119 * HTML.pm.src (parse_string): New method.
2120 (get_inner_html): Renamed from |inner_html|.
2121
2122 * Makefile: A rule for |HTML.html| is added.
2123
2124 * HTML.pod: New documentation.
2125
2126 2007-05-01 Wakaba <wakaba@suika.fam.cx>
2127
2128 * NanoDOM.pm (last_child, previous_sibling): New attributes.
2129 (clone_node): Attribute nodes were not completely copied.
2130
2131 * HTML.pm.src: Many bugs are fixed.
2132
2133 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2134
2135 * HTML.pm.src: Some typos are fixed.
2136
2137 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2138
2139 * mkhtmlparser.pl, Makefile: References to the |HTML-consume-entity.src|
2140 are removed.
2141
2142 * HTML.pm.src: Tokenizer's handling on named entities are rewritten.
2143
2144 * HTML-consume-entity.src: Removed.
2145
2146 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2147
2148 * HTML.pm.src: Tokenizer's handling on hexadecimal
2149 numeric entities are rewritten.
2150
2151 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2152
2153 * HTML.pm.src: Some tokenizer bugs are fixed.
2154
2155 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2156
2157 * HTML.pm.src: The tree construction stage is implemented.
2158
2159 * mkhtmlparser.pl: New macros are added.
2160
2161 2007-04-28 Wakaba <wakaba@suika.fam.cx>
2162
2163 * ContentType.pm: A note on bug in the specification
2164 is removed since it's been now fixed.
2165
2166 * .cvsignore: New file.
2167
2168 2007-04-28 Wakaba <wakaba@suika.fam.cx>
2169
2170 * HTML.pm.src, HTML-consume-entity.src: New files.
2171
2172 * Makefile (HTML.pm): New rule.
2173
2174 * mkhtmlparser.pl: New script.
2175
2176 2007-04-25 Wakaba <wakaba@suika.fam.cx>
2177
2178 * Makefile: New file.
2179
2180 2007-04-24 Wakaba <wakaba@suika.fam.cx>
2181
2182 * ContentType.pm: An error in pod is fixed.
2183
2184 2007-04-24 Wakaba <wakaba@suika.fam.cx>
2185
2186 * ContentType.pm: Documentation is added.
2187
2188 2007-04-24 Wakaba <wakaba@suika.fam.cx>
2189
2190 * ContentType.pm: New Perl module.
2191
2192 * ChangeLog: New file.
2193
2194

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24