/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Contents of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.347 - (show annotations) (download)
Sun Oct 5 05:59:35 2008 UTC (16 years, 1 month ago) by wakaba
Branch: MAIN
Changes since 1.346: +4 -0 lines
++ whatpm/t/ChangeLog	5 Oct 2008 05:38:58 -0000
	* content-model-1.dat, content-model-2.dat: Test results on
	|label| contents fixed.

	* ContentChecker.t: |dom-conformance/html-form-label.dat| added.

	* dom-conformance/: New directory.

2008-10-05  Wakaba  <wakaba@suika.fam.cx>

++ whatpm/t/dom-conformance/ChangeLog	5 Oct 2008 05:35:48 -0000
2008-10-05  Wakaba  <wakaba@suika.fam.cx>

	* html-form-label.dat: New test data file.

	* ChangeLog: New file.

++ whatpm/Whatpm/ChangeLog	5 Oct 2008 05:58:50 -0000
	* ContentChecker.pod: Note on internal flags is added.

2008-10-05  Wakaba  <wakaba@suika.fam.cx>

++ whatpm/Whatpm/ContentChecker/ChangeLog	5 Oct 2008 05:36:36 -0000
2008-10-05  Wakaba  <wakaba@suika.fam.cx>

	* HTML.pm: <label> content conformance checking implemented.

1 2008-10-05 Wakaba <wakaba@suika.fam.cx>
2
3 * ContentChecker.pod: Note on internal flags is added.
4
5 2008-10-05 Wakaba <wakaba@suika.fam.cx>
6
7 * HTML.pm.src: An AAA bug fixed.
8
9 2008-10-04 Wakaba <wakaba@suika.fam.cx>
10
11 * HTML.pm.src: If another node is inserted by the parser, don't
12 reuse existing Text node to append a character (HTML5 revision
13 2124).
14
15 2008-10-04 Wakaba <wakaba@suika.fam.cx>
16
17 * HTML.pm.src: Support for <option> and <optgroup> in body (HTML5
18 revisions 1731 and 2128).
19
20 2008-10-04 Wakaba <wakaba@suika.fam.cx>
21
22 * HTML.pm.src: Make <foreignObject> scoping (HTML5 revision 1837).
23 Support for end tags of camelCase SVG elements were broken. A
24 wrong error type text fixed.
25
26 2008-10-04 Wakaba <wakaba@suika.fam.cx>
27
28 * HTML.pm.src: Drop redundant code (HTML5 revision 1731).
29
30 2008-10-04 Wakaba <wakaba@suika.fam.cx>
31
32 * HTML.pm.src: Support for new definition of |param| and |source|
33 start tag parsing (HTML5 revision 1731).
34
35 2008-10-04 Wakaba <wakaba@suika.fam.cx>
36
37 * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
38
39 2008-10-04 Wakaba <wakaba@suika.fam.cx>
40
41 * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
42 revisions 1731 and 1831).
43
44 2008-10-04 Wakaba <wakaba@suika.fam.cx>
45
46 * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
47 revisions 1731 and 1778). Support for the </sarcasm> end tag (HTML5
48 revision 1731).
49
50 2008-10-04 Wakaba <wakaba@suika.fam.cx>
51
52 * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
53 revision 1731). End tags of |option| and |optgroup| elements are
54 now optional (HTML5 revision 1731).
55
56 2008-10-04 Wakaba <wakaba@suika.fam.cx>
57
58 * HTML.pm.src: New "special" elements added to the list (HTML5
59 revision 1778). "strile" -> "strike".
60
61 2008-10-02 Wakaba <wakaba@suika.fam.cx>
62
63 * ContentType.pm (get_sniffed_type): Support for the "better"
64 content sniffing (HTML5 revision 1927). In a case the official
65 type was not returned when the method is invoked in the list
66 context.
67
68 2008-09-22 Wakaba <wakaba@suika.fam.cx>
69
70 * HTML.pm.src: Character references for non-space C0 characters,
71 including U+000B VT, DEL character, noncharacter code points, are
72 now converted to the U+FFFD character (cf. HTML5 revision 2138).
73
74 2008-09-21 Wakaba <wakaba@suika.fam.cx>
75
76 * ContentChecker.pm: |form=""| check support added.
77
78 2008-09-21 Wakaba <wakaba@suika.fam.cx>
79
80 * ContentChecker.pm: |contextmenu| validness is now checked using
81 |id| and |id_type| properties, and |menu| property is removed.
82
83 2008-09-21 Wakaba <wakaba@suika.fam.cx>
84
85 * ContentChecker.pm: Prepare for |form| |name| attribute's
86 duplication checking.
87
88 2008-09-21 Wakaba <wakaba@suika.fam.cx>
89
90 * HTML.pm.src (parse_byte_stream): Support (or non-support) for
91 unsupported charset="" parameter value (HTML5 revision 2131).
92
93 2008-09-20 Wakaba <wakaba@suika.fam.cx>
94
95 * HTML.pm.src: Reminding places where U+000B is allowed as a space
96 character is fixed (cf. HTML5 revision 1738).
97
98 * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
99 space characters (HTML5 revision 1738).
100
101 2008-09-20 Wakaba <wakaba@suika.fam.cx>
102
103 * HTML.pm.src: The "anything else" case for the "after after body"
104 insertion mode was not updated to swtich to the "in body"
105 insertion mode. U+000B is no longer a space character for the
106 purpose of tree construction phase (HTML5 revision 1738).
107
108 2008-09-20 Wakaba <wakaba@suika.fam.cx>
109
110 * HTML.pm.src: U+000B is no longer a space character (HTML5
111 revision 1738).
112
113 2008-09-20 Wakaba <wakaba@suika.fam.cx>
114
115 * ContentType.pm: 0x0B is no longer a space character (HTML5
116 revision 1738).
117
118 * HTML.pm.src: U+000B is no longer a space character for the
119 algorithm for extracting an encoding from a Content-Type (HTML5
120 revision 1738).
121
122 2008-09-20 Wakaba <wakaba@suika.fam.cx>
123
124 * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
125
126 2008-09-18 Wakaba <wakaba@suika.fam.cx>
127
128 * LangTag.pm: Add checks for remaining requirements from RFC 4646.
129
130 * mklangreg.pl: Sort 'Prefix' values by their length, to ease
131 matching.
132
133 2008-09-18 Wakaba <wakaba@suika.fam.cx>
134
135 * LangTag.pm: Warn for private use language subtags. Error level
136 typos fixed. Support for Suppress-Script field.
137
138 * mklangreg.pl: Support for dumping of nested structure.
139
140 2008-09-18 Wakaba <wakaba@suika.fam.cx>
141
142 * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
143 recommended case as per RFC 4646.
144
145 2008-09-18 Wakaba <wakaba@suika.fam.cx>
146
147 * LangTag.pm (check_rfc4646_langtag): New method.
148
149 2008-09-18 Wakaba <wakaba@suika.fam.cx>
150
151 * mklangreg.pl: New script.
152
153 * Makefile: Updated for creation of the module for language subtag
154 registry.
155
156 2008-09-16 Wakaba <wakaba@suika.fam.cx>
157
158 * Makefile: WebIDL.html added.
159
160 * WebIDL.pod: New documentation.
161
162 2008-09-16 Wakaba <wakaba@suika.fam.cx>
163
164 * WebIDL.pm: Checker's error types are redefined.
165
166 2008-09-16 Wakaba <wakaba@suika.fam.cx>
167
168 * WebIDL.pm: Parser's error types are redefined. Some forward
169 compatible parsing bugs are fixed. Some unreachable codes are
170 commented out.
171
172 2008-09-16 Wakaba <wakaba@suika.fam.cx>
173
174 * WebIDL.pm: Support for the reminding extended attributes are
175 added. It does not satisfy the definition that a forward
176 interface declaration has an extended attribute. It seems that
177 unless explicitly allowed multiple extended attributes with the
178 same name is not allowed, though it is not explicitly mentioned in
179 the spec.
180
181 2008-09-16 Wakaba <wakaba@suika.fam.cx>
182
183 * WebIDL.pm: Unescapes extended attribute names and extended
184 attribute identifiers. Preserve whether an extended attribute has
185 an argument list of not. Support for extended attributes:
186 Constructor, ExceptionConsts, IndexGetter, IndexSetter,
187 NameGetter, NameSetter, and Null.
188 (has_argument_list): New attribute.
189 (idl_text): Stringifies argument lists, if any, even if it is
190 empty.
191
192 2008-09-15 Wakaba <wakaba@suika.fam.cx>
193
194 * HTML.pm.src: New state |PCDATA_STATE|. Use an empty string for
195 |{s_kwd}| in DATA_STATE as default.
196
197 2008-09-15 Wakaba <wakaba@suika.fam.cx>
198
199 * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
200 by |{s_kwd}| in DATA_STATE.
201
202 2008-09-15 Wakaba <wakaba@suika.fam.cx>
203
204 * HTML.pm.src: Shorten keys.
205
206 2008-09-15 Wakaba <wakaba@suika.fam.cx>
207
208 * HTML.pm.src: Remove checking for control character, surrogate
209 pair, or noncharacter code points and non-Unicode code
210 points (they should be handled by Whatpm::Charset::UnicodeChecker).
211 (parse_char_stream): Support for the |$get_wrapper| argument and
212 character stream error handlers.
213
214 2008-09-15 Wakaba <wakaba@suika.fam.cx>
215
216 * ContentChecker.pm: Don't call |loda_ns_module|
217 for null-namespace elements/attributes.
218
219 * HTML.pm.src: Fact out $disallowed_control_chars
220 as a hash.
221
222 2008-09-14 Wakaba <wakaba@suika.fam.cx>
223
224 * HTML.pm.src: Regexp typo fixed. |{prev_char}|
225 and |{next_char}| initializations are moved to initialization
226 method. |{read_until}| now supports buffering. Sync |set_inner_html|
227 with |parse_char_stream|.
228
229 2008-09-14 Wakaba <wakaba@suika.fam.cx>
230
231 * HTML.pm.src (parse_char_stream): Make |set_next_char|
232 invoke |manakai_read_until|, not only |read|, where
233 possible, to decrease the number of |read| method calls.
234
235 * mkhtmlparser.pl: Related changes to the aforementioned
236 modification.
237
238 2008-09-14 Wakaba <wakaba@suika.fam.cx>
239
240 * HTML.pm.src: Use |read| instead of |getc|. |set_inner_html|
241 would report character error from now.
242
243 2008-09-14 Wakaba <wakaba@suika.fam.cx>
244
245 * HTML.pm.src: White-space-leaded non-white-space character
246 tokens in "before head insertion mode" was not
247 correctly handled.
248 (set_inner_html): Reimplemented using CharString decodehandle
249 class. Support for $get_wrapper argument. Support
250 for |{read_until}| feature.
251
252 2008-09-14 Wakaba <wakaba@suika.fam.cx>
253
254 * HTML.pm.src: Make a "bare ero" error for unknown
255 entities point the "&" character.
256
257 2008-09-14 Wakaba <wakaba@suika.fam.cx>
258
259 * HTML.pm.src: It turns out that U+FFFD don't have to
260 be added to the list of excluded characters.
261
262 2008-09-14 Wakaba <wakaba@suika.fam.cx>
263
264 * HTML.pm.src ($char_onerror): Have character decoder's |line|
265 and |column| a higher priority than the one set by the
266 tokenizer's input handler.
267 ($self->{read_until}): Exclude U+FFFD (but this might
268 not be necessary, since now we do line/column fixup in
269 the character decode handle).
270
271 2008-09-14 Wakaba <wakaba@suika.fam.cx>
272
273 * HTML.pm.src: Use |{read_until}| where possible.
274
275 2008-09-14 Wakaba <wakaba@suika.fam.cx>
276
277 * HTML.pm.src: Change |{getc_until}| to |{read_until}|
278 and |manakai_getc_until| to |manakai_read_until| to
279 reduce the number of string copies.
280
281 2008-09-14 Wakaba <wakaba@suika.fam.cx>
282
283 * HTML.pm.src (parse_char_string): Use newly created
284 |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
285 standard feature to |open| a string as a filehandle,
286 since Perl's string filehandle seems not supporting |ungetc|
287 method correctly.
288 (parse_char_stream): Define |{getc_until}| method.
289 (DATA_STATE): Experimental support for |getc_until| feature.
290
291 2008-09-13 Wakaba <wakaba@suika.fam.cx>
292
293 * HTML.pm.src: Check points added to newly added branches.
294
295 2008-09-13 Wakaba <wakaba@suika.fam.cx>
296
297 * HTML.pm.src: Remove |{char}|, which is no longer used.
298 Remove |{entity_in_attr}| and |{last_attribute_value_state}|
299 and replaced by |{prev_state}|.
300
301 * mkhtmlparser.pl: Remove |{char}| feature.
302 Remove |!!!back-next-input-character;| macro.
303
304 2008-09-13 Wakaba <wakaba@suika.fam.cx>
305
306 * HTML.pm.src: Finally we get rid of all the inner loops. Remove
307 entity related tokenizer states in favor of new states
308 implementing the consume character reference algorithm.
309
310 2008-09-13 Wakaba <wakaba@suika.fam.cx>
311
312 * HTML.pm.src: "Consume a character reference" algorithm is
313 now implemented as a tokenizer's state, rather than
314 a method, with minimum changes (more changes will
315 be made, in due course). "Bogus comment state"'s inner
316 loop gets removed.
317
318 2008-09-13 Wakaba <wakaba@suika.fam.cx>
319
320 * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
321 into their own tokenizer states.
322
323 2008-09-13 Wakaba <wakaba@suika.fam.cx>
324
325 * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
326 is split into three states.
327
328 2008-09-13 Wakaba <wakaba@suika.fam.cx>
329
330 * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
331 itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
332 no longer does the tokenizer have to push back next input
333 characters in those states.
334
335 2008-09-13 Wakaba <wakaba@suika.fam.cx>
336
337 * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
338 into four states so that no longer does the tokenizer have to push
339 back next input characters in that state.
340
341 2008-09-11 Wakaba <wakaba@suika.fam.cx>
342
343 * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
344 which can be used to insert some wrapper between the character
345 stream handle and the tokenizer. (It is currently not supported
346 for |set_inner_html| for |Element|s).
347
348 2008-09-10 Wakaba <wakaba@suika.fam.cx>
349
350 * HTML.pm.src: Ignore punctuations in charset names.
351
352 2008-09-10 Wakaba <wakaba@suika.fam.cx>
353
354 * ContentChecker.pm: Support for charset-layer error levels.
355
356 * HTML.pm.src: Don't specify |text| argument for the
357 |chardecode:fallback| error, since it is not the encoding
358 being used alternatively.
359
360 2008-09-06 Wakaba <wakaba@suika.fam.cx>
361
362 * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
363
364 2008-08-31 Wakaba <wakaba@suika.fam.cx>
365
366 * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
367
368 2008-08-31 Wakaba <wakaba@suika.fam.cx>
369
370 * HTML.pm.src: Bug fix and sync with the spec with regard
371 to after after frameset insertion mode processing (HTML5
372 revision 1909). Note that the implementation was wrong
373 per the old spec before the r1909 changes.
374
375 2008-08-30 Wakaba <wakaba@suika.fam.cx>
376
377 * HTMLTable.pm: scope=auto algorithm fix synced with the
378 spec (HTML5 revision 2093).
379 ($process_row): Algorithm step numbers synced with the
380 spec (HTML5 revision 2092).
381
382 2008-08-30 Wakaba <wakaba@suika.fam.cx>
383
384 * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
385 revision 2094).
386
387 2008-08-30 Wakaba <wakaba@suika.fam.cx>
388
389 * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
390
391 2008-08-30 Wakaba <wakaba@suika.fam.cx>
392
393 * HTML.pm.src: '"' and "'" at the end of attribute
394 name (after another attribute) now raise parse error (HTML5
395 revision 2123). Empty unquoted attribute values are no
396 longer allowed (HTML5 revision 2122).
397
398 2008-08-30 Wakaba <wakaba@suika.fam.cx>
399
400 * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
401 revision 2130).
402
403 2008-08-30 Wakaba <wakaba@suika.fam.cx>
404
405 * ContentChecker.pm: |xml:lang| attribute value must be same
406 as |lang| attribute value for HTML elements (HTML5 revision 2062
407 and so on).
408
409 2008-08-30 Wakaba <wakaba@suika.fam.cx>
410
411 * ContentChecker.pm: Error level definition for |xml_id_error|
412 was missing.
413
414 * URIChecker.pm: The end of the URL should be marked as the
415 error location for an empty path error. The position
416 between the userinfo and the port components should be
417 marked as the error location for an empty host error.
418
419 2008-08-30 Wakaba <wakaba@suika.fam.cx>
420
421 * URIChecker.pm: Set parameters representing where in the
422 value the error occurs for errors. Report unknown
423 address format error in warning level, since address
424 formats are rarely added. Path segments starting with "/.."
425 were misinterpreted as a dot-segment.
426
427 2008-08-30 Wakaba <wakaba@suika.fam.cx>
428
429 * URIChecker.pm (check_iri_reference): Requires
430 |Message::DOM::DOMImplementation|.
431
432 2008-08-29 Wakaba <wakaba@suika.fam.cx>
433
434 * IMTChecker.pm: Updated for the new error reporting architecture.
435
436 * ContentChecker.pm: Error levels for IMTs are added.
437
438 2008-08-17 Wakaba <wakaba@suika.fam.cx>
439
440 * H2H.pm (_shift_token): Support for unquoted HTML attribute
441 values.
442
443 2008-08-16 Wakaba <wakaba@suika.fam.cx>
444
445 * CacheManifest.pm: Support for new style of error
446 reports.
447
448 * HTML.pm.src: Set line=1, column=1 to the document node.
449
450 2008-08-16 Wakaba <wakaba@suika.fam.cx>
451
452 * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
453 and URL checkers. Support for more error levels for bogus
454 langauge tag and URL "standards".
455
456 * LangTag.pm, URIChecker.pm: Support for new style error
457 level reporting.
458
459 2008-08-15 Wakaba <wakaba@suika.fam.cx>
460
461 * ContentChecker.pm: Support for RDF/XML error levels.
462
463 * HTMLTable.pm, RDFXML.pm: Support for new style of error level
464 specifying. Error types are revised.
465
466 2008-08-15 Wakaba <wakaba@suika.fam.cx>
467
468 * ContentChecker.pm: All error reporting method calls are
469 renewed.
470
471 2008-08-15 Wakaba <wakaba@suika.fam.cx>
472
473 * HTML.pm.src: All error type names and "text" parameters
474 are revised. Use new style for "level" specification.
475
476 * mkhtmlparser.pl: Use new style for "level" specification.
477
478 2008-08-03 Wakaba <wakaba@suika.fam.cx>
479
480 * WebIDL.pm (parse_char_string): Simplified error
481 reporting process for broken ignored valuetype definition.
482 (Valuetype idl_text): Support for special "DOMString" name.
483
484 2008-08-03 Wakaba <wakaba@suika.fam.cx>
485
486 * WebIDL.pm ($get_scoped_name): Append "::::" if the last
487 terminal of the ScopedName is "DOMString", such that whether
488 the last part of the scoped name is "DOMString" or "_DOMString"
489 later. It is necessary to determine whether a |typedef|
490 definition should be ignored or not.
491 (parse_char_string): Unescape the identifier of
492 exception members.
493 ($resolve): Return undef for builtin types and sequence<T>
494 types (we might not have to do this, however...).
495 (check): Support checking for Exceptions, Valuetypes,
496 and Typedefs.
497 ($serialize_type): Support for "DOMString::::" syntax.
498 (Typedef idl_text): Output Type as "DOMString" if it
499 is really "DOMString" (i.e. its internal representation
500 is "::DOMString::").
501
502 2008-08-03 Wakaba <wakaba@suika.fam.cx>
503
504 * WebIDL.pm ($resolve): New code, based on resolve code
505 for constant types in the |check| method.
506 (check): Support for checking of attributes, operations, and
507 arguments.
508 (Attribute/Operation idl_text): Exception names in getraises,
509 setraises, and raises clauses is serizlied by |$serialize_type|
510 code.
511
512 2008-08-02 Wakaba <wakaba@suika.fam.cx>
513
514 * WebIDL.pm ($integer): Order of selections are changed to match
515 hexadecimal numbers (the original pattern, taken from the spec,
516 was not work for hexadecimal numbers, because the "0" prefix
517 matches to the [0-7]* part (as an empty string) and therefore
518 it does not match with remaining "x..." part of a "0x..." integer
519 literal.
520 ($get_type): It now returns a string, not an array reference,
521 for regular types and |sequence| types (i.e. it in any case
522 returns a string).
523 ($get_next_token): The second item in the array that represents
524 a integer or float token is now a Perl number value, not the
525 original string representation of the number.
526 (check): Support for const value consistency checking.
527 No extended attribute is defined for constants.
528 (Node subclasses): Use simple strings rather than array references
529 for default data type values.
530 ($serialize_type): Type values are now simple strings.
531 (value): If the new attribute value is a false value, then
532 a FALSE value is set to the attribute.
533
534 2008-08-02 Wakaba <wakaba@suika.fam.cx>
535
536 * WebIDL.pm ($get_scoped_name): Now scoped names are stored
537 in its stringified format ("scoped name" as defined in the
538 spec). Note that future version of this module should not use
539 array references for type values and the |type_text| attribute
540 should be made obsolete.
541 (parse_char_string): Unescape attribute names.
542 (check): Support for checking of whether inherited interfaces
543 are actually defined or not. Support for checking of whether
544 interface member identifiers are duplicated or not.
545 ($serialize_type): Scoped names are returned as is. A future
546 version of this code should escape identifiers other than "DOMString",
547 otherwise the idl_text would be non-conforming.
548
549 2008-08-02 Wakaba <wakaba@suika.fam.cx>
550
551 * WebIDL.pm (parse_char_string): Set line/column numbers
552 to generated nodes. Unescape identifiers. Extended attributes
553 for Definition's were ignored.
554 (append_child): Set |parent_node| attribute.
555 (parent_node): New attribute.
556 (check): Support interface/exception members. Support
557 extended attributes. Support definition identifier uniqueness
558 constraint.
559 (qualified_name): New attribute.
560 (Interface/Exception idl_text): Extended attributes were
561 not prepended to the returned text.
562
563 2008-08-02 Wakaba <wakaba@suika.fam.cx>
564
565 * WebIDL.pm (parse_char_string): Set line/column numbers
566 to interface object experimentally. s/shift/pop/g, shift
567 would make things wrong. Support for interface forward
568 declarations was missing. Broken interface declarations
569 with no block were not ignored entirely.
570 (Whatpm::WebIDL::Node): New abstract class. This class
571 makes things easier.
572 (child_nodes): New attribute. Unlike DOM's attribute with
573 same name, this attribute returns a dead list of nodes for
574 simplicity.
575 (get_user_data, set_user_data): New methods.
576 (Module idl_text): A SPACE character should be inserted
577 before the |{| character.
578 (Interface idl_text): Support for interface forward declarations.
579 (is_forward_declaration): New attribute.
580
581 2008-07-19 Wakaba <wakaba@suika.fam.cx>
582
583 * WebIDL.pm (type_text): Better serializer.
584
585 2008-07-19 Wakaba <wakaba@suika.fam.cx>
586
587 * WebIDL.pm: Revise forward-compatible parsing so that
588 it now can handle broken extended attributes and as such.
589
590 2008-07-19 Wakaba <wakaba@suika.fam.cx>
591
592 * WebIDL.pm: Real support for extended attributes.
593 Support for extended attributes with arguments.
594
595 2008-07-19 Wakaba <wakaba@suika.fam.cx>
596
597 * WebIDL.pm: Support for |exception| syntax.
598 (Interface->idl_text): Tentative support for inheritances.
599
600 2008-07-19 Wakaba <wakaba@suika.fam.cx>
601
602 * WebIDL.pm: Hierarchical scoped name support was broken.
603 Support for raises, setraises, and getraises syntaxes.
604
605 2008-07-18 Wakaba <wakaba@suika.fam.cx>
606
607 * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
608 proper support for types, extended attributes, and exceptions yet).
609 WebIDL parser, version 1 (no support for exceptions yet,
610 no proper support for extended attributes yet).
611
612 2008-07-09 Wakaba <wakaba@suika.fam.cx>
613
614 * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
615
616 2008-06-29 Wakaba <wakaba@suika.fam.cx>
617
618 * WebIDL.pm: Support for valuetype and const.
619
620 2008-06-29 Wakaba <wakaba@suika.fam.cx>
621
622 * WebIDL.pm: New module.
623
624 2008-06-15 Wakaba <wakaba@suika.fam.cx>
625
626 * Makefile (Entities.html): URI changed.
627
628 2008-06-08 Wakaba <wakaba@suika.fam.cx>
629
630 * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
631
632 2008-06-01 Wakaba <wakaba@suika.fam.cx>
633
634 * HTML.pm.src (_get_next_token): A parse error was missing.
635
636 2008-06-01 Wakaba <wakaba@suika.fam.cx>
637
638 * mklinktypelist.pl: rel=contact is no longer part of the HTML5
639 spec (commented out). (HTML5 revision 1711).
640
641 2008-05-25 Wakaba <wakaba@suika.fam.cx>
642
643 * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
644
645 * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
646 as UTF-16 (HTML5 revision 1701).
647
648 2008-05-25 Wakaba <wakaba@suika.fam.cx>
649
650 * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
651 1692).
652
653 2008-05-25 Wakaba <wakaba@suika.fam.cx>
654
655 * HTML.pm.src: The secondary insertion mode used when switching
656 to foreign content is the "in body" insertion mode (HTML5 revision
657 1696).
658
659 2008-05-25 Wakaba <wakaba@suika.fam.cx>
660
661 * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
662 revision 1697).
663
664 2008-05-24 Wakaba <wakaba@suika.fam.cx>
665
666 * HTML.pm.src: Support for end-of-file token in foreign content
667 insertion mode (HTML5 revision 1693). Update SVG camelCase
668 attribute list (HTML5 revision 1700). <textarea> closes
669 </select> (HTML5 revision 1699). More start tags close in
670 foreign content insertion mode (HTML5 revision 1698).
671
672 2008-05-24 Wakaba <wakaba@suika.fam.cx>
673
674 * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
675
676 2008-05-24 Wakaba <wakaba@suika.fam.cx>
677
678 * HTML.pm.src: More robust charset parameter detection (HTML5
679 revision 1674).
680
681 2008-05-24 Wakaba <wakaba@suika.fam.cx>
682
683 * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
684 revision 1676).
685
686 2008-05-24 Wakaba <wakaba@suika.fam.cx>
687
688 * HTML.pm.src: Ignore language part of public identifiers for
689 quriks mode detection (HTML5 revision 1679).
690
691 2008-05-24 Wakaba <wakaba@suika.fam.cx>
692
693 * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
694 revision 1685).
695
696 2008-05-24 Wakaba <wakaba@suika.fam.cx>
697
698 * HTML.pm.src: Support for EOF in new states for tags (HTML5
699 revision 1684).
700
701 2008-05-24 Wakaba <wakaba@suika.fam.cx>
702
703 * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
704 work (HTML5 revision 1690).
705
706 2008-05-24 Wakaba <wakaba@suika.fam.cx>
707
708 * HTML.pm.src (_tree_construction_main): Change handling of
709 end tags in head insertion modes (HTML5 revision 1686).
710 (parse_char_string): Bug fix for non-utf8 character string handlings.
711 (parse_char_stream): |ungetc| does not work well for this context.
712
713 2008-05-18 Wakaba <wakaba@suika.fam.cx>
714
715 * HTML.pm.src (parse_byte_string): Redefined to invoke
716 |parse_byte_stream|.
717 (parse_byte_stream): New method.
718
719 2008-05-18 Wakaba <wakaba@suika.fam.cx>
720
721 * HTML.pm.src (parse_byte_string): Fix the column number reported
722 by encoding layer error reporter.
723
724 2008-05-17 Wakaba <wakaba@suika.fam.cx>
725
726 * HTML.pm.src (parse_byte_string): Use streaming decoder
727 rather than converting the whole byte string and then parsing.
728 Propagate errors in character encoding layer.
729 (get_next_token): Precise error reporting for |bare stago| error.
730
731 2008-05-17 Wakaba <wakaba@suika.fam.cx>
732
733 * HTML.pm.src (parse_char_stream): New method.
734 (parse_char_string): This method is now defined as an invocation
735 of the |parse_char_stream| method.
736
737 2008-05-17 Wakaba <wakaba@suika.fam.cx>
738
739 * HTML.pm.src (parse_byte_string): Report various status
740 of the sniffing as info-level errors. Support for new
741 decoding framework in parser resestting.
742 (new): Various default error levels were not set.
743
744 2008-05-17 Wakaba <wakaba@suika.fam.cx>
745
746 * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
747 algorithm, except for the actual sniffing, is implemented
748 with new framework with Message::Charset::Info.
749
750 2008-05-16 Wakaba <wakaba@suika.fam.cx>
751
752 * CacheManifest.pm (_parse): Drop fragment identifiers from
753 URIs in fallback section (HTML5 revision 1596).
754
755 2008-05-10 Wakaba <wakaba@suika.fam.cx>
756
757 * Makefile (Entities.html): URI has changed.
758
759 2008-05-10 Wakaba <wakaba@suika.fam.cx>
760
761 * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
762 1553).
763
764 2008-05-06 Wakaba <wakaba@suika.fam.cx>
765
766 * ContentChecker.pm: Noted that those returned in |table| are
767 no longer table elements, but table objects returned
768 by Whatpm::HTMLTable.
769
770 * HTMLTable.pm (form_table): Return table element node
771 as |$table->{element}|.
772 (assign_header): Support for the |headers=""| attribute.
773
774 2008-05-06 Wakaba <wakaba@suika.fam.cx>
775
776 * HTMLTable.pm (assign_header): New function; first version
777 with no support for headers="".
778 (form_table): Include table width and height to the returned
779 table object for covenience. Indexing in column assignement was wrong.
780 Set whether a data cell is empty or not for convenience.
781
782 2008-05-05 Wakaba <wakaba@suika.fam.cx>
783
784 * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
785
786 2008-05-05 Wakaba <wakaba@suika.fam.cx>
787
788 * HTMLTable.pm: How table model errors are detected is
789 changed (HTML5 revision 1387).
790
791 2008-05-05 Wakaba <wakaba@suika.fam.cx>
792
793 * HTMLTable.pm: The algorithm now moves |tfoot| elements
794 to the end of the table (HTML5 revision 1380).
795
796 2008-05-05 Wakaba <wakaba@suika.fam.cx>
797
798 * HTMLTable.pm: The algorithm is now 0-based indexing, instead
799 of 1-based (HTML5 revision 1376).
800
801 2008-05-05 Wakaba <wakaba@suika.fam.cx>
802
803 * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
804 and "Content-Encoding" no longer prevent sniffing (HTML5
805 revision 1288).
806
807 2008-05-05 Wakaba <wakaba@suika.fam.cx>
808
809 * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
810 1282).
811
812 2008-05-03 Wakaba <wakaba@suika.fam.cx>
813
814 * ContentChecker.pm: Support for global attributes.
815 Status of XML specs are added.
816
817 2008-05-03 Wakaba <wakaba@suika.fam.cx>
818
819 * ContentChecker.pm (check_element): Support for |template=""|
820 and |ref=""| attribute (referent element type checking).
821
822 2008-04-29 Wakaba <wakaba@suika.fam.cx>
823
824 * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
825 1500) is implemented (except for IDNA part and URI-scheme-specific
826 knowledge). Line number counting was wrong for LF-only
827 documents.
828
829 2008-04-13 Wakaba <wakaba@suika.fam.cx>
830
831 * HTML.pm.src: Raise an parse error for any disallowed
832 character (HTML5 revision 1263).
833
834 2008-04-13 Wakaba <wakaba@suika.fam.cx>
835
836 * mkentitylist.pl: Support for new HTML5 entity table format (the
837 definition for |AElig;| was missing).
838
839 2008-04-13 Wakaba <wakaba@suika.fam.cx>
840
841 * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
842 name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
843
844 2008-04-13 Wakaba <wakaba@suika.fam.cx>
845
846 * HTML.pm.src: List of element names that close foreign content
847 insertion mode is added (HTML5 revisions 1412 and 1418).
848
849 2008-04-13 Wakaba <wakaba@suika.fam.cx>
850
851 * HTML.pm.src: Support for |mglyph| and |malignmark|
852 elements (HTML5 revision 1410).
853
854 2008-04-13 Wakaba <wakaba@suika.fam.cx>
855
856 * HTML.pm.src: Support for new long MathML entities (HTML5
857 revision 1406).
858
859 2008-04-13 Wakaba <wakaba@suika.fam.cx>
860
861 * HTML.pm.src: CDATA section support for MathML and SVG
862 elements (HTML5 revision 1404 and 1420).
863
864 2008-04-12 Wakaba <wakaba@suika.fam.cx>
865
866 * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
867 elements (HTML5 revision 1404). Unused !!!macro definitions
868 are removed.
869
870 2008-04-12 Wakaba <wakaba@suika.fam.cx>
871
872 * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
873 are raised is changed (HTML5 revision 1404).
874
875 2008-04-06 Wakaba <wakaba@suika.fam.cx>
876
877 * HTML.pm.src: Category sets that are no longer used are removed.
878
879 2008-04-06 Wakaba <wakaba@suika.fam.cx>
880
881 * HTML.pm.src: The ->[1] property of stack entries are now
882 replaced by constants representing element category.
883
884 2008-04-06 Wakaba <wakaba@suika.fam.cx>
885
886 * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
887 for error reporting. (This is a preparation for using constant
888 value for ->[1].)
889
890 2008-03-22 Wakaba <wakaba@suika.fam.cx>
891
892 * RDFXML.pm: Typo fixed.
893
894 2008-03-22 Wakaba <wakaba@suika.fam.cx>
895
896 * ContentChecker.pm: |fact_level| is now treated
897 as same as |must_level|, i.e. level = |m|.
898 (check_element): Make list of URIs in the DOM.
899
900 2008-03-21 Wakaba <wakaba@suika.fam.cx>
901
902 * RDFXML.pm: Language accessor implemented.
903 Local (null-namespace) attribute support.
904
905 2008-03-21 Wakaba <wakaba@suika.fam.cx>
906
907 * RDFXML.pm: Factored out ID checking code.
908
909 2008-03-21 Wakaba <wakaba@suika.fam.cx>
910
911 * RDFXML.pm: TODO items noted. Validation of ID
912 and URI attributes is implemented. Warn if unknown
913 value is used in rdf:parseType="" attribute.
914
915 * URIChecker.pm (check_rdf_uri_reference): New function.
916
917 2008-03-21 Wakaba <wakaba@suika.fam.cx>
918
919 * RDFXML.pm: bnodeid implemented. Relative references
920 are now resolved.
921
922 2008-03-21 Wakaba <wakaba@suika.fam.cx>
923
924 * ContentChecker.pm: RDF reification implemented.
925
926 * RDFXML.pm: undef vs false bug fixed.
927 Reification implemented.
928
929 2008-03-21 Wakaba <wakaba@suika.fam.cx>
930
931 * RDFXML.pm: s/id/ID/ for attribute name.
932 The |node| arguments are added for |ontriple| calls.
933 Too many "attribute not allowed" errors were raised.
934
935 * ContentChecker.pm: Initial experimental support for rdf:RDF
936 element.
937
938 2008-03-21 Wakaba <wakaba@suika.fam.cx>
939
940 * RDFXML.pm: New module.
941
942 2008-03-20 Wakaba <wakaba@suika.fam.cx>
943
944 * HTML.pm.src (set_inner_html): Line/column number
945 code was old one yet.
946
947 2008-03-20 Wakaba <wakaba@suika.fam.cx>
948
949 * HTML.pm.src: Better line/column reporting for "duplicate attribute"
950 errors. Line/column markings for DOCTYPE, comment, and
951 character tokens are reintroduced; otherwise,
952 error location for "not HTML5" error and errors
953 for implied elements are not attached.
954
955 2008-03-20 Wakaba <wakaba@suika.fam.cx>
956
957 * HTML.pm.src: Set line/column numbers to attributes.
958
959 * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
960 (value): Setter implemented.
961
962 * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
963
964 2008-03-20 Wakaba <wakaba@suika.fam.cx>
965
966 * HTML.pm.src: Unused line/column markings are removed.
967
968 2008-03-20 Wakaba <wakaba@suika.fam.cx>
969
970 * HTML.pm.src (_get_next_token): Remove |first_start_tag|
971 flag, which is no longer used.
972
973 2008-03-17 Wakaba <wakaba@suika.fam.cx>
974
975 * HTML.pm.src: Set line/column information to element nodes.
976
977 * mkhtmlparser.pl (!!!create-element, !!!insert-element,
978 and !!!insert-element-t): Set line/column information to
979 element nodes.
980
981 2008-03-17 Wakaba <wakaba@suika.fam.cx>
982
983 * HTML.pm.src (_get_next_token): The first "<" character
984 in "<?", "<>", or "</>" should be the error point.
985
986 2008-03-16 Wakaba <wakaba@suika.fam.cx>
987
988 * HTML.pm.src: Some more fixes on error position reporting.
989
990 2008-03-16 Wakaba <wakaba@suika.fam.cx>
991
992 * HTML.pm.src: Token-level precious error reporting.
993
994 2008-03-16 Wakaba <wakaba@suika.fam.cx>
995
996 * HTML.pm.src: Preparation for more precious error point
997 reporting.
998
999 2008-03-11 Wakaba <wakaba@suika.fam.cx>
1000
1001 * HTML.pm.src: Error type revised.
1002
1003 2008-03-11 Wakaba <wakaba@suika.fam.cx>
1004
1005 * HTML.pm.src: Similar codes are merged together, again.
1006
1007 2008-03-11 Wakaba <wakaba@suika.fam.cx>
1008
1009 * HTML.pm.src: Similar codes are merged together.
1010
1011 2008-03-10 Wakaba <wakaba@suika.fam.cx>
1012
1013 * mkhtmlparser.pl: Set "level" parameter to parse errors.
1014
1015 * HTML.pm.src: Code refined.
1016
1017 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1018
1019 * HTML.pm.src: |</body>| treatement has been changed (HTML5
1020 revision 1348). Note that I really don't know this makes
1021 any difference in the black-box behavior of the parser.
1022
1023 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1024
1025 * HTML.pm.src: New end-of-file token implementation (HTML5
1026 revision 1348).
1027
1028 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1029
1030 * HTML.pm.src: |applet| support (HTML5 revision 1347).
1031
1032 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1033
1034 * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
1035
1036 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1037
1038 * HTML.pm.src: Support for |<input>| in the "in select" insertion
1039 mode and support for the "in select in table" insertion mode (HTML5
1040 revision 1342).
1041
1042 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1043
1044 * HTML.pm.src: No foster parenting for <script> and <script>
1045 in non-tainted <table>s (HTML5 revision 1336).
1046
1047 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1048
1049 * HTML.pm.src: Ignore white space characters between <html>
1050 and <head> (HTML5 revision 1332).
1051
1052 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1053
1054 * HTML.pm.src: Treat <input type=hidden> as if it were a
1055 white space (HTML5 revision 1331).
1056
1057 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1058
1059 * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
1060 element (HTML5 revision 1330).
1061
1062 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1063
1064 * HTML.pm.src: <title> is always appended to the current
1065 element (HTML5 revision 1328).
1066
1067 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1068
1069 * HTML.pm.src: White space in tainted tables are moved
1070 into foster parents (HTML5 revision 1326).
1071
1072 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1073
1074 * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
1075 revision 1321).
1076
1077 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1078
1079 * HTML.pm.src: |</h/n/>| case code rearranged to align with
1080 the spec (HTML5 revision 1320). Note that we finally complete
1081 all of HTML5 revision 1320 changes.
1082
1083 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1084
1085 * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
1086 tags (HTML5 revision 1320).
1087
1088 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1089
1090 * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
1091
1092 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1093
1094 * HTML.pm.src: A "generate implied end tags" code (t409.1)
1095 could not be reached so that it is now removed (HTML5 revision
1096 1320).
1097
1098 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1099
1100 * HTML.pm.src: Code for the case of |</div>| and so on are revised
1101 to align with new spec text (HTML5 revision 1320).
1102
1103 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1104
1105 * HTML.pm.src: Remove strange |if| condition; however, it should
1106 have had no harm in theory.
1107
1108 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1109
1110 * HTML.pm.src (_tree_construction_main): '</p>' in body
1111 case is split from other end tags for the preparation
1112 of implementing HTML revision 1320.
1113
1114 2008-03-07 Wakaba <wakaba@suika.fam.cx>
1115
1116 * HTML.pm.src: Simplified "generate implied end tag" (HTML5
1117 revision 1320).
1118
1119 2008-03-07 Wakaba <wakaba@suika.fam.cx>
1120
1121 * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
1122 and "div" (HTML5 revision 1318). Add comments to where
1123 |form| pointer association codes should be inserted (HTML5 revision
1124 1319).
1125
1126 2008-03-06 Wakaba <wakaba@suika.fam.cx>
1127
1128 * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
1129
1130 2008-03-05 Wakaba <wakaba@suika.fam.cx>
1131
1132 * HTML.pm.src: Since the case t268 should never be reached (no
1133 other token type, there are), it is replaced by a |die| statement.
1134
1135 2008-03-05 Wakaba <wakaba@suika.fam.cx>
1136
1137 * HTML.pm.src: Typo fixed.
1138
1139 2008-03-04 Wakaba <wakaba@suika.fam.cx>
1140
1141 * HTML.pm.src (_tree_construction_initial): Some limited quirks
1142 doctypes were not uppercased for comparison.
1143
1144 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1145
1146 * HTML.pm.src (tree construction and set_inner_html): Checkpoints
1147 are added.
1148
1149 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1150
1151 * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
1152 are set. Cases that are unlikely reached are noted as so.
1153
1154 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1155
1156 * HTML.pm.src: Checkpoints for debugging are added.
1157
1158 * mkhtmlparser.pl: Support for |!!!cp| syntax.
1159
1160 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1161
1162 * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
1163 for simplicity.
1164
1165 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1166
1167 * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
1168 revision 1307).
1169
1170 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1171
1172 * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
1173 is set are changed (HTML5 revision 1305).
1174
1175 2008-03-02 Wakaba <wakaba@suika.fam.cx>
1176
1177 * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
1178 1292). Entities are not parsed in comment-like part in RCDATA
1179 elements (HTML5 revision 1294). Allow bare & at the end
1180 of attribute value literals (HTML5 revision 1296). More
1181 quirks mode doctypes (HTML5 revision 1302). Requires spaces
1182 between attributes and ban attribute names or unquoted
1183 attribute values containing single or double quotes (HTML5
1184 revision 1303).
1185
1186 2008-03-02 Wakaba <wakaba@suika.fam.cx>
1187
1188 * ContentChecker.pm: Typo fixed. Don't raise "character encoding"
1189 and related errors unless it is an HTML document (though the spec
1190 is unclear on whether it is applied to XHTML document).
1191
1192 * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
1193 are added.
1194
1195 2008-03-02 Wakaba <wakaba@suika.fam.cx>
1196
1197 * HTML.pm.src: s/local_name/manakai_local_name/g.
1198
1199 2008-03-01 Wakaba <wakaba@suika.fam.cx>
1200
1201 * _NamedEntityList.pm: Updated (HTML5 revision 1286).
1202
1203 * HTML.pm.src: |charset| in |content| attribute is
1204 case-insensitive (HTML5 revision 1270).
1205
1206 2008-02-26 Wakaba <wakaba@suika.fam.cx>
1207
1208 * ContentChecker.pm: New status constants are added.
1209 ($ElementDefault): |status| added.
1210 (check_element): Err for non-standard or deprecated elements.
1211 (_attr_status_info): For non-standard or deprecated attributes.
1212
1213 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1214
1215 * ContentChecker.pm (_attr_status_info): New internal method.
1216
1217 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1218
1219 * ContentChecker.pm (check_element): Element standardized
1220 status information is now dispatched.
1221
1222 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1223
1224 * ContentChecker.pm (check_element): Fix |del|-and-significant
1225 problem by adding some more arguments.
1226
1227 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1228
1229 * ContentChecker.pm (check_element): Use context of
1230 container-for-the-purpose-of-content-model element (not transparent
1231 element) for |check_child_element| calling and significant
1232 text flag marking. This reintroduces |<del>|-and-significant
1233 problem again.
1234
1235 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1236
1237 * ContentChecker.pm (check_element): Make semi-transparent
1238 elements ignored for the purpose of phase changes in
1239 content model checking.
1240
1241 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1242
1243 * ContentChecker.pm (check_element): In-element state
1244 was not properly managed for transparent cases.
1245
1246 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1247
1248 * ContentChecker.pm (check_element): Support for |video|
1249 and |audio| as semi-transparent elements.
1250
1251 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1252
1253 * ContentChecker.pm ($HTMLSemiTransparentElements): New.
1254 (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
1255 Support for |html:object| as a semi-transparent element.
1256
1257 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1258
1259 * ContentChecker.pm (check_element): The way to traverse
1260 the tree is entirely revised to make it easier to track
1261 the state of ancestors/descendants. As a result of this
1262 revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1263 support for content model checking for HTML elements |figure|,
1264 |object|, |video|, and |audio| and checking for XML elements (and
1265 some XMLNS checkings) are dropped for now. They will be
1266 reimplemented in due cource.
1267
1268 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1269
1270 * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1271 token (HTML5 revision 1225).
1272
1273 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1274
1275 * ContentChecker.pm ({unsupported_level}): New value.
1276
1277 * HTML.pm.src: Save whether |meta| |content| attribute
1278 contains character references or not.
1279
1280 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1281
1282 * ContentChecker.pm (_get_children): (Incompleted) attempt to
1283 imlement significant content checking for contents
1284 with |del| elements.
1285
1286 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1287
1288 * ContenteChecker.pm ($HTMLTransparentElements): More
1289 elements are added.
1290 (_get_children): HTML |object| elements are now semi-transparent.
1291
1292 * NanoDOM.pm (manakai_html, manakai_head): New methods.
1293
1294 2008-02-16 Wakaba <wakaba@suika.fam.cx>
1295
1296 * CacheManifest.pm: HTML5 revision 1211 implemented.
1297
1298 * CacheManifest.pod: Updated.
1299
1300 2008-02-10 Wakaba <wakaba@suika.fam.cx>
1301
1302 * ContentChecker.pm (check_document, check_element): Support
1303 for second argument ($onsubdoc).
1304 (_get_css_parser): Removed (now it is part of WDCC).
1305
1306 2008-02-09 Wakaba <wakaba@suika.fam.cx>
1307
1308 * ContentChecker.pm (_get_css_parser): New.
1309
1310 2007-11-25 Wakaba <wakaba@suika.fam.cx>
1311
1312 * ContentChecker.pm ($AnyChecker): Old way to add child elements
1313 for checking had been used.
1314
1315 2007-11-25 Wakaba <wakaba@suika.fam.cx>
1316
1317 * ContentChecker.pm (check_element): New todo item type |descendant|.
1318
1319 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1320
1321 * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1322 specced in RFC 2046 and RFC 4288.
1323 (application/atom+xml): Definition added.
1324
1325 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1326
1327 * URIChecker.pm: Make RFC 3986 should-level errors
1328 warnings (rather than SHOULD-level errors).
1329
1330 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1331
1332 * NanoDOM.pm (get_user_data, set_user_data): New methods.
1333
1334 * HTML.pm.src: A flag for character references in attribute
1335 values are added. Set |manakai_has_reference| user data
1336 to |charset| attribute.
1337
1338 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1339
1340 * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1341 attributes.
1342
1343 * ContentChecker.pm (check_document): Warn if charset requirements
1344 cannot be tested.
1345
1346 2007-11-19 Wakaba <wakaba@suika.fam.cx>
1347
1348 * HTML.pm.src (parse_byte_string): Detect charset
1349 by universalchardet if charset parameter is not specified.
1350
1351 * Makefile (Charset-all, Charset-clean): New rules.
1352
1353 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1354
1355 * ContentChecker.pm (check_document): Check the existence
1356 of character encoding declaration and selection of encoding
1357 for HTML document.
1358
1359 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1360
1361 * ContentType.pm (get_sniffed_type): Return also the official
1362 type in list context.
1363
1364 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1365
1366 * ContentType.pm: Sniffing with leading white space
1367 ignoring (HTML5 revisions 1013 and 1016).
1368
1369 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1370
1371 * ContentType.pm: HTML5 revision 1013 changes, except for leading
1372 white spaces, are implemented.
1373
1374 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1375
1376 * HTML.pm.src (parse_char_string): Set |inner_encoding|
1377 attribute if possible.
1378
1379 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1380
1381 * HTML.pm.src (parse_byte_string): New method.
1382 (parse_char_string): New alias for |parse_string|.
1383 (main phase): Invoking "change the encoding" algorithm if desired.
1384
1385 * HTML.pod: Updated.
1386
1387 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1388
1389 * HTML.pod (get_inner_html): Removed.
1390
1391 * Makefile (HTML-all, HTML-clean): New.
1392
1393 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1394
1395 * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1396
1397 2007-11-08 Wakaba <wakaba@suika.fam.cx>
1398
1399 * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1400
1401 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1402
1403 * Makefile: |CacheManifest.html| is added.
1404
1405 * CacheManifest.pod: New file.
1406
1407 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1408
1409 * CacheManifest.pm: New module.
1410
1411 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1412
1413 * HTML.pm.src: Support for application cache selection algorithm
1414 callback.
1415
1416 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1417
1418 * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1419
1420 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1421
1422 * Makefile (clean): New rule.
1423
1424 * NanoDOM.pm (public_id, system_id): New attributes.
1425
1426 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1427
1428 * Makefile (CSS-all, CSS-clean, clean): New rules.
1429
1430 2007-10-14 Wakaba <wakaba@suika.fam.cx>
1431
1432 * ContentChecker.pm (check_document): Support for
1433 new |is_xml_root| flag.
1434 (check_element): Support for new |pluses| state.
1435 (_add_pluses): New method.
1436 (_remove_minuses): Support for new |minus| item.
1437
1438 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1439
1440 * ContentChecker.pm: Raise specific error for invalid
1441 root element.
1442
1443 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1444
1445 * ContentChecker.pm: Set level values for later uses.
1446
1447 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1448
1449 * ContentChecker.pm: Support for language tag validation.
1450
1451 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1452
1453 * LangTag.pm (check_rfc3066_language_tag): New method.
1454
1455 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1456
1457 * LangTag.pm: New module.
1458
1459 2007-09-04 Wakaba <wakaba@suika.fam.cx>
1460
1461 * HTML.pm.src: Some error types were wrong.
1462
1463 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1464
1465 * CSS/: New directory.
1466
1467 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1468
1469 * ContentChecker.pm (_check_get_children): Support
1470 for |noscript| in |head|.
1471
1472 2007-08-12 Wakaba <wakaba@suika.fam.cx>
1473
1474 * URI/: New directory.
1475
1476 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1477
1478 * HTML.pm.src: Tokenizer's states are now represented in
1479 number.
1480
1481 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1482
1483 * HTML.pm.src: |or|s for insertion modes are replaced
1484 by |&|s.
1485
1486 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1487
1488 * HTML.pm.src: Token types are now represented in number.
1489
1490 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1491
1492 * ContentType.pm (SEE ALSO): Updated.
1493
1494 * HTML.pm.src: Insertion modes are now represented in number.
1495
1496 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1497
1498 * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1499 999) is implemented.
1500
1501 2007-08-08 Wakaba <wakaba@suika.fam.cx>
1502
1503 * ContentType.pm: Sniffing for |<script| (HTML5 revision
1504 983) is implemented.
1505
1506 2007-08-06 Wakaba <wakaba@suika.fam.cx>
1507
1508 * ContentChecker.pod: New documentation.
1509
1510 * Makefile: A rule for |ContentChecker.html| is added.
1511
1512 * ContentChecker.pm: A pod "LICENSE" section is added.
1513
1514 * NanoDOM.pm ($VERSION): New variable.
1515
1516 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1517
1518 * H2H.pm: |b|, |i|, and |sub| are added to the
1519 list of allowed HTML elements.
1520
1521 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1522
1523 * H2H.pm: |samp| is added to the list of allowed
1524 HTML elements.
1525
1526 * URIChecker.pm (check_iri): New.
1527 (check_iri_reference): Error type for IRI reference
1528 syntax error is changed.
1529
1530 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1531
1532 * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1533 is added.
1534 (check_document): Load appropriate module before validation.
1535
1536 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1537
1538 * ContentChecker/: New directory.
1539
1540 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1541
1542 * ContentChecker.pm: HTML |time| element is implemented.
1543
1544 * HTMLTable.pm: Comments are updated as HTML5 is revised.
1545
1546 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1547
1548 * ContentChecker.pm (check_document): Return value
1549 even if no document element is found.
1550
1551 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1552
1553 * HTML.pm.src: |$in_body| is no longer a function.
1554
1555 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1556
1557 * HTML.pm.src: The |$in_body| code has been moved down.
1558
1559 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1560
1561 * HTML.pm.src: The "trailing end" insertion mode
1562 is split into "after html body" and "after html frameset"
1563 insertion modes. Their codes are merged with "after body"
1564 and "after frameset" codes. |$previous_insertion_mode|
1565 has been removed. "after frameset" code is
1566 merged with "in frameset" code.
1567
1568 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1569
1570 * HTML.pm.src: The "before head" insertion mode is
1571 merged with the "in head" insertion mode.
1572
1573 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1574
1575 * HTML.pm.src: Cases in "in head" insertion mode are
1576 reorganized.
1577
1578 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1579
1580 * HTML.pm.src: Some cases in "in table" insertion mode
1581 are merged.
1582
1583 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1584
1585 * HTML.pm.src: The "in row" insertion mode is merged
1586 with "in table" insertion mode.
1587
1588 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1589
1590 * HTML.pm.src: The "in table" and "in table body" insertion
1591 modes are merged.
1592
1593 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1594
1595 * HTML.pm.src: There is no "in table head" or "in table foot"
1596 insertion mode!
1597
1598 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1599
1600 * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
1601 now directly invoke the handler.
1602
1603 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1604
1605 * HTML.pm.src: Codes for "in cell" insertion mode
1606 is merged to the "in body" insertion mode code.
1607
1608 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1609
1610 * HTML.pm.src: Codes for "in body" and "in caption"
1611 insertion modes are merged.
1612
1613 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1614
1615 * HTML.pm.src: Two |!!!next-token|s were missing.
1616
1617 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1618
1619 * HTML.pm.src: Use numeric constant for |{content_mode}|
1620 instead of string constant for |{content_model_flag}|.
1621
1622 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1623
1624 * HTML.pm.src: Add the name of the attribute
1625 to the "duplicate attribute" error.
1626
1627 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1628
1629 * ContentChecker.pm: Return the |class| node list.
1630
1631 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1632
1633 * ContentChecker.pm: Return the |id| node list.
1634
1635 * HTML.pm.src: A typo is fixed.
1636
1637 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1638
1639 * ContentChecker.pm: Drop wrong |level => 'error'| specification
1640 from "in HTML:xml:lang" error. Character position
1641 is now the last part of the error type in the URI error
1642 description. Report "unsupported" status for language
1643 tags, media queries, script codes, and style sheets.
1644
1645 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1646
1647 * ContentChecker.pm: Report error if |xml:lang|
1648 in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
1649 in XML.
1650
1651 * NanoDOM.pm (Attr.owner_document): New attribute.
1652
1653 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1654
1655 * HTML.pm.src: The character immediately following
1656 a bare |hcro| was discarded. Fix handling of
1657 entity references in attribute values.
1658
1659 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1660
1661 * HTML.pm.src (main and trailin end phases): Token
1662 types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
1663 factored out. Error types |in html:#DOCTYPE|
1664 and |after html:#DOCTYPE| are merged
1665 into |DOCTYPE in the middle|. |</frameset>|
1666 in fragment parsing mode changed the
1667 insertion mode.
1668
1669 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1670
1671 * HTML.pm.src: |$phase| has been removed; The |trailing end|
1672 phase is now an insertion mode. Treatments for white
1673 space character tokens were incorrect for some
1674 insertion modes. An old |meta| case was not removed.
1675
1676 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1677
1678 * HTML.pm.src: |meta| charset declaration extraction
1679 implemented (but changing the encoding is not yet:-).
1680
1681 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1682
1683 * Charset/: New directory.
1684
1685 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1686
1687 * H2H.pm: New Perl module (created from
1688 manakai's H2H.dis).
1689
1690 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1691
1692 * XMLSerializer.pm: New Perl module (created from
1693 manakai's SimpleLS.dis).
1694
1695 2007-07-07 Wakaba <wakaba@suika.fam.cx>
1696
1697 * HTML.pm.src: |<!---x-->| was not processed correctly.
1698
1699 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1700
1701 * HTML.pm.src: Report correct error message
1702 for |<body></div></body>|.
1703
1704 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1705
1706 * HTMLTable.pm: An error description was incorrect.
1707
1708 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1709
1710 * ContentChecker.pm: Return |{term}| list.
1711
1712 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1713
1714 * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
1715 nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
1716 and <title> outside of head).
1717
1718 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1719
1720 * IMTChecker.pm: Report warning for unregistered
1721 and private types/subtypes.
1722
1723 * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
1724 URIChecker.pm, HTMLTable.pm: Error messages are now
1725 consistent; they are all listed in
1726 <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
1727
1728 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1729
1730 * ContentChecker.pm: |<img ismap>| not in |<a></a>|
1731 is now erred. |<datalist>| is implemented.
1732 Attribute checker for |<command>| and |<menu>| are
1733 added. Support for |contextmenu| global attribute
1734 is added.
1735
1736 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1737
1738 * HTML.pm.src (_reset_insertion_mode): Interpretation
1739 of Step 3 has been changed.
1740
1741 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1742
1743 * HTML.pm.src: Late |<html>| parse error is implemented.
1744
1745 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1746
1747 * URIChecker.pm (check_iri_reference): A |decode| method name was
1748 incorrect.
1749
1750 * ContentChecker.pm: Support for the |footer| element.
1751 Check URI syntax for space-separated URI attributes.
1752 Support for the |tabindex| attribute. Support
1753 for |datetime| attribute.
1754
1755 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1756
1757 * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
1758 character references). HTML5 revision 1.146 (white space
1759 characters before root start tag). HTML5 revision
1760 1.148 (named character references in attribute values).
1761 HTML5 revision 1.152 (<plaintext>.innerHTML get).
1762
1763 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1764
1765 * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
1766 in <head>).
1767
1768 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1769
1770 * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>
1771 in body).
1772
1773 * ContentChecker.pm: HTML5 revision 938 (scoped="").
1774
1775 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1776
1777 * HTML.pm.src: HTML5 revision 923 (matching end tag in CDATA
1778 or RCDATA in fragment parsing mode). HTML5 revision
1779 924 (<!--> and <!--->). HTML5 revision 926 (hn in hn).
1780
1781 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1782
1783 * HTML.pm.src (get_inner_html): HTML5 revision 922 (inner_html
1784 for <pre> and <textarea>).
1785
1786 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1787
1788 * HTML.pm.src: HTML5 revision 920 (<isindex>).
1789
1790 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1791
1792 * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).
1793
1794 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1795
1796 * HTML.pm.src: HTML5 revision 916 (</body>).
1797 HTML5 revision 917 (conforming bare &).
1798
1799 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1800
1801 * NanoDOM.pm (manakai_is_html): Setting to false did not work.
1802
1803 * HTML.pm.src: HTML5 revision 914 (</ in CDATA, RCDATA).
1804 HTML5 revision 915 (<nobr>).
1805
1806 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1807
1808 * HTML.pm.src: HTML5 revisions 908, 909, 912, and 913 (quirks mode).
1809
1810 * NanoDOM.pm (manakai_is_html, manakai_compat_mode, compat_mode):
1811 New attributes.
1812
1813 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1814
1815 * HTML.pm.src: HTML5 revisions 900, 901, 902, and 911 (<
1816 in tags).
1817
1818 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1819
1820 * .cvsignore: |Entities.html| is added.
1821
1822 * HTML.pm.src: |$entity_char| is removed and
1823 requires |Whatpm::_NamedEntityList| instead.
1824 HTML5 revision 898 (refc), except that lack of refc
1825 is parse error.
1826
1827 * mkentitylist.pl: New script.
1828
1829 * Makefile (all): |_NamedEntityList.pm| is added.
1830 (_NamedEntityList.pm, Entities.html): New rules.
1831
1832 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1833
1834 * HTML.pm.src: Parse errors immediately after U+000D
1835 were ignored and U+000D immediately following another
1836 U+000D was not converted to U+000A.
1837
1838 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1839
1840 * HTML.pm.src (set_inner_html): HTML5 revision 892 (adopt
1841 nodes before appended). Parser was not ready for NULL
1842 parse error and escape flag.
1843
1844 * NanoDOM.pm (adopt_node): New.
1845
1846 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1847
1848 * HTML.pm.src: HTML5 revision 886 (insane comment in
1849 CDATA and RCDATA). Note that current implementation
1850 is simply repeating what the spec says and it is maybe not
1851 a best way to do it.
1852
1853 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1854
1855 * HTML.pm.src: HTML5 revision 884 (</form> don't close
1856 the form element if a descendant element without implied end tag has
1857 still been open).
1858
1859 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1860
1861 * ContentChecker.pm: HTML5 revision 881 (Make |id| attribute with
1862 space characters non-conforming).
1863
1864 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1865
1866 * HTML.pm.src: An error message was incorrect.
1867 HTML5 revision 869 (C1 character references).
1868
1869 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1870
1871 * HTML.pm.src: HTML5 revision 867 (a LF at the beginning of
1872 a |textarea| is removed).
1873
1874 2007-06-05 Wakaba <wakaba@suika.fam.cx>
1875
1876 * NanoDOM.pm (get_attribute_node_ns): New method.
1877
1878 * ContentChecker.pm: |script| |async| and |defer|
1879 no longer require |src|. |async| MUST NOT be
1880 specified if |defer|. (HTML5 revision 858).
1881
1882 2007-05-30 Wakaba <wakaba@suika.fam.cx>
1883
1884 * HTML.pm.src: |<form><form>| went to inifinite loop.
1885
1886 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1887
1888 * ContentChecker.pm (html): Set |is_root| (allowed
1889 as a document element) flag on.
1890 (new): Removed.
1891 (check_document): New method.
1892
1893 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1894
1895 * ContentChecker.pm (thead, tfoot): Checker specifications
1896 were incorrect.
1897
1898 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1899
1900 * ContentChecker.pm ($HTMLURIAttrChecker): Include
1901 error position in the |type| option of the error.
1902
1903 * HTMLTable.pm (form_table): The |$onerror| parameter
1904 is now optional. Some bugs are fixed.
1905
1906 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1907
1908 * HTMLTable.pm: New module.
1909
1910 * ContentChecker.pm (table): Invoke table model error checker.
1911
1912 * NanoDOM.pm (first_child, get_attribute_ns): New.
1913
1914 2007-05-26 Wakaba <wakaba@suika.fam.cx>
1915
1916 * ContentChecker.pm ($HTMLLinkTypesAttrChecker): New checker.
1917 (link/@rel, a/@rel, area/@rel): Use new checker.
1918
1919 * Makefile (_LinkTypeList.pm, RelExtensions.html): New rules.
1920
1921 * _LinkTypeList.pm: New file.
1922
1923 * mklinktypelist.pl: New file.
1924
1925 * .cvsignore: |RelExtensions.html| added.
1926
1927 * NanoDOM.pm (child_nodes): Returns an empty array
1928 for non-child-containing node types.
1929 (text_content): New attribute.
1930
1931 2007-05-26 Wakaba <wakaba@suika.fam.cx>
1932
1933 * IMTChecker.pm: New module.
1934
1935 * ContentChecker.pm ($HTMLIMTAttrChecker): Call IMTChecker
1936 to test parameter value validity.
1937
1938 * HTML.pm.src ($style_start_tag): Attributes were
1939 discarded.
1940
1941 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1942
1943 * ContentChecker.pm ($HTMLURIAttrChecker): Implemented.
1944
1945 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1946
1947 * URIChecker.pm: All recommendations from RFC 3986
1948 and RFC 3987 are listed (not all testable items are checked yet).
1949
1950 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1951
1952 * URIChecker.pm: New module.
1953
1954 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1955
1956 * ContentChecker.pm: Now most attributes are implemented
1957 or associated to some placeholder.
1958 ($ElementDefault): Warn unknown attributes
1959 for unknown elements as "attribute not supported".
1960 ($HTMLLanguageTagAttrChecker, $HTMLMQAttrChecker): New placeholders.
1961 ($HTMLUsemapAttrChecker, $HTMLTargetAttrChecker): New checkers.
1962 (|a| attribute checker): Reimplemented.
1963
1964 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1965
1966 * ContentChecker.pm ($HTMLEventHandlerAttrChecker): New placeholder.
1967 ($HTMLAttrChecker): Event handler content attributes
1968 are added.
1969 (link, embed): Required attribute is now checked.
1970 (embed): Unknown local attributes are no longer warned.
1971
1972 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1973
1974 * ContentChecker.pm ($HTMLSpaceURIsAttrChecker): New placeholder.
1975 ($HTMLIMTAttrChecker): New checker.
1976 (link@rel, link@href, link@type, style@type,
1977 a@href, a@ping, a@ping, a@type, embed@src, embed@type,
1978 object@data, object@type, source@src, source@type, area@alt,
1979 area@shape, area@coords, area@href,
1980 area@ping, area@rel, area@type, script@src,
1981 script@defer, script@async, script@type): Checkers added.
1982
1983 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1984
1985 * ContentChecker.pm: Descendant checking was incorrect.
1986
1987 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1988
1989 * ContentChecker.pm: Support |xml:*| and |xmlns:*|
1990 attributes. Report an error if |Element.prefix|
1991 is |xmlns|.
1992
1993 * NanoDOM.pm (prefix): New attribute.
1994
1995 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1996
1997 * HTML.pm.src: In |main| phase, |in body| insertion
1998 mode, action for |<iframe>| was missing.
1999
2000 2007-05-19 Wakaba <wakaba@suika.fam.cx>
2001
2002 * ContentChecker.pm: Support for many of HTML5 elements.
2003 ($GetHTMLNonNegativeIntegerAttrChecker): New.
2004
2005 2007-05-19 Wakaba <wakaba@suika.fam.cx>
2006
2007 * ContentChecker.pm: Support for most elements up to |progress|.
2008 ($HTMLURIAttrChecker): Placeholder.
2009 ($HTMLIntegerAttrChecker, $GetHTMLFloatingPointNumberAttrChecker):
2010 New.
2011
2012 2007-05-19 Wakaba <wakaba@suika.fam.cx>
2013
2014 * ContentChecker.pm: Attribute checkers
2015 for global attributes, |html|, |base|, |style|, and |meta|.
2016
2017 * NanoDOM.pm (insert_before): Weaken reference
2018 to the parent node.
2019 (Attr::new): Set |owner_element| attribute.
2020 (namespace_uri, manakai_local_name): New attribute implementations.
2021 (owner_element): New attribute.
2022
2023 2007-05-19 Wakaba <wakaba@suika.fam.cx>
2024
2025 * ContentChecker.pm ($AttrChecker, $HTMLAttrChecker,
2026 $AnyChecker->{attr_checker}, $HTMLAttrsChecker,
2027 $Element->{$HTML_NS}->{''}): New.
2028 (check_element): Invoke attrs_checker for each element.
2029
2030 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2031
2032 * ContentChecker.pm: Don't use |manakai_element_type_match|.
2033
2034 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2035
2036 * ContentChecker.pm: Use hashs rather than lists for
2037 element type testings.
2038
2039 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2040
2041 * ContentChecker.pm: Don't generate duplicate
2042 error when an element type is put in the "minus" list
2043 and the element type is not allowed explicitly in the particular
2044 element content model.
2045 (html:a checker): New checker.
2046 (html:details, html:datagrid): New checkers.
2047 (html:legend): New checker.
2048
2049 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2050
2051 * ContentChecker.pm (html:li checker): Implemented.
2052
2053 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2054
2055 * ContentChecker.pm ($HTMLInlineOrStriclyInlineChecker): New
2056 checker.
2057 (html:dd checker): New checker.
2058 (html:q, html:em, html:strong, html:small,
2059 html:m, html:dfn, html:code, html:samp, html:span): New checkers.
2060
2061 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2062
2063 * ContentChecker.pm ($AnyChecker): Renamed
2064 from |$ElementDefault->{checker}|.
2065 ($ElementDefault->{checker}): Throw an error that
2066 the element type is not supported by the checker.
2067 ($HTMLMetadataElement): |html:base| was missing.
2068 ($HTMLEmptyChecker): Don't throw an error
2069 for inter-element whitespace nodes.
2070 (html:html checker): Errors were not
2071 thrown even if |html:head| and/or |html:body|
2072 children were missing.
2073 (html:head checker): An error was not
2074 thrown if <meta charset> appered after other
2075 elements.
2076
2077 2007-05-05 Wakaba <wakaba@suika.fam.cx>
2078
2079 * ContentChecker.pm: |footer|, |video|, |audio|, |script|,
2080 and |noscript| elements are implemented.
2081 (new): New method.
2082
2083 2007-05-04 Wakaba <wakaba@suika.fam.cx>
2084
2085 * ContentChecker.pm: New module.
2086
2087 2007-05-04 Wakaba <wakaba@suika.fam.cx>
2088
2089 * NanoDOM.pm (manakai_parent_element,
2090 document_element, manakai_local_name,
2091 manakai_element_type_match): New method.
2092
2093 2007-05-03 Wakaba <wakaba@suika.fam.cx>
2094
2095 * HTML.pm.src: Replace decimal and hexadecimal numeric
2096 entities in C1 range using Windows-1252 mapping. Bare LF
2097 did not count as new line for error reporting.
2098
2099 2007-05-02 Wakaba <wakaba@suika.fam.cx>
2100
2101 * NanoDOM.pm (DOMImplementation): New class.
2102 (append_child): Weaken the |parent_node| reference.
2103 (create_element_ns, Element new): Set the |owner_document|
2104 reference.
2105 (implementation): New attribute.
2106 (owner_document, local_name, namespace_uri): New attributes.
2107
2108 * HTML.pm.src (parse_string): Line and column numbers
2109 are now provided to error handler.
2110 (!!!parse-error): Short descriptions are added.
2111 (_construct_tree): Split into three methods; support
2112 for innerHTML mode.
2113 (set_inner_html): New method.
2114
2115 2007-05-01 Wakaba <wakaba@suika.fam.cx>
2116
2117 * NanoDOM.html: Documentation is added.
2118
2119 * HTML.pod, ContentType.html: Documentation is revised.
2120
2121 * .cvsignore: Pod2html temporary files are added.
2122
2123 * Makefile: Make |NanoDOM.html|.
2124
2125 2007-05-01 Wakaba <wakaba@suika.fam.cx>
2126
2127 * HTML.pm.src (parse_string): New method.
2128 (get_inner_html): Renamed from |inner_html|.
2129
2130 * Makefile: A rule for |HTML.html| is added.
2131
2132 * HTML.pod: New documentation.
2133
2134 2007-05-01 Wakaba <wakaba@suika.fam.cx>
2135
2136 * NanoDOM.pm (last_child, previous_sibling): New attributes.
2137 (clone_node): Attribute nodes were not completely copied.
2138
2139 * HTML.pm.src: Many bugs are fixed.
2140
2141 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2142
2143 * HTML.pm.src: Some typos are fixed.
2144
2145 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2146
2147 * mkhtmlparser.pl, Makefile: References to the |HTML-consume-entity.src|
2148 are removed.
2149
2150 * HTML.pm.src: Tokenizer's handling on named entities are rewritten.
2151
2152 * HTML-consume-entity.src: Removed.
2153
2154 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2155
2156 * HTML.pm.src: Tokenizer's handling on hexadecimal
2157 numeric entities are rewritten.
2158
2159 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2160
2161 * HTML.pm.src: Some tokenizer bugs are fixed.
2162
2163 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2164
2165 * HTML.pm.src: The tree construction stage is implemented.
2166
2167 * mkhtmlparser.pl: New macros are added.
2168
2169 2007-04-28 Wakaba <wakaba@suika.fam.cx>
2170
2171 * ContentType.pm: A note on bug in the specification
2172 is removed since it's been now fixed.
2173
2174 * .cvsignore: New file.
2175
2176 2007-04-28 Wakaba <wakaba@suika.fam.cx>
2177
2178 * HTML.pm.src, HTML-consume-entity.src: New files.
2179
2180 * Makefile (HTML.pm): New rule.
2181
2182 * mkhtmlparser.pl: New script.
2183
2184 2007-04-25 Wakaba <wakaba@suika.fam.cx>
2185
2186 * Makefile: New file.
2187
2188 2007-04-24 Wakaba <wakaba@suika.fam.cx>
2189
2190 * ContentType.pm: An error in pod is fixed.
2191
2192 2007-04-24 Wakaba <wakaba@suika.fam.cx>
2193
2194 * ContentType.pm: Documentation is added.
2195
2196 2007-04-24 Wakaba <wakaba@suika.fam.cx>
2197
2198 * ContentType.pm: New Perl module.
2199
2200 * ChangeLog: New file.
2201
2202

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24