/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Contents of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.311 - (show annotations) (download)
Mon Sep 15 07:19:03 2008 UTC (16 years, 10 months ago) by wakaba
Branch: MAIN
Changes since 1.310: +8 -0 lines
++ whatpm/Whatpm/ChangeLog	15 Sep 2008 07:17:34 -0000
	* HTML.pm.src: Remove checking for control character, surrogate
	pair, or noncharacter code points and non-Unicode code
	points (they should be handled by Whatpm::Charset::UnicodeChecker).
	(parse_char_stream): Support for the |$get_wrapper| argument and
	character stream error handlers.

2008-09-15  Wakaba  <wakaba@suika.fam.cx>

++ whatpm/Whatpm/Charset/ChangeLog	15 Sep 2008 07:18:45 -0000
	* DecodeHandle.pm (onerror): Return |undef| if no explicit value
	is set.

	* UnicodeChecker.pm: Support for HTML5 parse errors.
	(onerror): Return |undef| if no explicit value is set.

2008-09-15  Wakaba  <wakaba@suika.fam.cx>

1 2008-09-15 Wakaba <wakaba@suika.fam.cx>
2
3 * HTML.pm.src: Remove checking for control character, surrogate
4 pair, or noncharacter code points and non-Unicode code
5 points (they should be handled by Whatpm::Charset::UnicodeChecker).
6 (parse_char_stream): Support for the |$get_wrapper| argument and
7 character stream error handlers.
8
9 2008-09-15 Wakaba <wakaba@suika.fam.cx>
10
11 * ContentChecker.pm: Don't call |loda_ns_module|
12 for null-namespace elements/attributes.
13
14 * HTML.pm.src: Fact out $disallowed_control_chars
15 as a hash.
16
17 2008-09-14 Wakaba <wakaba@suika.fam.cx>
18
19 * HTML.pm.src: Regexp typo fixed. |{prev_char}|
20 and |{next_char}| initializations are moved to initialization
21 method. |{read_until}| now supports buffering. Sync |set_inner_html|
22 with |parse_char_stream|.
23
24 2008-09-14 Wakaba <wakaba@suika.fam.cx>
25
26 * HTML.pm.src (parse_char_stream): Make |set_next_char|
27 invoke |manakai_read_until|, not only |read|, where
28 possible, to decrease the number of |read| method calls.
29
30 * mkhtmlparser.pl: Related changes to the aforementioned
31 modification.
32
33 2008-09-14 Wakaba <wakaba@suika.fam.cx>
34
35 * HTML.pm.src: Use |read| instead of |getc|. |set_inner_html|
36 would report character error from now.
37
38 2008-09-14 Wakaba <wakaba@suika.fam.cx>
39
40 * HTML.pm.src: White-space-leaded non-white-space character
41 tokens in "before head insertion mode" was not
42 correctly handled.
43 (set_inner_html): Reimplemented using CharString decodehandle
44 class. Support for $get_wrapper argument. Support
45 for |{read_until}| feature.
46
47 2008-09-14 Wakaba <wakaba@suika.fam.cx>
48
49 * HTML.pm.src: Make a "bare ero" error for unknown
50 entities point the "&" character.
51
52 2008-09-14 Wakaba <wakaba@suika.fam.cx>
53
54 * HTML.pm.src: It turns out that U+FFFD don't have to
55 be added to the list of excluded characters.
56
57 2008-09-14 Wakaba <wakaba@suika.fam.cx>
58
59 * HTML.pm.src ($char_onerror): Have character decoder's |line|
60 and |column| a higher priority than the one set by the
61 tokenizer's input handler.
62 ($self->{read_until}): Exclude U+FFFD (but this might
63 not be necessary, since now we do line/column fixup in
64 the character decode handle).
65
66 2008-09-14 Wakaba <wakaba@suika.fam.cx>
67
68 * HTML.pm.src: Use |{read_until}| where possible.
69
70 2008-09-14 Wakaba <wakaba@suika.fam.cx>
71
72 * HTML.pm.src: Change |{getc_until}| to |{read_until}|
73 and |manakai_getc_until| to |manakai_read_until| to
74 reduce the number of string copies.
75
76 2008-09-14 Wakaba <wakaba@suika.fam.cx>
77
78 * HTML.pm.src (parse_char_string): Use newly created
79 |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
80 standard feature to |open| a string as a filehandle,
81 since Perl's string filehandle seems not supporting |ungetc|
82 method correctly.
83 (parse_char_stream): Define |{getc_until}| method.
84 (DATA_STATE): Experimental support for |getc_until| feature.
85
86 2008-09-13 Wakaba <wakaba@suika.fam.cx>
87
88 * HTML.pm.src: Check points added to newly added branches.
89
90 2008-09-13 Wakaba <wakaba@suika.fam.cx>
91
92 * HTML.pm.src: Remove |{char}|, which is no longer used.
93 Remove |{entity_in_attr}| and |{last_attribute_value_state}|
94 and replaced by |{prev_state}|.
95
96 * mkhtmlparser.pl: Remove |{char}| feature.
97 Remove |!!!back-next-input-character;| macro.
98
99 2008-09-13 Wakaba <wakaba@suika.fam.cx>
100
101 * HTML.pm.src: Finally we get rid of all the inner loops. Remove
102 entity related tokenizer states in favor of new states
103 implementing the consume character reference algorithm.
104
105 2008-09-13 Wakaba <wakaba@suika.fam.cx>
106
107 * HTML.pm.src: "Consume a character reference" algorithm is
108 now implemented as a tokenizer's state, rather than
109 a method, with minimum changes (more changes will
110 be made, in due course). "Bogus comment state"'s inner
111 loop gets removed.
112
113 2008-09-13 Wakaba <wakaba@suika.fam.cx>
114
115 * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
116 into their own tokenizer states.
117
118 2008-09-13 Wakaba <wakaba@suika.fam.cx>
119
120 * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
121 is split into three states.
122
123 2008-09-13 Wakaba <wakaba@suika.fam.cx>
124
125 * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
126 itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
127 no longer does the tokenizer have to push back next input
128 characters in those states.
129
130 2008-09-13 Wakaba <wakaba@suika.fam.cx>
131
132 * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
133 into four states so that no longer does the tokenizer have to push
134 back next input characters in that state.
135
136 2008-09-11 Wakaba <wakaba@suika.fam.cx>
137
138 * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
139 which can be used to insert some wrapper between the character
140 stream handle and the tokenizer. (It is currently not supported
141 for |set_inner_html| for |Element|s).
142
143 2008-09-10 Wakaba <wakaba@suika.fam.cx>
144
145 * HTML.pm.src: Ignore punctuations in charset names.
146
147 2008-09-10 Wakaba <wakaba@suika.fam.cx>
148
149 * ContentChecker.pm: Support for charset-layer error levels.
150
151 * HTML.pm.src: Don't specify |text| argument for the
152 |chardecode:fallback| error, since it is not the encoding
153 being used alternatively.
154
155 2008-09-06 Wakaba <wakaba@suika.fam.cx>
156
157 * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
158
159 2008-08-31 Wakaba <wakaba@suika.fam.cx>
160
161 * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
162
163 2008-08-31 Wakaba <wakaba@suika.fam.cx>
164
165 * HTML.pm.src: Bug fix and sync with the spec with regard
166 to after after frameset insertion mode processing (HTML5
167 revision 1909). Note that the implementation was wrong
168 per the old spec before the r1909 changes.
169
170 2008-08-30 Wakaba <wakaba@suika.fam.cx>
171
172 * HTMLTable.pm: scope=auto algorithm fix synced with the
173 spec (HTML5 revision 2093).
174 ($process_row): Algorithm step numbers synced with the
175 spec (HTML5 revision 2092).
176
177 2008-08-30 Wakaba <wakaba@suika.fam.cx>
178
179 * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
180 revision 2094).
181
182 2008-08-30 Wakaba <wakaba@suika.fam.cx>
183
184 * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
185
186 2008-08-30 Wakaba <wakaba@suika.fam.cx>
187
188 * HTML.pm.src: '"' and "'" at the end of attribute
189 name (after another attribute) now raise parse error (HTML5
190 revision 2123). Empty unquoted attribute values are no
191 longer allowed (HTML5 revision 2122).
192
193 2008-08-30 Wakaba <wakaba@suika.fam.cx>
194
195 * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
196 revision 2130).
197
198 2008-08-30 Wakaba <wakaba@suika.fam.cx>
199
200 * ContentChecker.pm: |xml:lang| attribute value must be same
201 as |lang| attribute value for HTML elements (HTML5 revision 2062
202 and so on).
203
204 2008-08-30 Wakaba <wakaba@suika.fam.cx>
205
206 * ContentChecker.pm: Error level definition for |xml_id_error|
207 was missing.
208
209 * URIChecker.pm: The end of the URL should be marked as the
210 error location for an empty path error. The position
211 between the userinfo and the port components should be
212 marked as the error location for an empty host error.
213
214 2008-08-30 Wakaba <wakaba@suika.fam.cx>
215
216 * URIChecker.pm: Set parameters representing where in the
217 value the error occurs for errors. Report unknown
218 address format error in warning level, since address
219 formats are rarely added. Path segments starting with "/.."
220 were misinterpreted as a dot-segment.
221
222 2008-08-30 Wakaba <wakaba@suika.fam.cx>
223
224 * URIChecker.pm (check_iri_reference): Requires
225 |Message::DOM::DOMImplementation|.
226
227 2008-08-29 Wakaba <wakaba@suika.fam.cx>
228
229 * IMTChecker.pm: Updated for the new error reporting architecture.
230
231 * ContentChecker.pm: Error levels for IMTs are added.
232
233 2008-08-17 Wakaba <wakaba@suika.fam.cx>
234
235 * H2H.pm (_shift_token): Support for unquoted HTML attribute
236 values.
237
238 2008-08-16 Wakaba <wakaba@suika.fam.cx>
239
240 * CacheManifest.pm: Support for new style of error
241 reports.
242
243 * HTML.pm.src: Set line=1, column=1 to the document node.
244
245 2008-08-16 Wakaba <wakaba@suika.fam.cx>
246
247 * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
248 and URL checkers. Support for more error levels for bogus
249 langauge tag and URL "standards".
250
251 * LangTag.pm, URIChecker.pm: Support for new style error
252 level reporting.
253
254 2008-08-15 Wakaba <wakaba@suika.fam.cx>
255
256 * ContentChecker.pm: Support for RDF/XML error levels.
257
258 * HTMLTable.pm, RDFXML.pm: Support for new style of error level
259 specifying. Error types are revised.
260
261 2008-08-15 Wakaba <wakaba@suika.fam.cx>
262
263 * ContentChecker.pm: All error reporting method calls are
264 renewed.
265
266 2008-08-15 Wakaba <wakaba@suika.fam.cx>
267
268 * HTML.pm.src: All error type names and "text" parameters
269 are revised. Use new style for "level" specification.
270
271 * mkhtmlparser.pl: Use new style for "level" specification.
272
273 2008-08-03 Wakaba <wakaba@suika.fam.cx>
274
275 * WebIDL.pm (parse_char_string): Simplified error
276 reporting process for broken ignored valuetype definition.
277 (Valuetype idl_text): Support for special "DOMString" name.
278
279 2008-08-03 Wakaba <wakaba@suika.fam.cx>
280
281 * WebIDL.pm ($get_scoped_name): Append "::::" if the last
282 terminal of the ScopedName is "DOMString", such that whether
283 the last part of the scoped name is "DOMString" or "_DOMString"
284 later. It is necessary to determine whether a |typedef|
285 definition should be ignored or not.
286 (parse_char_string): Unescape the identifier of
287 exception members.
288 ($resolve): Return undef for builtin types and sequence<T>
289 types (we might not have to do this, however...).
290 (check): Support checking for Exceptions, Valuetypes,
291 and Typedefs.
292 ($serialize_type): Support for "DOMString::::" syntax.
293 (Typedef idl_text): Output Type as "DOMString" if it
294 is really "DOMString" (i.e. its internal representation
295 is "::DOMString::").
296
297 2008-08-03 Wakaba <wakaba@suika.fam.cx>
298
299 * WebIDL.pm ($resolve): New code, based on resolve code
300 for constant types in the |check| method.
301 (check): Support for checking of attributes, operations, and
302 arguments.
303 (Attribute/Operation idl_text): Exception names in getraises,
304 setraises, and raises clauses is serizlied by |$serialize_type|
305 code.
306
307 2008-08-02 Wakaba <wakaba@suika.fam.cx>
308
309 * WebIDL.pm ($integer): Order of selections are changed to match
310 hexadecimal numbers (the original pattern, taken from the spec,
311 was not work for hexadecimal numbers, because the "0" prefix
312 matches to the [0-7]* part (as an empty string) and therefore
313 it does not match with remaining "x..." part of a "0x..." integer
314 literal.
315 ($get_type): It now returns a string, not an array reference,
316 for regular types and |sequence| types (i.e. it in any case
317 returns a string).
318 ($get_next_token): The second item in the array that represents
319 a integer or float token is now a Perl number value, not the
320 original string representation of the number.
321 (check): Support for const value consistency checking.
322 No extended attribute is defined for constants.
323 (Node subclasses): Use simple strings rather than array references
324 for default data type values.
325 ($serialize_type): Type values are now simple strings.
326 (value): If the new attribute value is a false value, then
327 a FALSE value is set to the attribute.
328
329 2008-08-02 Wakaba <wakaba@suika.fam.cx>
330
331 * WebIDL.pm ($get_scoped_name): Now scoped names are stored
332 in its stringified format ("scoped name" as defined in the
333 spec). Note that future version of this module should not use
334 array references for type values and the |type_text| attribute
335 should be made obsolete.
336 (parse_char_string): Unescape attribute names.
337 (check): Support for checking of whether inherited interfaces
338 are actually defined or not. Support for checking of whether
339 interface member identifiers are duplicated or not.
340 ($serialize_type): Scoped names are returned as is. A future
341 version of this code should escape identifiers other than "DOMString",
342 otherwise the idl_text would be non-conforming.
343
344 2008-08-02 Wakaba <wakaba@suika.fam.cx>
345
346 * WebIDL.pm (parse_char_string): Set line/column numbers
347 to generated nodes. Unescape identifiers. Extended attributes
348 for Definition's were ignored.
349 (append_child): Set |parent_node| attribute.
350 (parent_node): New attribute.
351 (check): Support interface/exception members. Support
352 extended attributes. Support definition identifier uniqueness
353 constraint.
354 (qualified_name): New attribute.
355 (Interface/Exception idl_text): Extended attributes were
356 not prepended to the returned text.
357
358 2008-08-02 Wakaba <wakaba@suika.fam.cx>
359
360 * WebIDL.pm (parse_char_string): Set line/column numbers
361 to interface object experimentally. s/shift/pop/g, shift
362 would make things wrong. Support for interface forward
363 declarations was missing. Broken interface declarations
364 with no block were not ignored entirely.
365 (Whatpm::WebIDL::Node): New abstract class. This class
366 makes things easier.
367 (child_nodes): New attribute. Unlike DOM's attribute with
368 same name, this attribute returns a dead list of nodes for
369 simplicity.
370 (get_user_data, set_user_data): New methods.
371 (Module idl_text): A SPACE character should be inserted
372 before the |{| character.
373 (Interface idl_text): Support for interface forward declarations.
374 (is_forward_declaration): New attribute.
375
376 2008-07-19 Wakaba <wakaba@suika.fam.cx>
377
378 * WebIDL.pm (type_text): Better serializer.
379
380 2008-07-19 Wakaba <wakaba@suika.fam.cx>
381
382 * WebIDL.pm: Revise forward-compatible parsing so that
383 it now can handle broken extended attributes and as such.
384
385 2008-07-19 Wakaba <wakaba@suika.fam.cx>
386
387 * WebIDL.pm: Real support for extended attributes.
388 Support for extended attributes with arguments.
389
390 2008-07-19 Wakaba <wakaba@suika.fam.cx>
391
392 * WebIDL.pm: Support for |exception| syntax.
393 (Interface->idl_text): Tentative support for inheritances.
394
395 2008-07-19 Wakaba <wakaba@suika.fam.cx>
396
397 * WebIDL.pm: Hierarchical scoped name support was broken.
398 Support for raises, setraises, and getraises syntaxes.
399
400 2008-07-18 Wakaba <wakaba@suika.fam.cx>
401
402 * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
403 proper support for types, extended attributes, and exceptions yet).
404 WebIDL parser, version 1 (no support for exceptions yet,
405 no proper support for extended attributes yet).
406
407 2008-07-09 Wakaba <wakaba@suika.fam.cx>
408
409 * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
410
411 2008-06-29 Wakaba <wakaba@suika.fam.cx>
412
413 * WebIDL.pm: Support for valuetype and const.
414
415 2008-06-29 Wakaba <wakaba@suika.fam.cx>
416
417 * WebIDL.pm: New module.
418
419 2008-06-15 Wakaba <wakaba@suika.fam.cx>
420
421 * Makefile (Entities.html): URI changed.
422
423 2008-06-08 Wakaba <wakaba@suika.fam.cx>
424
425 * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
426
427 2008-06-01 Wakaba <wakaba@suika.fam.cx>
428
429 * HTML.pm.src (_get_next_token): A parse error was missing.
430
431 2008-06-01 Wakaba <wakaba@suika.fam.cx>
432
433 * mklinktypelist.pl: rel=contact is no longer part of the HTML5
434 spec (commented out). (HTML5 revision 1711).
435
436 2008-05-25 Wakaba <wakaba@suika.fam.cx>
437
438 * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
439
440 * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
441 as UTF-16 (HTML5 revision 1701).
442
443 2008-05-25 Wakaba <wakaba@suika.fam.cx>
444
445 * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
446 1692).
447
448 2008-05-25 Wakaba <wakaba@suika.fam.cx>
449
450 * HTML.pm.src: The secondary insertion mode used when switching
451 to foreign content is the "in body" insertion mode (HTML5 revision
452 1696).
453
454 2008-05-25 Wakaba <wakaba@suika.fam.cx>
455
456 * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
457 revision 1697).
458
459 2008-05-24 Wakaba <wakaba@suika.fam.cx>
460
461 * HTML.pm.src: Support for end-of-file token in foreign content
462 insertion mode (HTML5 revision 1693). Update SVG camelCase
463 attribute list (HTML5 revision 1700). <textarea> closes
464 </select> (HTML5 revision 1699). More start tags close in
465 foreign content insertion mode (HTML5 revision 1698).
466
467 2008-05-24 Wakaba <wakaba@suika.fam.cx>
468
469 * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
470
471 2008-05-24 Wakaba <wakaba@suika.fam.cx>
472
473 * HTML.pm.src: More robust charset parameter detection (HTML5
474 revision 1674).
475
476 2008-05-24 Wakaba <wakaba@suika.fam.cx>
477
478 * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
479 revision 1676).
480
481 2008-05-24 Wakaba <wakaba@suika.fam.cx>
482
483 * HTML.pm.src: Ignore language part of public identifiers for
484 quriks mode detection (HTML5 revision 1679).
485
486 2008-05-24 Wakaba <wakaba@suika.fam.cx>
487
488 * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
489 revision 1685).
490
491 2008-05-24 Wakaba <wakaba@suika.fam.cx>
492
493 * HTML.pm.src: Support for EOF in new states for tags (HTML5
494 revision 1684).
495
496 2008-05-24 Wakaba <wakaba@suika.fam.cx>
497
498 * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
499 work (HTML5 revision 1690).
500
501 2008-05-24 Wakaba <wakaba@suika.fam.cx>
502
503 * HTML.pm.src (_tree_construction_main): Change handling of
504 end tags in head insertion modes (HTML5 revision 1686).
505 (parse_char_string): Bug fix for non-utf8 character string handlings.
506 (parse_char_stream): |ungetc| does not work well for this context.
507
508 2008-05-18 Wakaba <wakaba@suika.fam.cx>
509
510 * HTML.pm.src (parse_byte_string): Redefined to invoke
511 |parse_byte_stream|.
512 (parse_byte_stream): New method.
513
514 2008-05-18 Wakaba <wakaba@suika.fam.cx>
515
516 * HTML.pm.src (parse_byte_string): Fix the column number reported
517 by encoding layer error reporter.
518
519 2008-05-17 Wakaba <wakaba@suika.fam.cx>
520
521 * HTML.pm.src (parse_byte_string): Use streaming decoder
522 rather than converting the whole byte string and then parsing.
523 Propagate errors in character encoding layer.
524 (get_next_token): Precise error reporting for |bare stago| error.
525
526 2008-05-17 Wakaba <wakaba@suika.fam.cx>
527
528 * HTML.pm.src (parse_char_stream): New method.
529 (parse_char_string): This method is now defined as an invocation
530 of the |parse_char_stream| method.
531
532 2008-05-17 Wakaba <wakaba@suika.fam.cx>
533
534 * HTML.pm.src (parse_byte_string): Report various status
535 of the sniffing as info-level errors. Support for new
536 decoding framework in parser resestting.
537 (new): Various default error levels were not set.
538
539 2008-05-17 Wakaba <wakaba@suika.fam.cx>
540
541 * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
542 algorithm, except for the actual sniffing, is implemented
543 with new framework with Message::Charset::Info.
544
545 2008-05-16 Wakaba <wakaba@suika.fam.cx>
546
547 * CacheManifest.pm (_parse): Drop fragment identifiers from
548 URIs in fallback section (HTML5 revision 1596).
549
550 2008-05-10 Wakaba <wakaba@suika.fam.cx>
551
552 * Makefile (Entities.html): URI has changed.
553
554 2008-05-10 Wakaba <wakaba@suika.fam.cx>
555
556 * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
557 1553).
558
559 2008-05-06 Wakaba <wakaba@suika.fam.cx>
560
561 * ContentChecker.pm: Noted that those returned in |table| are
562 no longer table elements, but table objects returned
563 by Whatpm::HTMLTable.
564
565 * HTMLTable.pm (form_table): Return table element node
566 as |$table->{element}|.
567 (assign_header): Support for the |headers=""| attribute.
568
569 2008-05-06 Wakaba <wakaba@suika.fam.cx>
570
571 * HTMLTable.pm (assign_header): New function; first version
572 with no support for headers="".
573 (form_table): Include table width and height to the returned
574 table object for covenience. Indexing in column assignement was wrong.
575 Set whether a data cell is empty or not for convenience.
576
577 2008-05-05 Wakaba <wakaba@suika.fam.cx>
578
579 * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
580
581 2008-05-05 Wakaba <wakaba@suika.fam.cx>
582
583 * HTMLTable.pm: How table model errors are detected is
584 changed (HTML5 revision 1387).
585
586 2008-05-05 Wakaba <wakaba@suika.fam.cx>
587
588 * HTMLTable.pm: The algorithm now moves |tfoot| elements
589 to the end of the table (HTML5 revision 1380).
590
591 2008-05-05 Wakaba <wakaba@suika.fam.cx>
592
593 * HTMLTable.pm: The algorithm is now 0-based indexing, instead
594 of 1-based (HTML5 revision 1376).
595
596 2008-05-05 Wakaba <wakaba@suika.fam.cx>
597
598 * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
599 and "Content-Encoding" no longer prevent sniffing (HTML5
600 revision 1288).
601
602 2008-05-05 Wakaba <wakaba@suika.fam.cx>
603
604 * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
605 1282).
606
607 2008-05-03 Wakaba <wakaba@suika.fam.cx>
608
609 * ContentChecker.pm: Support for global attributes.
610 Status of XML specs are added.
611
612 2008-05-03 Wakaba <wakaba@suika.fam.cx>
613
614 * ContentChecker.pm (check_element): Support for |template=""|
615 and |ref=""| attribute (referent element type checking).
616
617 2008-04-29 Wakaba <wakaba@suika.fam.cx>
618
619 * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
620 1500) is implemented (except for IDNA part and URI-scheme-specific
621 knowledge). Line number counting was wrong for LF-only
622 documents.
623
624 2008-04-13 Wakaba <wakaba@suika.fam.cx>
625
626 * HTML.pm.src: Raise an parse error for any disallowed
627 character (HTML5 revision 1263).
628
629 2008-04-13 Wakaba <wakaba@suika.fam.cx>
630
631 * mkentitylist.pl: Support for new HTML5 entity table format (the
632 definition for |AElig;| was missing).
633
634 2008-04-13 Wakaba <wakaba@suika.fam.cx>
635
636 * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
637 name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
638
639 2008-04-13 Wakaba <wakaba@suika.fam.cx>
640
641 * HTML.pm.src: List of element names that close foreign content
642 insertion mode is added (HTML5 revisions 1412 and 1418).
643
644 2008-04-13 Wakaba <wakaba@suika.fam.cx>
645
646 * HTML.pm.src: Support for |mglyph| and |malignmark|
647 elements (HTML5 revision 1410).
648
649 2008-04-13 Wakaba <wakaba@suika.fam.cx>
650
651 * HTML.pm.src: Support for new long MathML entities (HTML5
652 revision 1406).
653
654 2008-04-13 Wakaba <wakaba@suika.fam.cx>
655
656 * HTML.pm.src: CDATA section support for MathML and SVG
657 elements (HTML5 revision 1404 and 1420).
658
659 2008-04-12 Wakaba <wakaba@suika.fam.cx>
660
661 * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
662 elements (HTML5 revision 1404). Unused !!!macro definitions
663 are removed.
664
665 2008-04-12 Wakaba <wakaba@suika.fam.cx>
666
667 * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
668 are raised is changed (HTML5 revision 1404).
669
670 2008-04-06 Wakaba <wakaba@suika.fam.cx>
671
672 * HTML.pm.src: Category sets that are no longer used are removed.
673
674 2008-04-06 Wakaba <wakaba@suika.fam.cx>
675
676 * HTML.pm.src: The ->[1] property of stack entries are now
677 replaced by constants representing element category.
678
679 2008-04-06 Wakaba <wakaba@suika.fam.cx>
680
681 * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
682 for error reporting. (This is a preparation for using constant
683 value for ->[1].)
684
685 2008-03-22 Wakaba <wakaba@suika.fam.cx>
686
687 * RDFXML.pm: Typo fixed.
688
689 2008-03-22 Wakaba <wakaba@suika.fam.cx>
690
691 * ContentChecker.pm: |fact_level| is now treated
692 as same as |must_level|, i.e. level = |m|.
693 (check_element): Make list of URIs in the DOM.
694
695 2008-03-21 Wakaba <wakaba@suika.fam.cx>
696
697 * RDFXML.pm: Language accessor implemented.
698 Local (null-namespace) attribute support.
699
700 2008-03-21 Wakaba <wakaba@suika.fam.cx>
701
702 * RDFXML.pm: Factored out ID checking code.
703
704 2008-03-21 Wakaba <wakaba@suika.fam.cx>
705
706 * RDFXML.pm: TODO items noted. Validation of ID
707 and URI attributes is implemented. Warn if unknown
708 value is used in rdf:parseType="" attribute.
709
710 * URIChecker.pm (check_rdf_uri_reference): New function.
711
712 2008-03-21 Wakaba <wakaba@suika.fam.cx>
713
714 * RDFXML.pm: bnodeid implemented. Relative references
715 are now resolved.
716
717 2008-03-21 Wakaba <wakaba@suika.fam.cx>
718
719 * ContentChecker.pm: RDF reification implemented.
720
721 * RDFXML.pm: undef vs false bug fixed.
722 Reification implemented.
723
724 2008-03-21 Wakaba <wakaba@suika.fam.cx>
725
726 * RDFXML.pm: s/id/ID/ for attribute name.
727 The |node| arguments are added for |ontriple| calls.
728 Too many "attribute not allowed" errors were raised.
729
730 * ContentChecker.pm: Initial experimental support for rdf:RDF
731 element.
732
733 2008-03-21 Wakaba <wakaba@suika.fam.cx>
734
735 * RDFXML.pm: New module.
736
737 2008-03-20 Wakaba <wakaba@suika.fam.cx>
738
739 * HTML.pm.src (set_inner_html): Line/column number
740 code was old one yet.
741
742 2008-03-20 Wakaba <wakaba@suika.fam.cx>
743
744 * HTML.pm.src: Better line/column reporting for "duplicate attribute"
745 errors. Line/column markings for DOCTYPE, comment, and
746 character tokens are reintroduced; otherwise,
747 error location for "not HTML5" error and errors
748 for implied elements are not attached.
749
750 2008-03-20 Wakaba <wakaba@suika.fam.cx>
751
752 * HTML.pm.src: Set line/column numbers to attributes.
753
754 * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
755 (value): Setter implemented.
756
757 * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
758
759 2008-03-20 Wakaba <wakaba@suika.fam.cx>
760
761 * HTML.pm.src: Unused line/column markings are removed.
762
763 2008-03-20 Wakaba <wakaba@suika.fam.cx>
764
765 * HTML.pm.src (_get_next_token): Remove |first_start_tag|
766 flag, which is no longer used.
767
768 2008-03-17 Wakaba <wakaba@suika.fam.cx>
769
770 * HTML.pm.src: Set line/column information to element nodes.
771
772 * mkhtmlparser.pl (!!!create-element, !!!insert-element,
773 and !!!insert-element-t): Set line/column information to
774 element nodes.
775
776 2008-03-17 Wakaba <wakaba@suika.fam.cx>
777
778 * HTML.pm.src (_get_next_token): The first "<" character
779 in "<?", "<>", or "</>" should be the error point.
780
781 2008-03-16 Wakaba <wakaba@suika.fam.cx>
782
783 * HTML.pm.src: Some more fixes on error position reporting.
784
785 2008-03-16 Wakaba <wakaba@suika.fam.cx>
786
787 * HTML.pm.src: Token-level precious error reporting.
788
789 2008-03-16 Wakaba <wakaba@suika.fam.cx>
790
791 * HTML.pm.src: Preparation for more precious error point
792 reporting.
793
794 2008-03-11 Wakaba <wakaba@suika.fam.cx>
795
796 * HTML.pm.src: Error type revised.
797
798 2008-03-11 Wakaba <wakaba@suika.fam.cx>
799
800 * HTML.pm.src: Similar codes are merged together, again.
801
802 2008-03-11 Wakaba <wakaba@suika.fam.cx>
803
804 * HTML.pm.src: Similar codes are merged together.
805
806 2008-03-10 Wakaba <wakaba@suika.fam.cx>
807
808 * mkhtmlparser.pl: Set "level" parameter to parse errors.
809
810 * HTML.pm.src: Code refined.
811
812 2008-03-09 Wakaba <wakaba@suika.fam.cx>
813
814 * HTML.pm.src: |</body>| treatement has been changed (HTML5
815 revision 1348). Note that I really don't know this makes
816 any difference in the black-box behavior of the parser.
817
818 2008-03-09 Wakaba <wakaba@suika.fam.cx>
819
820 * HTML.pm.src: New end-of-file token implementation (HTML5
821 revision 1348).
822
823 2008-03-09 Wakaba <wakaba@suika.fam.cx>
824
825 * HTML.pm.src: |applet| support (HTML5 revision 1347).
826
827 2008-03-09 Wakaba <wakaba@suika.fam.cx>
828
829 * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
830
831 2008-03-09 Wakaba <wakaba@suika.fam.cx>
832
833 * HTML.pm.src: Support for |<input>| in the "in select" insertion
834 mode and support for the "in select in table" insertion mode (HTML5
835 revision 1342).
836
837 2008-03-09 Wakaba <wakaba@suika.fam.cx>
838
839 * HTML.pm.src: No foster parenting for <script> and <script>
840 in non-tainted <table>s (HTML5 revision 1336).
841
842 2008-03-09 Wakaba <wakaba@suika.fam.cx>
843
844 * HTML.pm.src: Ignore white space characters between <html>
845 and <head> (HTML5 revision 1332).
846
847 2008-03-09 Wakaba <wakaba@suika.fam.cx>
848
849 * HTML.pm.src: Treat <input type=hidden> as if it were a
850 white space (HTML5 revision 1331).
851
852 2008-03-08 Wakaba <wakaba@suika.fam.cx>
853
854 * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
855 element (HTML5 revision 1330).
856
857 2008-03-08 Wakaba <wakaba@suika.fam.cx>
858
859 * HTML.pm.src: <title> is always appended to the current
860 element (HTML5 revision 1328).
861
862 2008-03-08 Wakaba <wakaba@suika.fam.cx>
863
864 * HTML.pm.src: White space in tainted tables are moved
865 into foster parents (HTML5 revision 1326).
866
867 2008-03-08 Wakaba <wakaba@suika.fam.cx>
868
869 * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
870 revision 1321).
871
872 2008-03-08 Wakaba <wakaba@suika.fam.cx>
873
874 * HTML.pm.src: |</h/n/>| case code rearranged to align with
875 the spec (HTML5 revision 1320). Note that we finally complete
876 all of HTML5 revision 1320 changes.
877
878 2008-03-08 Wakaba <wakaba@suika.fam.cx>
879
880 * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
881 tags (HTML5 revision 1320).
882
883 2008-03-08 Wakaba <wakaba@suika.fam.cx>
884
885 * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
886
887 2008-03-08 Wakaba <wakaba@suika.fam.cx>
888
889 * HTML.pm.src: A "generate implied end tags" code (t409.1)
890 could not be reached so that it is now removed (HTML5 revision
891 1320).
892
893 2008-03-08 Wakaba <wakaba@suika.fam.cx>
894
895 * HTML.pm.src: Code for the case of |</div>| and so on are revised
896 to align with new spec text (HTML5 revision 1320).
897
898 2008-03-08 Wakaba <wakaba@suika.fam.cx>
899
900 * HTML.pm.src: Remove strange |if| condition; however, it should
901 have had no harm in theory.
902
903 2008-03-08 Wakaba <wakaba@suika.fam.cx>
904
905 * HTML.pm.src (_tree_construction_main): '</p>' in body
906 case is split from other end tags for the preparation
907 of implementing HTML revision 1320.
908
909 2008-03-07 Wakaba <wakaba@suika.fam.cx>
910
911 * HTML.pm.src: Simplified "generate implied end tag" (HTML5
912 revision 1320).
913
914 2008-03-07 Wakaba <wakaba@suika.fam.cx>
915
916 * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
917 and "div" (HTML5 revision 1318). Add comments to where
918 |form| pointer association codes should be inserted (HTML5 revision
919 1319).
920
921 2008-03-06 Wakaba <wakaba@suika.fam.cx>
922
923 * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
924
925 2008-03-05 Wakaba <wakaba@suika.fam.cx>
926
927 * HTML.pm.src: Since the case t268 should never be reached (no
928 other token type, there are), it is replaced by a |die| statement.
929
930 2008-03-05 Wakaba <wakaba@suika.fam.cx>
931
932 * HTML.pm.src: Typo fixed.
933
934 2008-03-04 Wakaba <wakaba@suika.fam.cx>
935
936 * HTML.pm.src (_tree_construction_initial): Some limited quirks
937 doctypes were not uppercased for comparison.
938
939 2008-03-03 Wakaba <wakaba@suika.fam.cx>
940
941 * HTML.pm.src (tree construction and set_inner_html): Checkpoints
942 are added.
943
944 2008-03-03 Wakaba <wakaba@suika.fam.cx>
945
946 * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
947 are set. Cases that are unlikely reached are noted as so.
948
949 2008-03-03 Wakaba <wakaba@suika.fam.cx>
950
951 * HTML.pm.src: Checkpoints for debugging are added.
952
953 * mkhtmlparser.pl: Support for |!!!cp| syntax.
954
955 2008-03-03 Wakaba <wakaba@suika.fam.cx>
956
957 * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
958 for simplicity.
959
960 2008-03-03 Wakaba <wakaba@suika.fam.cx>
961
962 * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
963 revision 1307).
964
965 2008-03-03 Wakaba <wakaba@suika.fam.cx>
966
967 * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
968 is set are changed (HTML5 revision 1305).
969
970 2008-03-02 Wakaba <wakaba@suika.fam.cx>
971
972 * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
973 1292). Entities are not parsed in comment-like part in RCDATA
974 elements (HTML5 revision 1294). Allow bare & at the end
975 of attribute value literals (HTML5 revision 1296). More
976 quirks mode doctypes (HTML5 revision 1302). Requires spaces
977 between attributes and ban attribute names or unquoted
978 attribute values containing single or double quotes (HTML5
979 revision 1303).
980
981 2008-03-02 Wakaba <wakaba@suika.fam.cx>
982
983 * ContentChecker.pm: Typo fixed. Don't raise "character encoding"
984 and related errors unless it is an HTML document (though the spec
985 is unclear on whether it is applied to XHTML document).
986
987 * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
988 are added.
989
990 2008-03-02 Wakaba <wakaba@suika.fam.cx>
991
992 * HTML.pm.src: s/local_name/manakai_local_name/g.
993
994 2008-03-01 Wakaba <wakaba@suika.fam.cx>
995
996 * _NamedEntityList.pm: Updated (HTML5 revision 1286).
997
998 * HTML.pm.src: |charset| in |content| attribute is
999 case-insensitive (HTML5 revision 1270).
1000
1001 2008-02-26 Wakaba <wakaba@suika.fam.cx>
1002
1003 * ContentChecker.pm: New status constants are added.
1004 ($ElementDefault): |status| added.
1005 (check_element): Err for non-standard or deprecated elements.
1006 (_attr_status_info): For non-standard or deprecated attributes.
1007
1008 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1009
1010 * ContentChecker.pm (_attr_status_info): New internal method.
1011
1012 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1013
1014 * ContentChecker.pm (check_element): Element standardized
1015 status information is now dispatched.
1016
1017 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1018
1019 * ContentChecker.pm (check_element): Fix |del|-and-significant
1020 problem by adding some more arguments.
1021
1022 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1023
1024 * ContentChecker.pm (check_element): Use context of
1025 container-for-the-purpose-of-content-model element (not transparent
1026 element) for |check_child_element| calling and significant
1027 text flag marking. This reintroduces |<del>|-and-significant
1028 problem again.
1029
1030 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1031
1032 * ContentChecker.pm (check_element): Make semi-transparent
1033 elements ignored for the purpose of phase changes in
1034 content model checking.
1035
1036 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1037
1038 * ContentChecker.pm (check_element): In-element state
1039 was not properly managed for transparent cases.
1040
1041 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1042
1043 * ContentChecker.pm (check_element): Support for |video|
1044 and |audio| as semi-transparent elements.
1045
1046 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1047
1048 * ContentChecker.pm ($HTMLSemiTransparentElements): New.
1049 (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
1050 Support for |html:object| as a semi-transparent element.
1051
1052 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1053
1054 * ContentChecker.pm (check_element): The way to traverse
1055 the tree is entirely revised to make it easier to track
1056 the state of ancestors/descendants. As a result of this
1057 revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1058 support for content model checking for HTML elements |figure|,
1059 |object|, |video|, and |audio| and checking for XML elements (and
1060 some XMLNS checkings) are dropped for now. They will be
1061 reimplemented in due cource.
1062
1063 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1064
1065 * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1066 token (HTML5 revision 1225).
1067
1068 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1069
1070 * ContentChecker.pm ({unsupported_level}): New value.
1071
1072 * HTML.pm.src: Save whether |meta| |content| attribute
1073 contains character references or not.
1074
1075 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1076
1077 * ContentChecker.pm (_get_children): (Incompleted) attempt to
1078 imlement significant content checking for contents
1079 with |del| elements.
1080
1081 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1082
1083 * ContenteChecker.pm ($HTMLTransparentElements): More
1084 elements are added.
1085 (_get_children): HTML |object| elements are now semi-transparent.
1086
1087 * NanoDOM.pm (manakai_html, manakai_head): New methods.
1088
1089 2008-02-16 Wakaba <wakaba@suika.fam.cx>
1090
1091 * CacheManifest.pm: HTML5 revision 1211 implemented.
1092
1093 * CacheManifest.pod: Updated.
1094
1095 2008-02-10 Wakaba <wakaba@suika.fam.cx>
1096
1097 * ContentChecker.pm (check_document, check_element): Support
1098 for second argument ($onsubdoc).
1099 (_get_css_parser): Removed (now it is part of WDCC).
1100
1101 2008-02-09 Wakaba <wakaba@suika.fam.cx>
1102
1103 * ContentChecker.pm (_get_css_parser): New.
1104
1105 2007-11-25 Wakaba <wakaba@suika.fam.cx>
1106
1107 * ContentChecker.pm ($AnyChecker): Old way to add child elements
1108 for checking had been used.
1109
1110 2007-11-25 Wakaba <wakaba@suika.fam.cx>
1111
1112 * ContentChecker.pm (check_element): New todo item type |descendant|.
1113
1114 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1115
1116 * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1117 specced in RFC 2046 and RFC 4288.
1118 (application/atom+xml): Definition added.
1119
1120 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1121
1122 * URIChecker.pm: Make RFC 3986 should-level errors
1123 warnings (rather than SHOULD-level errors).
1124
1125 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1126
1127 * NanoDOM.pm (get_user_data, set_user_data): New methods.
1128
1129 * HTML.pm.src: A flag for character references in attribute
1130 values are added. Set |manakai_has_reference| user data
1131 to |charset| attribute.
1132
1133 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1134
1135 * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1136 attributes.
1137
1138 * ContentChecker.pm (check_document): Warn if charset requirements
1139 cannot be tested.
1140
1141 2007-11-19 Wakaba <wakaba@suika.fam.cx>
1142
1143 * HTML.pm.src (parse_byte_string): Detect charset
1144 by universalchardet if charset parameter is not specified.
1145
1146 * Makefile (Charset-all, Charset-clean): New rules.
1147
1148 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1149
1150 * ContentChecker.pm (check_document): Check the existence
1151 of character encoding declaration and selection of encoding
1152 for HTML document.
1153
1154 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1155
1156 * ContentType.pm (get_sniffed_type): Return also the official
1157 type in list context.
1158
1159 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1160
1161 * ContentType.pm: Sniffing with leading white space
1162 ignoring (HTML5 revisions 1013 and 1016).
1163
1164 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1165
1166 * ContentType.pm: HTML5 revision 1013 changes, except for leading
1167 white spaces, are implemented.
1168
1169 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1170
1171 * HTML.pm.src (parse_char_string): Set |inner_encoding|
1172 attribute if possible.
1173
1174 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1175
1176 * HTML.pm.src (parse_byte_string): New method.
1177 (parse_char_string): New alias for |parse_string|.
1178 (main phase): Invoking "change the encoding" algorithm if desired.
1179
1180 * HTML.pod: Updated.
1181
1182 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1183
1184 * HTML.pod (get_inner_html): Removed.
1185
1186 * Makefile (HTML-all, HTML-clean): New.
1187
1188 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1189
1190 * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1191
1192 2007-11-08 Wakaba <wakaba@suika.fam.cx>
1193
1194 * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1195
1196 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1197
1198 * Makefile: |CacheManifest.html| is added.
1199
1200 * CacheManifest.pod: New file.
1201
1202 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1203
1204 * CacheManifest.pm: New module.
1205
1206 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1207
1208 * HTML.pm.src: Support for application cache selection algorithm
1209 callback.
1210
1211 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1212
1213 * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1214
1215 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1216
1217 * Makefile (clean): New rule.
1218
1219 * NanoDOM.pm (public_id, system_id): New attributes.
1220
1221 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1222
1223 * Makefile (CSS-all, CSS-clean, clean): New rules.
1224
1225 2007-10-14 Wakaba <wakaba@suika.fam.cx>
1226
1227 * ContentChecker.pm (check_document): Support for
1228 new |is_xml_root| flag.
1229 (check_element): Support for new |pluses| state.
1230 (_add_pluses): New method.
1231 (_remove_minuses): Support for new |minus| item.
1232
1233 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1234
1235 * ContentChecker.pm: Raise specific error for invalid
1236 root element.
1237
1238 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1239
1240 * ContentChecker.pm: Set level values for later uses.
1241
1242 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1243
1244 * ContentChecker.pm: Support for language tag validation.
1245
1246 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1247
1248 * LangTag.pm (check_rfc3066_language_tag): New method.
1249
1250 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1251
1252 * LangTag.pm: New module.
1253
1254 2007-09-04 Wakaba <wakaba@suika.fam.cx>
1255
1256 * HTML.pm.src: Some error types were wrong.
1257
1258 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1259
1260 * CSS/: New directory.
1261
1262 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1263
1264 * ContentChecker.pm (_check_get_children): Support
1265 for |noscript| in |head|.
1266
1267 2007-08-12 Wakaba <wakaba@suika.fam.cx>
1268
1269 * URI/: New directory.
1270
1271 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1272
1273 * HTML.pm.src: Tokenizer's states are now represented in
1274 number.
1275
1276 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1277
1278 * HTML.pm.src: |or|s for insertion modes are replaced
1279 by |&|s.
1280
1281 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1282
1283 * HTML.pm.src: Token types are now represented in number.
1284
1285 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1286
1287 * ContentType.pm (SEE ALSO): Updated.
1288
1289 * HTML.pm.src: Insertion modes are now represented in number.
1290
1291 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1292
1293 * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1294 999) is implemented.
1295
1296 2007-08-08 Wakaba <wakaba@suika.fam.cx>
1297
1298 * ContentType.pm: Sniffing for |<script| (HTML5 revision
1299 983) is implemented.
1300
1301 2007-08-06 Wakaba <wakaba@suika.fam.cx>
1302
1303 * ContentChecker.pod: New documentation.
1304
1305 * Makefile: A rule for |ContentChecker.html| is added.
1306
1307 * ContentChecker.pm: A pod "LICENSE" section is added.
1308
1309 * NanoDOM.pm ($VERSION): New variable.
1310
1311 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1312
1313 * H2H.pm: |b|, |i|, and |sub| are added to the
1314 list of allowed HTML elements.
1315
1316 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1317
1318 * H2H.pm: |samp| is added to the list of allowed
1319 HTML elements.
1320
1321 * URIChecker.pm (check_iri): New.
1322 (check_iri_reference): Error type for IRI reference
1323 syntax error is changed.
1324
1325 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1326
1327 * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1328 is added.
1329 (check_document): Load appropriate module before validation.
1330
1331 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1332
1333 * ContentChecker/: New directory.
1334
1335 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1336
1337 * ContentChecker.pm: HTML |time| element is implemented.
1338
1339 * HTMLTable.pm: Comments are updated as HTML5 is revised.
1340
1341 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1342
1343 * ContentChecker.pm (check_document): Return value
1344 even if no document element is found.
1345
1346 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1347
1348 * HTML.pm.src: |$in_body| is no longer a function.
1349
1350 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1351
1352 * HTML.pm.src: The |$in_body| code has been moved down.
1353
1354 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1355
1356 * HTML.pm.src: The "trailing end" insertion mode
1357 is split into "after html body" and "after html frameset"
1358 insertion modes. Their codes are merged with "after body"
1359 and "after frameset" codes. |$previous_insertion_mode|
1360 has been removed. "after frameset" code is
1361 merged with "in frameset" code.
1362
1363 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1364
1365 * HTML.pm.src: The "before head" insertion mode is
1366 merged with the "in head" insertion mode.
1367
1368 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1369
1370 * HTML.pm.src: Cases in "in head" insertion mode are
1371 reorganized.
1372
1373 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1374
1375 * HTML.pm.src: Some cases in "in table" insertion mode
1376 are merged.
1377
1378 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1379
1380 * HTML.pm.src: The "in row" insertion mode is merged
1381 with "in table" insertion mode.
1382
1383 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1384
1385 * HTML.pm.src: The "in table" and "in table body" insertion
1386 modes are merged.
1387
1388 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1389
1390 * HTML.pm.src: There is no "in table head" or "in table foot"
1391 insertion mode!
1392
1393 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1394
1395 * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
1396 now directly invoke the handler.
1397
1398 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1399
1400 * HTML.pm.src: Codes for "in cell" insertion mode
1401 is merged to the "in body" insertion mode code.
1402
1403 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1404
1405 * HTML.pm.src: Codes for "in body" and "in caption"
1406 insertion modes are merged.
1407
1408 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1409
1410 * HTML.pm.src: Two |!!!next-token|s were missing.
1411
1412 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1413
1414 * HTML.pm.src: Use numeric constant for |{content_mode}|
1415 instead of string constant for |{content_model_flag}|.
1416
1417 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1418
1419 * HTML.pm.src: Add the name of the attribute
1420 to the "duplicate attribute" error.
1421
1422 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1423
1424 * ContentChecker.pm: Return the |class| node list.
1425
1426 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1427
1428 * ContentChecker.pm: Return the |id| node list.
1429
1430 * HTML.pm.src: A typo is fixed.
1431
1432 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1433
1434 * ContentChecker.pm: Drop wrong |level => 'error'| specification
1435 from "in HTML:xml:lang" error. Character position
1436 is now the last part of the error type in the URI error
1437 description. Report "unsupported" status for language
1438 tags, media queries, script codes, and style sheets.
1439
1440 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1441
1442 * ContentChecker.pm: Report error if |xml:lang|
1443 in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
1444 in XML.
1445
1446 * NanoDOM.pm (Attr.owner_document): New attribute.
1447
1448 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1449
1450 * HTML.pm.src: The character immediately following
1451 a bare |hcro| was discarded. Fix handling of
1452 entity references in attribute values.
1453
1454 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1455
1456 * HTML.pm.src (main and trailin end phases): Token
1457 types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
1458 factored out. Error types |in html:#DOCTYPE|
1459 and |after html:#DOCTYPE| are merged
1460 into |DOCTYPE in the middle|. |</frameset>|
1461 in fragment parsing mode changed the
1462 insertion mode.
1463
1464 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1465
1466 * HTML.pm.src: |$phase| has been removed; The |trailing end|
1467 phase is now an insertion mode. Treatments for white
1468 space character tokens were incorrect for some
1469 insertion modes. An old |meta| case was not removed.
1470
1471 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1472
1473 * HTML.pm.src: |meta| charset declaration extraction
1474 implemented (but changing the encoding is not yet:-).
1475
1476 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1477
1478 * Charset/: New directory.
1479
1480 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1481
1482 * H2H.pm: New Perl module (created from
1483 manakai's H2H.dis).
1484
1485 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1486
1487 * XMLSerializer.pm: New Perl module (created from
1488 manakai's SimpleLS.dis).
1489
1490 2007-07-07 Wakaba <wakaba@suika.fam.cx>
1491
1492 * HTML.pm.src: |<!---x-->| was not processed correctly.
1493
1494 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1495
1496 * HTML.pm.src: Report correct error message
1497 for |<body></div></body>|.
1498
1499 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1500
1501 * HTMLTable.pm: An error description was incorrect.
1502
1503 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1504
1505 * ContentChecker.pm: Return |{term}| list.
1506
1507 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1508
1509 * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
1510 nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
1511 and <title> outside of head).
1512
1513 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1514
1515 * IMTChecker.pm: Report warning for unregistered
1516 and private types/subtypes.
1517
1518 * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
1519 URIChecker.pm, HTMLTable.pm: Error messages are now
1520 consistent; they are all listed in
1521 <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
1522
1523 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1524
1525 * ContentChecker.pm: |<img ismap>| not in |<a></a>|
1526 is now erred. |<datalist>| is implemented.
1527 Attribute checker for |<command>| and |<menu>| are
1528 added. Support for |contextmenu| global attribute
1529 is added.
1530
1531 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1532
1533 * HTML.pm.src (_reset_insertion_mode): Interpretation
1534 of Step 3 has been changed.
1535
1536 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1537
1538 * HTML.pm.src: Late |<html>| parse error is implemented.
1539
1540 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1541
1542 * URIChecker.pm (check_iri_reference): A |decode| method name was
1543 incorrect.
1544
1545 * ContentChecker.pm: Support for the |footer| element.
1546 Check URI syntax for space-separated URI attributes.
1547 Support for the |tabindex| attribute. Support
1548 for |datetime| attribute.
1549
1550 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1551
1552 * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
1553 character references). HTML5 revision 1.146 (white space
1554 characters before root start tag). HTML5 revision
1555 1.148 (named character references in attribute values).
1556 HTML5 revision 1.152 (<plaintext>.innerHTML get).
1557
1558 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1559
1560 * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
1561 in <head>).
1562
1563 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1564
1565 * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>
1566 in body).
1567
1568 * ContentChecker.pm: HTML5 revision 938 (scoped="").
1569
1570 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1571
1572 * HTML.pm.src: HTML5 revision 923 (matching end tag in CDATA
1573 or RCDATA in fragment parsing mode). HTML5 revision
1574 924 (<!--> and <!--->). HTML5 revision 926 (hn in hn).
1575
1576 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1577
1578 * HTML.pm.src (get_inner_html): HTML5 revision 922 (inner_html
1579 for <pre> and <textarea>).
1580
1581 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1582
1583 * HTML.pm.src: HTML5 revision 920 (<isindex>).
1584
1585 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1586
1587 * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).
1588
1589 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1590
1591 * HTML.pm.src: HTML5 revision 916 (</body>).
1592 HTML5 revision 917 (conforming bare &).
1593
1594 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1595
1596 * NanoDOM.pm (manakai_is_html): Setting to false did not work.
1597
1598 * HTML.pm.src: HTML5 revision 914 (</ in CDATA, RCDATA).
1599 HTML5 revision 915 (<nobr>).
1600
1601 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1602
1603 * HTML.pm.src: HTML5 revisions 908, 909, 912, and 913 (quirks mode).
1604
1605 * NanoDOM.pm (manakai_is_html, manakai_compat_mode, compat_mode):
1606 New attributes.
1607
1608 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1609
1610 * HTML.pm.src: HTML5 revisions 900, 901, 902, and 911 (<
1611 in tags).
1612
1613 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1614
1615 * .cvsignore: |Entities.html| is added.
1616
1617 * HTML.pm.src: |$entity_char| is removed and
1618 requires |Whatpm::_NamedEntityList| instead.
1619 HTML5 revision 898 (refc), except that lack of refc
1620 is parse error.
1621
1622 * mkentitylist.pl: New script.
1623
1624 * Makefile (all): |_NamedEntityList.pm| is added.
1625 (_NamedEntityList.pm, Entities.html): New rules.
1626
1627 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1628
1629 * HTML.pm.src: Parse errors immediately after U+000D
1630 were ignored and U+000D immediately following another
1631 U+000D was not converted to U+000A.
1632
1633 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1634
1635 * HTML.pm.src (set_inner_html): HTML5 revision 892 (adopt
1636 nodes before appended). Parser was not ready for NULL
1637 parse error and escape flag.
1638
1639 * NanoDOM.pm (adopt_node): New.
1640
1641 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1642
1643 * HTML.pm.src: HTML5 revision 886 (insane comment in
1644 CDATA and RCDATA). Note that current implementation
1645 is simply repeating what the spec says and it is maybe not
1646 a best way to do it.
1647
1648 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1649
1650 * HTML.pm.src: HTML5 revision 884 (</form> don't close
1651 the form element if a descendant element without implied end tag has
1652 still been open).
1653
1654 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1655
1656 * ContentChecker.pm: HTML5 revision 881 (Make |id| attribute with
1657 space characters non-conforming).
1658
1659 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1660
1661 * HTML.pm.src: An error message was incorrect.
1662 HTML5 revision 869 (C1 character references).
1663
1664 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1665
1666 * HTML.pm.src: HTML5 revision 867 (a LF at the beginning of
1667 a |textarea| is removed).
1668
1669 2007-06-05 Wakaba <wakaba@suika.fam.cx>
1670
1671 * NanoDOM.pm (get_attribute_node_ns): New method.
1672
1673 * ContentChecker.pm: |script| |async| and |defer|
1674 no longer require |src|. |async| MUST NOT be
1675 specified if |defer|. (HTML5 revision 858).
1676
1677 2007-05-30 Wakaba <wakaba@suika.fam.cx>
1678
1679 * HTML.pm.src: |<form><form>| went to inifinite loop.
1680
1681 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1682
1683 * ContentChecker.pm (html): Set |is_root| (allowed
1684 as a document element) flag on.
1685 (new): Removed.
1686 (check_document): New method.
1687
1688 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1689
1690 * ContentChecker.pm (thead, tfoot): Checker specifications
1691 were incorrect.
1692
1693 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1694
1695 * ContentChecker.pm ($HTMLURIAttrChecker): Include
1696 error position in the |type| option of the error.
1697
1698 * HTMLTable.pm (form_table): The |$onerror| parameter
1699 is now optional. Some bugs are fixed.
1700
1701 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1702
1703 * HTMLTable.pm: New module.
1704
1705 * ContentChecker.pm (table): Invoke table model error checker.
1706
1707 * NanoDOM.pm (first_child, get_attribute_ns): New.
1708
1709 2007-05-26 Wakaba <wakaba@suika.fam.cx>
1710
1711 * ContentChecker.pm ($HTMLLinkTypesAttrChecker): New checker.
1712 (link/@rel, a/@rel, area/@rel): Use new checker.
1713
1714 * Makefile (_LinkTypeList.pm, RelExtensions.html): New rules.
1715
1716 * _LinkTypeList.pm: New file.
1717
1718 * mklinktypelist.pl: New file.
1719
1720 * .cvsignore: |RelExtensions.html| added.
1721
1722 * NanoDOM.pm (child_nodes): Returns an empty array
1723 for non-child-containing node types.
1724 (text_content): New attribute.
1725
1726 2007-05-26 Wakaba <wakaba@suika.fam.cx>
1727
1728 * IMTChecker.pm: New module.
1729
1730 * ContentChecker.pm ($HTMLIMTAttrChecker): Call IMTChecker
1731 to test parameter value validity.
1732
1733 * HTML.pm.src ($style_start_tag): Attributes were
1734 discarded.
1735
1736 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1737
1738 * ContentChecker.pm ($HTMLURIAttrChecker): Implemented.
1739
1740 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1741
1742 * URIChecker.pm: All recommendations from RFC 3986
1743 and RFC 3987 are listed (not all testable items are checked yet).
1744
1745 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1746
1747 * URIChecker.pm: New module.
1748
1749 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1750
1751 * ContentChecker.pm: Now most attributes are implemented
1752 or associated to some placeholder.
1753 ($ElementDefault): Warn unknown attributes
1754 for unknown elements as "attribute not supported".
1755 ($HTMLLanguageTagAttrChecker, $HTMLMQAttrChecker): New placeholders.
1756 ($HTMLUsemapAttrChecker, $HTMLTargetAttrChecker): New checkers.
1757 (|a| attribute checker): Reimplemented.
1758
1759 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1760
1761 * ContentChecker.pm ($HTMLEventHandlerAttrChecker): New placeholder.
1762 ($HTMLAttrChecker): Event handler content attributes
1763 are added.
1764 (link, embed): Required attribute is now checked.
1765 (embed): Unknown local attributes are no longer warned.
1766
1767 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1768
1769 * ContentChecker.pm ($HTMLSpaceURIsAttrChecker): New placeholder.
1770 ($HTMLIMTAttrChecker): New checker.
1771 (link@rel, link@href, link@type, style@type,
1772 a@href, a@ping, a@ping, a@type, embed@src, embed@type,
1773 object@data, object@type, source@src, source@type, area@alt,
1774 area@shape, area@coords, area@href,
1775 area@ping, area@rel, area@type, script@src,
1776 script@defer, script@async, script@type): Checkers added.
1777
1778 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1779
1780 * ContentChecker.pm: Descendant checking was incorrect.
1781
1782 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1783
1784 * ContentChecker.pm: Support |xml:*| and |xmlns:*|
1785 attributes. Report an error if |Element.prefix|
1786 is |xmlns|.
1787
1788 * NanoDOM.pm (prefix): New attribute.
1789
1790 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1791
1792 * HTML.pm.src: In |main| phase, |in body| insertion
1793 mode, action for |<iframe>| was missing.
1794
1795 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1796
1797 * ContentChecker.pm: Support for many of HTML5 elements.
1798 ($GetHTMLNonNegativeIntegerAttrChecker): New.
1799
1800 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1801
1802 * ContentChecker.pm: Support for most elements up to |progress|.
1803 ($HTMLURIAttrChecker): Placeholder.
1804 ($HTMLIntegerAttrChecker, $GetHTMLFloatingPointNumberAttrChecker):
1805 New.
1806
1807 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1808
1809 * ContentChecker.pm: Attribute checkers
1810 for global attributes, |html|, |base|, |style|, and |meta|.
1811
1812 * NanoDOM.pm (insert_before): Weaken reference
1813 to the parent node.
1814 (Attr::new): Set |owner_element| attribute.
1815 (namespace_uri, manakai_local_name): New attribute implementations.
1816 (owner_element): New attribute.
1817
1818 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1819
1820 * ContentChecker.pm ($AttrChecker, $HTMLAttrChecker,
1821 $AnyChecker->{attr_checker}, $HTMLAttrsChecker,
1822 $Element->{$HTML_NS}->{''}): New.
1823 (check_element): Invoke attrs_checker for each element.
1824
1825 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1826
1827 * ContentChecker.pm: Don't use |manakai_element_type_match|.
1828
1829 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1830
1831 * ContentChecker.pm: Use hashs rather than lists for
1832 element type testings.
1833
1834 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1835
1836 * ContentChecker.pm: Don't generate duplicate
1837 error when an element type is put in the "minus" list
1838 and the element type is not allowed explicitly in the particular
1839 element content model.
1840 (html:a checker): New checker.
1841 (html:details, html:datagrid): New checkers.
1842 (html:legend): New checker.
1843
1844 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1845
1846 * ContentChecker.pm (html:li checker): Implemented.
1847
1848 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1849
1850 * ContentChecker.pm ($HTMLInlineOrStriclyInlineChecker): New
1851 checker.
1852 (html:dd checker): New checker.
1853 (html:q, html:em, html:strong, html:small,
1854 html:m, html:dfn, html:code, html:samp, html:span): New checkers.
1855
1856 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1857
1858 * ContentChecker.pm ($AnyChecker): Renamed
1859 from |$ElementDefault->{checker}|.
1860 ($ElementDefault->{checker}): Throw an error that
1861 the element type is not supported by the checker.
1862 ($HTMLMetadataElement): |html:base| was missing.
1863 ($HTMLEmptyChecker): Don't throw an error
1864 for inter-element whitespace nodes.
1865 (html:html checker): Errors were not
1866 thrown even if |html:head| and/or |html:body|
1867 children were missing.
1868 (html:head checker): An error was not
1869 thrown if <meta charset> appered after other
1870 elements.
1871
1872 2007-05-05 Wakaba <wakaba@suika.fam.cx>
1873
1874 * ContentChecker.pm: |footer|, |video|, |audio|, |script|,
1875 and |noscript| elements are implemented.
1876 (new): New method.
1877
1878 2007-05-04 Wakaba <wakaba@suika.fam.cx>
1879
1880 * ContentChecker.pm: New module.
1881
1882 2007-05-04 Wakaba <wakaba@suika.fam.cx>
1883
1884 * NanoDOM.pm (manakai_parent_element,
1885 document_element, manakai_local_name,
1886 manakai_element_type_match): New method.
1887
1888 2007-05-03 Wakaba <wakaba@suika.fam.cx>
1889
1890 * HTML.pm.src: Replace decimal and hexadecimal numeric
1891 entities in C1 range using Windows-1252 mapping. Bare LF
1892 did not count as new line for error reporting.
1893
1894 2007-05-02 Wakaba <wakaba@suika.fam.cx>
1895
1896 * NanoDOM.pm (DOMImplementation): New class.
1897 (append_child): Weaken the |parent_node| reference.
1898 (create_element_ns, Element new): Set the |owner_document|
1899 reference.
1900 (implementation): New attribute.
1901 (owner_document, local_name, namespace_uri): New attributes.
1902
1903 * HTML.pm.src (parse_string): Line and column numbers
1904 are now provided to error handler.
1905 (!!!parse-error): Short descriptions are added.
1906 (_construct_tree): Split into three methods; support
1907 for innerHTML mode.
1908 (set_inner_html): New method.
1909
1910 2007-05-01 Wakaba <wakaba@suika.fam.cx>
1911
1912 * NanoDOM.html: Documentation is added.
1913
1914 * HTML.pod, ContentType.html: Documentation is revised.
1915
1916 * .cvsignore: Pod2html temporary files are added.
1917
1918 * Makefile: Make |NanoDOM.html|.
1919
1920 2007-05-01 Wakaba <wakaba@suika.fam.cx>
1921
1922 * HTML.pm.src (parse_string): New method.
1923 (get_inner_html): Renamed from |inner_html|.
1924
1925 * Makefile: A rule for |HTML.html| is added.
1926
1927 * HTML.pod: New documentation.
1928
1929 2007-05-01 Wakaba <wakaba@suika.fam.cx>
1930
1931 * NanoDOM.pm (last_child, previous_sibling): New attributes.
1932 (clone_node): Attribute nodes were not completely copied.
1933
1934 * HTML.pm.src: Many bugs are fixed.
1935
1936 2007-04-30 Wakaba <wakaba@suika.fam.cx>
1937
1938 * HTML.pm.src: Some typos are fixed.
1939
1940 2007-04-30 Wakaba <wakaba@suika.fam.cx>
1941
1942 * mkhtmlparser.pl, Makefile: References to the |HTML-consume-entity.src|
1943 are removed.
1944
1945 * HTML.pm.src: Tokenizer's handling on named entities are rewritten.
1946
1947 * HTML-consume-entity.src: Removed.
1948
1949 2007-04-30 Wakaba <wakaba@suika.fam.cx>
1950
1951 * HTML.pm.src: Tokenizer's handling on hexadecimal
1952 numeric entities are rewritten.
1953
1954 2007-04-30 Wakaba <wakaba@suika.fam.cx>
1955
1956 * HTML.pm.src: Some tokenizer bugs are fixed.
1957
1958 2007-04-30 Wakaba <wakaba@suika.fam.cx>
1959
1960 * HTML.pm.src: The tree construction stage is implemented.
1961
1962 * mkhtmlparser.pl: New macros are added.
1963
1964 2007-04-28 Wakaba <wakaba@suika.fam.cx>
1965
1966 * ContentType.pm: A note on bug in the specification
1967 is removed since it's been now fixed.
1968
1969 * .cvsignore: New file.
1970
1971 2007-04-28 Wakaba <wakaba@suika.fam.cx>
1972
1973 * HTML.pm.src, HTML-consume-entity.src: New files.
1974
1975 * Makefile (HTML.pm): New rule.
1976
1977 * mkhtmlparser.pl: New script.
1978
1979 2007-04-25 Wakaba <wakaba@suika.fam.cx>
1980
1981 * Makefile: New file.
1982
1983 2007-04-24 Wakaba <wakaba@suika.fam.cx>
1984
1985 * ContentType.pm: An error in pod is fixed.
1986
1987 2007-04-24 Wakaba <wakaba@suika.fam.cx>
1988
1989 * ContentType.pm: Documentation is added.
1990
1991 2007-04-24 Wakaba <wakaba@suika.fam.cx>
1992
1993 * ContentType.pm: New Perl module.
1994
1995 * ChangeLog: New file.
1996
1997

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24