/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Contents of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.404 - (show annotations) (download)
Sun Sep 6 23:32:06 2009 UTC (15 years, 2 months ago) by wakaba
Branch: MAIN
CVS Tags: HEAD
Changes since 1.403: +5 -0 lines
++ whatpm/t/ChangeLog	6 Sep 2009 23:31:19 -0000
2009-09-07  Wakaba  <wakaba@suika.fam.cx>

	* tree-test-1.dat: Added new test data on obsolete permitted
	DOCTYPEs (HTML5 revision 3378).

++ whatpm/Whatpm/ChangeLog	6 Sep 2009 23:31:49 -0000
2009-09-07  Wakaba  <wakaba@suika.fam.cx>

	* HTML.pm.src (_tree_construction_initial): Implemented "obsolete
	permitted DOCTYPEs" (HTML5 revision 3378).

1 2009-09-07 Wakaba <wakaba@suika.fam.cx>
2
3 * HTML.pm.src (_tree_construction_initial): Implemented "obsolete
4 permitted DOCTYPEs" (HTML5 revision 3378).
5
6 2009-09-06 Wakaba <wakaba@suika.fam.cx>
7
8 * HTML.pm.src: Implemented the remaining parts of the
9 "frameset-ok" flag (HTML5 revision 2703 and HTML5 revision 2987).
10
11 2009-09-06 Wakaba <wakaba@suika.fam.cx>
12
13 * HTML.pm.src: Fixed typos on |frameset-ok| implementation.
14
15 2009-09-06 Wakaba <wakaba@suika.fam.cx>
16
17 * HTML.pm.src: The first cut to implement the |frameset-ok| flag
18 concept (HTML5 revision 2703).
19
20 2009-09-06 Wakaba <wakaba@suika.fam.cx>
21
22 * HTML.pm.src: |<hr/>| was incorrectly marked as an error.
23
24 2009-09-06 Wakaba <wakaba@suika.fam.cx>
25
26 * HTML.pm.src: Changed the handling of </html> end tag of "in
27 body" insertion mode according to the latest spec definition.
28
29 2009-09-06 Wakaba <wakaba@suika.fam.cx>
30
31 * HTML.pm.src: Rewrote end tag handling in |head| area (cf. HTML5
32 revision 2730, but it was entirely broken, maybe I missed some
33 spec changes before rev.2730).
34
35 2009-09-06 Wakaba <wakaba@suika.fam.cx>
36
37 * HTML.pm.src: Added the |hgroup| element (HTML5 revision 3039 and
38 HTML5 revision 3040).
39
40 2009-09-06 Wakaba <wakaba@suika.fam.cx>
41
42 * HTML.pm.src: Added |xmp| to the list of Special elements (HTML5
43 revision 3689). It should make no difference since the |xmp|
44 element has the PCDATA content and cannot be the non-bottommost
45 element in the stack of open elements.
46
47 2009-09-06 Wakaba <wakaba@suika.fam.cx>
48
49 * HTML.pm.src: Set the |confidence| flag to |irrlevant| when the
50 fragment parsing algorithm is invoked (HTML5 revision 2699).
51
52 2009-09-06 Wakaba <wakaba@suika.fam.cx>
53
54 * HTML.pm.src: Simplified foster parenting (HTML5 revision 2732).
55
56 2009-09-06 Wakaba <wakaba@suika.fam.cx>
57
58 * HTML.pm.src: No longer does foster parenting |style|, |script|,
59 and |input type=hidden| elements (HTML5 revision 2988).
60
61 2009-09-06 Wakaba <wakaba@suika.fam.cx>
62
63 * HTML.pm.src: Dropped the |eventsource| element (HTML5 revision
64 2870).
65
66 2009-09-06 Wakaba <wakaba@suika.fam.cx>
67
68 * HTML.pm.src (_tree_construction_main): Treate |keygen| as a void
69 element (HTML5 revision 2960).
70
71 2009-09-05 Wakaba <wakaba@suika.fam.cx>
72
73 * HTML.pm.src: Moved the "in table text" insertion mode code up
74 such that it does also work as specced in HTML5 in the remaining
75 cases (HTML5 revision 3382).
76
77 2009-09-05 Wakaba <wakaba@suika.fam.cx>
78
79 * HTML.pm.src: Implementing the "in table text" insertion mode,
80 first half (HTML5 revision 3382).
81
82 2009-08-30 Wakaba <wakaba@suika.fam.cx>
83
84 * ContentChecker.pm: Allow |xml:lang| in HTML document if the
85 owner element is not an HTML element (HTML5 revision 3697).
86
87 2009-08-23 Wakaba <wakaba@suika.fam.cx>
88
89 * ContentChecker.pm: Allow <input type=hidden> in context where
90 interactive content is disallowed (HTML5 revision 2392).
91
92 2009-08-16 Wakaba <wakaba@suika.fam.cx>
93
94 * HTML.pm.src: Dropped support for "XSLT-compat" and added
95 "about:legacy-compat" (HTML5 revision 2725).
96
97 2009-08-16 Wakaba <wakaba@suika.fam.cx>
98
99 * HTML.pm.src: "<!DOCTYPE>" is now explicitly supported in the
100 spec (HTML5 revision 2700).
101
102 2009-08-16 Wakaba <wakaba@suika.fam.cx>
103
104 * HTML.pm.src: Added support for <script> element in the "in
105 select" and "in select in table" insertion modes (HTML5 revision
106 2500).
107
108 2009-08-16 Wakaba <wakaba@suika.fam.cx>
109
110 * HTML.pm.src: An <hn> start tag in an <hn> element should close
111 the earlier element (HTML5 revision 2501).
112
113 2009-08-16 Wakaba <wakaba@suika.fam.cx>
114
115 * HTML.pm.src: The </br> end tags before the <body> start tag
116 should equally survive as the first br element child of the body
117 element (HTML5 revision 2586).
118
119 2009-08-16 Wakaba <wakaba@suika.fam.cx>
120
121 * HTML.pm.src: <font> in SVG land is handled as an SVG element as
122 long as it did not have HTML attributes (HTML5 revision 2590).
123
124 2009-08-16 Wakaba <wakaba@suika.fam.cx>
125
126 * HTML.pm.src (_tree_construction_main): Improved error reporting
127 on "select" start tag in "in select" and "in select in table"
128 parsing modes (c.f. HTML5 revision 3604).
129
130 2009-07-26 Wakaba <wakaba@suika.fam.cx>
131
132 * mkhtmlclass.pl: Made xmlns:xlink with wrong namespace URL
133 invalid (HTML5 revision 2701).
134
135 2009-07-26 Wakaba <wakaba@suika.fam.cx>
136
137 * HTML.pm.src: </rt>, </rp>, </option>, </optgroup> become
138 optional before </body> (HTML5 revision 2806).
139
140 2009-07-25 Wakaba <wakaba@suika.fam.cx>
141
142 * HTML.pm.src: Marked where SVG |script| execution should be
143 implemented (HTML5 revision 2904).
144
145 2009-07-25 Wakaba <wakaba@suika.fam.cx>
146
147 * HTML.pm.src (set_inner_html): The fragment parsing algorithm
148 inherits the document quirksness flag of the main document (HTML5
149 revision 2935).
150
151 2009-07-05 Wakaba <wakaba@suika.fam.cx>
152
153 * HTML.pm.src (_tree_construction_main): Put |table| in |p| in
154 quirks mode (The Hixie Quirk, HTML5 revision 3132).
155
156 2009-07-03 Wakaba <wakaba@suika.fam.cx>
157
158 * HTML.pm.src: <kegten> breaks out of <select> (HTML5 revision
159 3163).
160
161 2009-07-03 Wakaba <wakaba@suika.fam.cx>
162
163 * HTML.pm.src: Don't change encoding to UTF-8 if the input
164 encoding is UTF-16 (HTML5 revision 3204).
165
166 2009-07-03 Wakaba <wakaba@suika.fam.cx>
167
168 * HTML.pm.src (parse_byte_stream): Don't change to another
169 encoding if the input is UTF-16 (HTML5 revision 3205).
170
171 2009-06-28 Wakaba <wakaba@suika.fam.cx>
172
173 * HTML.pm.src: |isindex| should no longer generate |p| element
174 node (HTML5 revision 3236).
175
176 2008-12-12 Wakaba <wakaba@suika.fam.cx>
177
178 * ContentChecker.pm: Introduced new flag |has_autofocus|. Added
179 support for new |id_type| value of |any|, which means any kind of
180 element is allowed as a referenced element.
181
182 2008-12-12 Wakaba <wakaba@suika.fam.cx>
183
184 * ContentChecker.pm: Introduced |check_attrs2| method for the ease
185 of defining codes for checking required attribute and as such.
186
187 2008-12-12 Wakaba <wakaba@suika.fam.cx>
188
189 * IMTChecker.pm: Added more definitions for subtypes.
190
191 2008-12-11 Wakaba <wakaba@suika.fam.cx>
192
193 * URIChecker.pm: Some of |pos_end| values were wrong.
194
195 2008-12-06 Wakaba <wakaba@suika.fam.cx>
196
197 * ContentChecker.pm (check_element): Added support for "no
198 referenced datalist" error.
199
200 2008-12-06 Wakaba <wakaba@suika.fam.cx>
201
202 * URIChecker.pm: Bug fix: It did not work unless
203 Message::DOM::DOMImplementation has been |require|d.
204
205 2008-12-06 Wakaba <wakaba@suika.fam.cx>
206
207 * NanoDOM.pm (document_uri): New attribute.
208
209 * ContentChecker.pm: Don't use methods not implemented by NanoDOM.
210
211 2008-11-07 Wakaba <wakaba@suika.fam.cx>
212
213 * NanoDOM.pm (text_content): Don't create a Text node if the new
214 value is empty.
215
216 2008-11-06 Wakaba <wakaba@suika.fam.cx>
217
218 * SWML/: New directory.
219
220 2008-10-20 Wakaba <wakaba@suika.fam.cx>
221
222 * NanoDOM.pm (specified, all_declarations_processed,
223 manakai_attribute_type): New attributes.
224
225 2008-10-19 Wakaba <wakaba@suika.fam.cx>
226
227 * NanoDOM.pm (Entity->new): Initialize ->child_nodes as an empty
228 array.
229
230 2008-10-19 Wakaba <wakaba@suika.fam.cx>
231
232 * NanoDOM.pm (notation_name): New attribute.
233
234 2008-10-18 Wakaba <wakaba@suika.fam.cx>
235
236 * NanoDOM.pm (public_id, system_id): New attributes.a
237
238 2008-10-18 Wakaba <wakaba@suika.fam.cx>
239
240 * NanoDOM.pm (text_content): Moved to Node from Element. Setter
241 implemented.
242 (allowed_tokens, default_type, declared_type): Implemented.
243
244 2008-10-17 Wakaba <wakaba@suika.fam.cx>
245
246 * NanoDOM.pm (node_name): New attribute.
247 (ELEMENT_TYPE_DEFINITION_NODE, ATTRIBUTE_DEFINITION_NODE): New
248 constants.
249 (create_element_type_definition_node, create_attribute_definition,
250 create_notation, create_general_entity,
251 get_element_type_definition_node,
252 set_element_type_definition_node, get_general_entity_node,
253 set_general_entity_node, get_notation_node, set_notation_node,
254 get_attribute_definition_node, set_attribute_definition_node): New
255 methods.
256 (element_types, entities, notations, attribute_definitions): New
257 attributes.
258 (DocumentType): Support for child nodes, entities, notations, and
259 element types.
260 (Entity, Notation, ElementTypeDefinition, AttributeDefinition):
261 New classes.
262
263 * Dumper.pm: Support for general entities, notations, element type
264 definitions, and attribute definitions.
265
266 2008-10-15 Wakaba <wakaba@suika.fam.cx>
267
268 * NanoDOM.pm (create_processing_instruction): New method.
269 (xml_version, xml_encoding, xml_standalone): New attributes.
270 (ProcessingInstruction): New class.
271
272 2008-10-14 Wakaba <wakaba@suika.fam.cx>
273
274 * HTML.pm.src: Handling of end tags in the foreign content
275 insertion mode was partially wrong, because of wrong bit
276 operations.
277
278 2008-10-14 Wakaba <wakaba@suika.fam.cx>
279
280 * NanoDOM.pm (dom_config): New attribute (do nothing), for
281 Whatpm::XML::Parser support.
282
283 2008-10-14 Wakaba <wakaba@suika.fam.cx>
284
285 * Makefile: New rule to make HTML/Tokenizer.pm is added.
286
287 * HTML.pm.src: Tokenizer part moved to another file.
288
289 2008-10-13 Wakaba <wakaba@suika.fam.cx>
290
291 * HTML.pm.src: Merge |DT_EL| and |DD_EL| as |DTDD_EL|.
292
293 2008-10-13 Wakaba <wakaba@suika.fam.cx>
294
295 * HTML.pm.src: Element category constants redefined.
296
297 2008-10-13 Wakaba <wakaba@suika.fam.cx>
298
299 * HTML.pm.src: Steps for CDATA/RCDATA elements in tree
300 construction stage synced with the spec (HTML5 revisions 2139 and
301 2302).
302
303 2008-10-07 Wakaba <wakaba@suika.fam.cx>
304
305 * ContentChecker.pm: New error level "html5_fact" added, which
306 should be tentatively used until all of requirements are properly
307 specced as RFC 2119 "MUST" in HTML5.
308
309 2008-10-05 Wakaba <wakaba@suika.fam.cx>
310
311 * ContentChecker.pod: Note on internal flags is added.
312
313 2008-10-05 Wakaba <wakaba@suika.fam.cx>
314
315 * HTML.pm.src: An AAA bug fixed.
316
317 2008-10-04 Wakaba <wakaba@suika.fam.cx>
318
319 * HTML.pm.src: If another node is inserted by the parser, don't
320 reuse existing Text node to append a character (HTML5 revision
321 2124).
322
323 2008-10-04 Wakaba <wakaba@suika.fam.cx>
324
325 * HTML.pm.src: Support for <option> and <optgroup> in body (HTML5
326 revisions 1731 and 2128).
327
328 2008-10-04 Wakaba <wakaba@suika.fam.cx>
329
330 * HTML.pm.src: Make <foreignObject> scoping (HTML5 revision 1837).
331 Support for end tags of camelCase SVG elements were broken. A
332 wrong error type text fixed.
333
334 2008-10-04 Wakaba <wakaba@suika.fam.cx>
335
336 * HTML.pm.src: Drop redundant code (HTML5 revision 1731).
337
338 2008-10-04 Wakaba <wakaba@suika.fam.cx>
339
340 * HTML.pm.src: Support for new definition of |param| and |source|
341 start tag parsing (HTML5 revision 1731).
342
343 2008-10-04 Wakaba <wakaba@suika.fam.cx>
344
345 * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
346
347 2008-10-04 Wakaba <wakaba@suika.fam.cx>
348
349 * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
350 revisions 1731 and 1831).
351
352 2008-10-04 Wakaba <wakaba@suika.fam.cx>
353
354 * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
355 revisions 1731 and 1778). Support for the </sarcasm> end tag (HTML5
356 revision 1731).
357
358 2008-10-04 Wakaba <wakaba@suika.fam.cx>
359
360 * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
361 revision 1731). End tags of |option| and |optgroup| elements are
362 now optional (HTML5 revision 1731).
363
364 2008-10-04 Wakaba <wakaba@suika.fam.cx>
365
366 * HTML.pm.src: New "special" elements added to the list (HTML5
367 revision 1778). "strile" -> "strike".
368
369 2008-10-02 Wakaba <wakaba@suika.fam.cx>
370
371 * ContentType.pm (get_sniffed_type): Support for the "better"
372 content sniffing (HTML5 revision 1927). In a case the official
373 type was not returned when the method is invoked in the list
374 context.
375
376 2008-09-22 Wakaba <wakaba@suika.fam.cx>
377
378 * HTML.pm.src: Character references for non-space C0 characters,
379 including U+000B VT, DEL character, noncharacter code points, are
380 now converted to the U+FFFD character (cf. HTML5 revision 2138).
381
382 2008-09-21 Wakaba <wakaba@suika.fam.cx>
383
384 * ContentChecker.pm: |form=""| check support added.
385
386 2008-09-21 Wakaba <wakaba@suika.fam.cx>
387
388 * ContentChecker.pm: |contextmenu| validness is now checked using
389 |id| and |id_type| properties, and |menu| property is removed.
390
391 2008-09-21 Wakaba <wakaba@suika.fam.cx>
392
393 * ContentChecker.pm: Prepare for |form| |name| attribute's
394 duplication checking.
395
396 2008-09-21 Wakaba <wakaba@suika.fam.cx>
397
398 * HTML.pm.src (parse_byte_stream): Support (or non-support) for
399 unsupported charset="" parameter value (HTML5 revision 2131).
400
401 2008-09-20 Wakaba <wakaba@suika.fam.cx>
402
403 * HTML.pm.src: Reminding places where U+000B is allowed as a space
404 character is fixed (cf. HTML5 revision 1738).
405
406 * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
407 space characters (HTML5 revision 1738).
408
409 2008-09-20 Wakaba <wakaba@suika.fam.cx>
410
411 * HTML.pm.src: The "anything else" case for the "after after body"
412 insertion mode was not updated to swtich to the "in body"
413 insertion mode. U+000B is no longer a space character for the
414 purpose of tree construction phase (HTML5 revision 1738).
415
416 2008-09-20 Wakaba <wakaba@suika.fam.cx>
417
418 * HTML.pm.src: U+000B is no longer a space character (HTML5
419 revision 1738).
420
421 2008-09-20 Wakaba <wakaba@suika.fam.cx>
422
423 * ContentType.pm: 0x0B is no longer a space character (HTML5
424 revision 1738).
425
426 * HTML.pm.src: U+000B is no longer a space character for the
427 algorithm for extracting an encoding from a Content-Type (HTML5
428 revision 1738).
429
430 2008-09-20 Wakaba <wakaba@suika.fam.cx>
431
432 * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
433
434 2008-09-18 Wakaba <wakaba@suika.fam.cx>
435
436 * LangTag.pm: Add checks for remaining requirements from RFC 4646.
437
438 * mklangreg.pl: Sort 'Prefix' values by their length, to ease
439 matching.
440
441 2008-09-18 Wakaba <wakaba@suika.fam.cx>
442
443 * LangTag.pm: Warn for private use language subtags. Error level
444 typos fixed. Support for Suppress-Script field.
445
446 * mklangreg.pl: Support for dumping of nested structure.
447
448 2008-09-18 Wakaba <wakaba@suika.fam.cx>
449
450 * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
451 recommended case as per RFC 4646.
452
453 2008-09-18 Wakaba <wakaba@suika.fam.cx>
454
455 * LangTag.pm (check_rfc4646_langtag): New method.
456
457 2008-09-18 Wakaba <wakaba@suika.fam.cx>
458
459 * mklangreg.pl: New script.
460
461 * Makefile: Updated for creation of the module for language subtag
462 registry.
463
464 2008-09-16 Wakaba <wakaba@suika.fam.cx>
465
466 * Makefile: WebIDL.html added.
467
468 * WebIDL.pod: New documentation.
469
470 2008-09-16 Wakaba <wakaba@suika.fam.cx>
471
472 * WebIDL.pm: Checker's error types are redefined.
473
474 2008-09-16 Wakaba <wakaba@suika.fam.cx>
475
476 * WebIDL.pm: Parser's error types are redefined. Some forward
477 compatible parsing bugs are fixed. Some unreachable codes are
478 commented out.
479
480 2008-09-16 Wakaba <wakaba@suika.fam.cx>
481
482 * WebIDL.pm: Support for the reminding extended attributes are
483 added. It does not satisfy the definition that a forward
484 interface declaration has an extended attribute. It seems that
485 unless explicitly allowed multiple extended attributes with the
486 same name is not allowed, though it is not explicitly mentioned in
487 the spec.
488
489 2008-09-16 Wakaba <wakaba@suika.fam.cx>
490
491 * WebIDL.pm: Unescapes extended attribute names and extended
492 attribute identifiers. Preserve whether an extended attribute has
493 an argument list of not. Support for extended attributes:
494 Constructor, ExceptionConsts, IndexGetter, IndexSetter,
495 NameGetter, NameSetter, and Null.
496 (has_argument_list): New attribute.
497 (idl_text): Stringifies argument lists, if any, even if it is
498 empty.
499
500 2008-09-15 Wakaba <wakaba@suika.fam.cx>
501
502 * HTML.pm.src: New state |PCDATA_STATE|. Use an empty string for
503 |{s_kwd}| in DATA_STATE as default.
504
505 2008-09-15 Wakaba <wakaba@suika.fam.cx>
506
507 * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
508 by |{s_kwd}| in DATA_STATE.
509
510 2008-09-15 Wakaba <wakaba@suika.fam.cx>
511
512 * HTML.pm.src: Shorten keys.
513
514 2008-09-15 Wakaba <wakaba@suika.fam.cx>
515
516 * HTML.pm.src: Remove checking for control character, surrogate
517 pair, or noncharacter code points and non-Unicode code
518 points (they should be handled by Whatpm::Charset::UnicodeChecker).
519 (parse_char_stream): Support for the |$get_wrapper| argument and
520 character stream error handlers.
521
522 2008-09-15 Wakaba <wakaba@suika.fam.cx>
523
524 * ContentChecker.pm: Don't call |loda_ns_module|
525 for null-namespace elements/attributes.
526
527 * HTML.pm.src: Fact out $disallowed_control_chars
528 as a hash.
529
530 2008-09-14 Wakaba <wakaba@suika.fam.cx>
531
532 * HTML.pm.src: Regexp typo fixed. |{prev_char}|
533 and |{next_char}| initializations are moved to initialization
534 method. |{read_until}| now supports buffering. Sync |set_inner_html|
535 with |parse_char_stream|.
536
537 2008-09-14 Wakaba <wakaba@suika.fam.cx>
538
539 * HTML.pm.src (parse_char_stream): Make |set_next_char|
540 invoke |manakai_read_until|, not only |read|, where
541 possible, to decrease the number of |read| method calls.
542
543 * mkhtmlparser.pl: Related changes to the aforementioned
544 modification.
545
546 2008-09-14 Wakaba <wakaba@suika.fam.cx>
547
548 * HTML.pm.src: Use |read| instead of |getc|. |set_inner_html|
549 would report character error from now.
550
551 2008-09-14 Wakaba <wakaba@suika.fam.cx>
552
553 * HTML.pm.src: White-space-leaded non-white-space character
554 tokens in "before head insertion mode" was not
555 correctly handled.
556 (set_inner_html): Reimplemented using CharString decodehandle
557 class. Support for $get_wrapper argument. Support
558 for |{read_until}| feature.
559
560 2008-09-14 Wakaba <wakaba@suika.fam.cx>
561
562 * HTML.pm.src: Make a "bare ero" error for unknown
563 entities point the "&" character.
564
565 2008-09-14 Wakaba <wakaba@suika.fam.cx>
566
567 * HTML.pm.src: It turns out that U+FFFD don't have to
568 be added to the list of excluded characters.
569
570 2008-09-14 Wakaba <wakaba@suika.fam.cx>
571
572 * HTML.pm.src ($char_onerror): Have character decoder's |line|
573 and |column| a higher priority than the one set by the
574 tokenizer's input handler.
575 ($self->{read_until}): Exclude U+FFFD (but this might
576 not be necessary, since now we do line/column fixup in
577 the character decode handle).
578
579 2008-09-14 Wakaba <wakaba@suika.fam.cx>
580
581 * HTML.pm.src: Use |{read_until}| where possible.
582
583 2008-09-14 Wakaba <wakaba@suika.fam.cx>
584
585 * HTML.pm.src: Change |{getc_until}| to |{read_until}|
586 and |manakai_getc_until| to |manakai_read_until| to
587 reduce the number of string copies.
588
589 2008-09-14 Wakaba <wakaba@suika.fam.cx>
590
591 * HTML.pm.src (parse_char_string): Use newly created
592 |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
593 standard feature to |open| a string as a filehandle,
594 since Perl's string filehandle seems not supporting |ungetc|
595 method correctly.
596 (parse_char_stream): Define |{getc_until}| method.
597 (DATA_STATE): Experimental support for |getc_until| feature.
598
599 2008-09-13 Wakaba <wakaba@suika.fam.cx>
600
601 * HTML.pm.src: Check points added to newly added branches.
602
603 2008-09-13 Wakaba <wakaba@suika.fam.cx>
604
605 * HTML.pm.src: Remove |{char}|, which is no longer used.
606 Remove |{entity_in_attr}| and |{last_attribute_value_state}|
607 and replaced by |{prev_state}|.
608
609 * mkhtmlparser.pl: Remove |{char}| feature.
610 Remove |!!!back-next-input-character;| macro.
611
612 2008-09-13 Wakaba <wakaba@suika.fam.cx>
613
614 * HTML.pm.src: Finally we get rid of all the inner loops. Remove
615 entity related tokenizer states in favor of new states
616 implementing the consume character reference algorithm.
617
618 2008-09-13 Wakaba <wakaba@suika.fam.cx>
619
620 * HTML.pm.src: "Consume a character reference" algorithm is
621 now implemented as a tokenizer's state, rather than
622 a method, with minimum changes (more changes will
623 be made, in due course). "Bogus comment state"'s inner
624 loop gets removed.
625
626 2008-09-13 Wakaba <wakaba@suika.fam.cx>
627
628 * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
629 into their own tokenizer states.
630
631 2008-09-13 Wakaba <wakaba@suika.fam.cx>
632
633 * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
634 is split into three states.
635
636 2008-09-13 Wakaba <wakaba@suika.fam.cx>
637
638 * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
639 itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
640 no longer does the tokenizer have to push back next input
641 characters in those states.
642
643 2008-09-13 Wakaba <wakaba@suika.fam.cx>
644
645 * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
646 into four states so that no longer does the tokenizer have to push
647 back next input characters in that state.
648
649 2008-09-11 Wakaba <wakaba@suika.fam.cx>
650
651 * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
652 which can be used to insert some wrapper between the character
653 stream handle and the tokenizer. (It is currently not supported
654 for |set_inner_html| for |Element|s).
655
656 2008-09-10 Wakaba <wakaba@suika.fam.cx>
657
658 * HTML.pm.src: Ignore punctuations in charset names.
659
660 2008-09-10 Wakaba <wakaba@suika.fam.cx>
661
662 * ContentChecker.pm: Support for charset-layer error levels.
663
664 * HTML.pm.src: Don't specify |text| argument for the
665 |chardecode:fallback| error, since it is not the encoding
666 being used alternatively.
667
668 2008-09-06 Wakaba <wakaba@suika.fam.cx>
669
670 * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
671
672 2008-08-31 Wakaba <wakaba@suika.fam.cx>
673
674 * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
675
676 2008-08-31 Wakaba <wakaba@suika.fam.cx>
677
678 * HTML.pm.src: Bug fix and sync with the spec with regard
679 to after after frameset insertion mode processing (HTML5
680 revision 1909). Note that the implementation was wrong
681 per the old spec before the r1909 changes.
682
683 2008-08-30 Wakaba <wakaba@suika.fam.cx>
684
685 * HTMLTable.pm: scope=auto algorithm fix synced with the
686 spec (HTML5 revision 2093).
687 ($process_row): Algorithm step numbers synced with the
688 spec (HTML5 revision 2092).
689
690 2008-08-30 Wakaba <wakaba@suika.fam.cx>
691
692 * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
693 revision 2094).
694
695 2008-08-30 Wakaba <wakaba@suika.fam.cx>
696
697 * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
698
699 2008-08-30 Wakaba <wakaba@suika.fam.cx>
700
701 * HTML.pm.src: '"' and "'" at the end of attribute
702 name (after another attribute) now raise parse error (HTML5
703 revision 2123). Empty unquoted attribute values are no
704 longer allowed (HTML5 revision 2122).
705
706 2008-08-30 Wakaba <wakaba@suika.fam.cx>
707
708 * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
709 revision 2130).
710
711 2008-08-30 Wakaba <wakaba@suika.fam.cx>
712
713 * ContentChecker.pm: |xml:lang| attribute value must be same
714 as |lang| attribute value for HTML elements (HTML5 revision 2062
715 and so on).
716
717 2008-08-30 Wakaba <wakaba@suika.fam.cx>
718
719 * ContentChecker.pm: Error level definition for |xml_id_error|
720 was missing.
721
722 * URIChecker.pm: The end of the URL should be marked as the
723 error location for an empty path error. The position
724 between the userinfo and the port components should be
725 marked as the error location for an empty host error.
726
727 2008-08-30 Wakaba <wakaba@suika.fam.cx>
728
729 * URIChecker.pm: Set parameters representing where in the
730 value the error occurs for errors. Report unknown
731 address format error in warning level, since address
732 formats are rarely added. Path segments starting with "/.."
733 were misinterpreted as a dot-segment.
734
735 2008-08-30 Wakaba <wakaba@suika.fam.cx>
736
737 * URIChecker.pm (check_iri_reference): Requires
738 |Message::DOM::DOMImplementation|.
739
740 2008-08-29 Wakaba <wakaba@suika.fam.cx>
741
742 * IMTChecker.pm: Updated for the new error reporting architecture.
743
744 * ContentChecker.pm: Error levels for IMTs are added.
745
746 2008-08-17 Wakaba <wakaba@suika.fam.cx>
747
748 * H2H.pm (_shift_token): Support for unquoted HTML attribute
749 values.
750
751 2008-08-16 Wakaba <wakaba@suika.fam.cx>
752
753 * CacheManifest.pm: Support for new style of error
754 reports.
755
756 * HTML.pm.src: Set line=1, column=1 to the document node.
757
758 2008-08-16 Wakaba <wakaba@suika.fam.cx>
759
760 * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
761 and URL checkers. Support for more error levels for bogus
762 langauge tag and URL "standards".
763
764 * LangTag.pm, URIChecker.pm: Support for new style error
765 level reporting.
766
767 2008-08-15 Wakaba <wakaba@suika.fam.cx>
768
769 * ContentChecker.pm: Support for RDF/XML error levels.
770
771 * HTMLTable.pm, RDFXML.pm: Support for new style of error level
772 specifying. Error types are revised.
773
774 2008-08-15 Wakaba <wakaba@suika.fam.cx>
775
776 * ContentChecker.pm: All error reporting method calls are
777 renewed.
778
779 2008-08-15 Wakaba <wakaba@suika.fam.cx>
780
781 * HTML.pm.src: All error type names and "text" parameters
782 are revised. Use new style for "level" specification.
783
784 * mkhtmlparser.pl: Use new style for "level" specification.
785
786 2008-08-03 Wakaba <wakaba@suika.fam.cx>
787
788 * WebIDL.pm (parse_char_string): Simplified error
789 reporting process for broken ignored valuetype definition.
790 (Valuetype idl_text): Support for special "DOMString" name.
791
792 2008-08-03 Wakaba <wakaba@suika.fam.cx>
793
794 * WebIDL.pm ($get_scoped_name): Append "::::" if the last
795 terminal of the ScopedName is "DOMString", such that whether
796 the last part of the scoped name is "DOMString" or "_DOMString"
797 later. It is necessary to determine whether a |typedef|
798 definition should be ignored or not.
799 (parse_char_string): Unescape the identifier of
800 exception members.
801 ($resolve): Return undef for builtin types and sequence<T>
802 types (we might not have to do this, however...).
803 (check): Support checking for Exceptions, Valuetypes,
804 and Typedefs.
805 ($serialize_type): Support for "DOMString::::" syntax.
806 (Typedef idl_text): Output Type as "DOMString" if it
807 is really "DOMString" (i.e. its internal representation
808 is "::DOMString::").
809
810 2008-08-03 Wakaba <wakaba@suika.fam.cx>
811
812 * WebIDL.pm ($resolve): New code, based on resolve code
813 for constant types in the |check| method.
814 (check): Support for checking of attributes, operations, and
815 arguments.
816 (Attribute/Operation idl_text): Exception names in getraises,
817 setraises, and raises clauses is serizlied by |$serialize_type|
818 code.
819
820 2008-08-02 Wakaba <wakaba@suika.fam.cx>
821
822 * WebIDL.pm ($integer): Order of selections are changed to match
823 hexadecimal numbers (the original pattern, taken from the spec,
824 was not work for hexadecimal numbers, because the "0" prefix
825 matches to the [0-7]* part (as an empty string) and therefore
826 it does not match with remaining "x..." part of a "0x..." integer
827 literal.
828 ($get_type): It now returns a string, not an array reference,
829 for regular types and |sequence| types (i.e. it in any case
830 returns a string).
831 ($get_next_token): The second item in the array that represents
832 a integer or float token is now a Perl number value, not the
833 original string representation of the number.
834 (check): Support for const value consistency checking.
835 No extended attribute is defined for constants.
836 (Node subclasses): Use simple strings rather than array references
837 for default data type values.
838 ($serialize_type): Type values are now simple strings.
839 (value): If the new attribute value is a false value, then
840 a FALSE value is set to the attribute.
841
842 2008-08-02 Wakaba <wakaba@suika.fam.cx>
843
844 * WebIDL.pm ($get_scoped_name): Now scoped names are stored
845 in its stringified format ("scoped name" as defined in the
846 spec). Note that future version of this module should not use
847 array references for type values and the |type_text| attribute
848 should be made obsolete.
849 (parse_char_string): Unescape attribute names.
850 (check): Support for checking of whether inherited interfaces
851 are actually defined or not. Support for checking of whether
852 interface member identifiers are duplicated or not.
853 ($serialize_type): Scoped names are returned as is. A future
854 version of this code should escape identifiers other than "DOMString",
855 otherwise the idl_text would be non-conforming.
856
857 2008-08-02 Wakaba <wakaba@suika.fam.cx>
858
859 * WebIDL.pm (parse_char_string): Set line/column numbers
860 to generated nodes. Unescape identifiers. Extended attributes
861 for Definition's were ignored.
862 (append_child): Set |parent_node| attribute.
863 (parent_node): New attribute.
864 (check): Support interface/exception members. Support
865 extended attributes. Support definition identifier uniqueness
866 constraint.
867 (qualified_name): New attribute.
868 (Interface/Exception idl_text): Extended attributes were
869 not prepended to the returned text.
870
871 2008-08-02 Wakaba <wakaba@suika.fam.cx>
872
873 * WebIDL.pm (parse_char_string): Set line/column numbers
874 to interface object experimentally. s/shift/pop/g, shift
875 would make things wrong. Support for interface forward
876 declarations was missing. Broken interface declarations
877 with no block were not ignored entirely.
878 (Whatpm::WebIDL::Node): New abstract class. This class
879 makes things easier.
880 (child_nodes): New attribute. Unlike DOM's attribute with
881 same name, this attribute returns a dead list of nodes for
882 simplicity.
883 (get_user_data, set_user_data): New methods.
884 (Module idl_text): A SPACE character should be inserted
885 before the |{| character.
886 (Interface idl_text): Support for interface forward declarations.
887 (is_forward_declaration): New attribute.
888
889 2008-07-19 Wakaba <wakaba@suika.fam.cx>
890
891 * WebIDL.pm (type_text): Better serializer.
892
893 2008-07-19 Wakaba <wakaba@suika.fam.cx>
894
895 * WebIDL.pm: Revise forward-compatible parsing so that
896 it now can handle broken extended attributes and as such.
897
898 2008-07-19 Wakaba <wakaba@suika.fam.cx>
899
900 * WebIDL.pm: Real support for extended attributes.
901 Support for extended attributes with arguments.
902
903 2008-07-19 Wakaba <wakaba@suika.fam.cx>
904
905 * WebIDL.pm: Support for |exception| syntax.
906 (Interface->idl_text): Tentative support for inheritances.
907
908 2008-07-19 Wakaba <wakaba@suika.fam.cx>
909
910 * WebIDL.pm: Hierarchical scoped name support was broken.
911 Support for raises, setraises, and getraises syntaxes.
912
913 2008-07-18 Wakaba <wakaba@suika.fam.cx>
914
915 * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
916 proper support for types, extended attributes, and exceptions yet).
917 WebIDL parser, version 1 (no support for exceptions yet,
918 no proper support for extended attributes yet).
919
920 2008-07-09 Wakaba <wakaba@suika.fam.cx>
921
922 * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
923
924 2008-06-29 Wakaba <wakaba@suika.fam.cx>
925
926 * WebIDL.pm: Support for valuetype and const.
927
928 2008-06-29 Wakaba <wakaba@suika.fam.cx>
929
930 * WebIDL.pm: New module.
931
932 2008-06-15 Wakaba <wakaba@suika.fam.cx>
933
934 * Makefile (Entities.html): URI changed.
935
936 2008-06-08 Wakaba <wakaba@suika.fam.cx>
937
938 * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
939
940 2008-06-01 Wakaba <wakaba@suika.fam.cx>
941
942 * HTML.pm.src (_get_next_token): A parse error was missing.
943
944 2008-06-01 Wakaba <wakaba@suika.fam.cx>
945
946 * mklinktypelist.pl: rel=contact is no longer part of the HTML5
947 spec (commented out). (HTML5 revision 1711).
948
949 2008-05-25 Wakaba <wakaba@suika.fam.cx>
950
951 * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
952
953 * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
954 as UTF-16 (HTML5 revision 1701).
955
956 2008-05-25 Wakaba <wakaba@suika.fam.cx>
957
958 * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
959 1692).
960
961 2008-05-25 Wakaba <wakaba@suika.fam.cx>
962
963 * HTML.pm.src: The secondary insertion mode used when switching
964 to foreign content is the "in body" insertion mode (HTML5 revision
965 1696).
966
967 2008-05-25 Wakaba <wakaba@suika.fam.cx>
968
969 * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
970 revision 1697).
971
972 2008-05-24 Wakaba <wakaba@suika.fam.cx>
973
974 * HTML.pm.src: Support for end-of-file token in foreign content
975 insertion mode (HTML5 revision 1693). Update SVG camelCase
976 attribute list (HTML5 revision 1700). <textarea> closes
977 </select> (HTML5 revision 1699). More start tags close in
978 foreign content insertion mode (HTML5 revision 1698).
979
980 2008-05-24 Wakaba <wakaba@suika.fam.cx>
981
982 * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
983
984 2008-05-24 Wakaba <wakaba@suika.fam.cx>
985
986 * HTML.pm.src: More robust charset parameter detection (HTML5
987 revision 1674).
988
989 2008-05-24 Wakaba <wakaba@suika.fam.cx>
990
991 * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
992 revision 1676).
993
994 2008-05-24 Wakaba <wakaba@suika.fam.cx>
995
996 * HTML.pm.src: Ignore language part of public identifiers for
997 quriks mode detection (HTML5 revision 1679).
998
999 2008-05-24 Wakaba <wakaba@suika.fam.cx>
1000
1001 * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
1002 revision 1685).
1003
1004 2008-05-24 Wakaba <wakaba@suika.fam.cx>
1005
1006 * HTML.pm.src: Support for EOF in new states for tags (HTML5
1007 revision 1684).
1008
1009 2008-05-24 Wakaba <wakaba@suika.fam.cx>
1010
1011 * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
1012 work (HTML5 revision 1690).
1013
1014 2008-05-24 Wakaba <wakaba@suika.fam.cx>
1015
1016 * HTML.pm.src (_tree_construction_main): Change handling of
1017 end tags in head insertion modes (HTML5 revision 1686).
1018 (parse_char_string): Bug fix for non-utf8 character string handlings.
1019 (parse_char_stream): |ungetc| does not work well for this context.
1020
1021 2008-05-18 Wakaba <wakaba@suika.fam.cx>
1022
1023 * HTML.pm.src (parse_byte_string): Redefined to invoke
1024 |parse_byte_stream|.
1025 (parse_byte_stream): New method.
1026
1027 2008-05-18 Wakaba <wakaba@suika.fam.cx>
1028
1029 * HTML.pm.src (parse_byte_string): Fix the column number reported
1030 by encoding layer error reporter.
1031
1032 2008-05-17 Wakaba <wakaba@suika.fam.cx>
1033
1034 * HTML.pm.src (parse_byte_string): Use streaming decoder
1035 rather than converting the whole byte string and then parsing.
1036 Propagate errors in character encoding layer.
1037 (get_next_token): Precise error reporting for |bare stago| error.
1038
1039 2008-05-17 Wakaba <wakaba@suika.fam.cx>
1040
1041 * HTML.pm.src (parse_char_stream): New method.
1042 (parse_char_string): This method is now defined as an invocation
1043 of the |parse_char_stream| method.
1044
1045 2008-05-17 Wakaba <wakaba@suika.fam.cx>
1046
1047 * HTML.pm.src (parse_byte_string): Report various status
1048 of the sniffing as info-level errors. Support for new
1049 decoding framework in parser resestting.
1050 (new): Various default error levels were not set.
1051
1052 2008-05-17 Wakaba <wakaba@suika.fam.cx>
1053
1054 * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
1055 algorithm, except for the actual sniffing, is implemented
1056 with new framework with Message::Charset::Info.
1057
1058 2008-05-16 Wakaba <wakaba@suika.fam.cx>
1059
1060 * CacheManifest.pm (_parse): Drop fragment identifiers from
1061 URIs in fallback section (HTML5 revision 1596).
1062
1063 2008-05-10 Wakaba <wakaba@suika.fam.cx>
1064
1065 * Makefile (Entities.html): URI has changed.
1066
1067 2008-05-10 Wakaba <wakaba@suika.fam.cx>
1068
1069 * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
1070 1553).
1071
1072 2008-05-06 Wakaba <wakaba@suika.fam.cx>
1073
1074 * ContentChecker.pm: Noted that those returned in |table| are
1075 no longer table elements, but table objects returned
1076 by Whatpm::HTMLTable.
1077
1078 * HTMLTable.pm (form_table): Return table element node
1079 as |$table->{element}|.
1080 (assign_header): Support for the |headers=""| attribute.
1081
1082 2008-05-06 Wakaba <wakaba@suika.fam.cx>
1083
1084 * HTMLTable.pm (assign_header): New function; first version
1085 with no support for headers="".
1086 (form_table): Include table width and height to the returned
1087 table object for covenience. Indexing in column assignement was wrong.
1088 Set whether a data cell is empty or not for convenience.
1089
1090 2008-05-05 Wakaba <wakaba@suika.fam.cx>
1091
1092 * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
1093
1094 2008-05-05 Wakaba <wakaba@suika.fam.cx>
1095
1096 * HTMLTable.pm: How table model errors are detected is
1097 changed (HTML5 revision 1387).
1098
1099 2008-05-05 Wakaba <wakaba@suika.fam.cx>
1100
1101 * HTMLTable.pm: The algorithm now moves |tfoot| elements
1102 to the end of the table (HTML5 revision 1380).
1103
1104 2008-05-05 Wakaba <wakaba@suika.fam.cx>
1105
1106 * HTMLTable.pm: The algorithm is now 0-based indexing, instead
1107 of 1-based (HTML5 revision 1376).
1108
1109 2008-05-05 Wakaba <wakaba@suika.fam.cx>
1110
1111 * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
1112 and "Content-Encoding" no longer prevent sniffing (HTML5
1113 revision 1288).
1114
1115 2008-05-05 Wakaba <wakaba@suika.fam.cx>
1116
1117 * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
1118 1282).
1119
1120 2008-05-03 Wakaba <wakaba@suika.fam.cx>
1121
1122 * ContentChecker.pm: Support for global attributes.
1123 Status of XML specs are added.
1124
1125 2008-05-03 Wakaba <wakaba@suika.fam.cx>
1126
1127 * ContentChecker.pm (check_element): Support for |template=""|
1128 and |ref=""| attribute (referent element type checking).
1129
1130 2008-04-29 Wakaba <wakaba@suika.fam.cx>
1131
1132 * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
1133 1500) is implemented (except for IDNA part and URI-scheme-specific
1134 knowledge). Line number counting was wrong for LF-only
1135 documents.
1136
1137 2008-04-13 Wakaba <wakaba@suika.fam.cx>
1138
1139 * HTML.pm.src: Raise an parse error for any disallowed
1140 character (HTML5 revision 1263).
1141
1142 2008-04-13 Wakaba <wakaba@suika.fam.cx>
1143
1144 * mkentitylist.pl: Support for new HTML5 entity table format (the
1145 definition for |AElig;| was missing).
1146
1147 2008-04-13 Wakaba <wakaba@suika.fam.cx>
1148
1149 * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
1150 name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
1151
1152 2008-04-13 Wakaba <wakaba@suika.fam.cx>
1153
1154 * HTML.pm.src: List of element names that close foreign content
1155 insertion mode is added (HTML5 revisions 1412 and 1418).
1156
1157 2008-04-13 Wakaba <wakaba@suika.fam.cx>
1158
1159 * HTML.pm.src: Support for |mglyph| and |malignmark|
1160 elements (HTML5 revision 1410).
1161
1162 2008-04-13 Wakaba <wakaba@suika.fam.cx>
1163
1164 * HTML.pm.src: Support for new long MathML entities (HTML5
1165 revision 1406).
1166
1167 2008-04-13 Wakaba <wakaba@suika.fam.cx>
1168
1169 * HTML.pm.src: CDATA section support for MathML and SVG
1170 elements (HTML5 revision 1404 and 1420).
1171
1172 2008-04-12 Wakaba <wakaba@suika.fam.cx>
1173
1174 * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
1175 elements (HTML5 revision 1404). Unused !!!macro definitions
1176 are removed.
1177
1178 2008-04-12 Wakaba <wakaba@suika.fam.cx>
1179
1180 * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
1181 are raised is changed (HTML5 revision 1404).
1182
1183 2008-04-06 Wakaba <wakaba@suika.fam.cx>
1184
1185 * HTML.pm.src: Category sets that are no longer used are removed.
1186
1187 2008-04-06 Wakaba <wakaba@suika.fam.cx>
1188
1189 * HTML.pm.src: The ->[1] property of stack entries are now
1190 replaced by constants representing element category.
1191
1192 2008-04-06 Wakaba <wakaba@suika.fam.cx>
1193
1194 * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
1195 for error reporting. (This is a preparation for using constant
1196 value for ->[1].)
1197
1198 2008-03-22 Wakaba <wakaba@suika.fam.cx>
1199
1200 * RDFXML.pm: Typo fixed.
1201
1202 2008-03-22 Wakaba <wakaba@suika.fam.cx>
1203
1204 * ContentChecker.pm: |fact_level| is now treated
1205 as same as |must_level|, i.e. level = |m|.
1206 (check_element): Make list of URIs in the DOM.
1207
1208 2008-03-21 Wakaba <wakaba@suika.fam.cx>
1209
1210 * RDFXML.pm: Language accessor implemented.
1211 Local (null-namespace) attribute support.
1212
1213 2008-03-21 Wakaba <wakaba@suika.fam.cx>
1214
1215 * RDFXML.pm: Factored out ID checking code.
1216
1217 2008-03-21 Wakaba <wakaba@suika.fam.cx>
1218
1219 * RDFXML.pm: TODO items noted. Validation of ID
1220 and URI attributes is implemented. Warn if unknown
1221 value is used in rdf:parseType="" attribute.
1222
1223 * URIChecker.pm (check_rdf_uri_reference): New function.
1224
1225 2008-03-21 Wakaba <wakaba@suika.fam.cx>
1226
1227 * RDFXML.pm: bnodeid implemented. Relative references
1228 are now resolved.
1229
1230 2008-03-21 Wakaba <wakaba@suika.fam.cx>
1231
1232 * ContentChecker.pm: RDF reification implemented.
1233
1234 * RDFXML.pm: undef vs false bug fixed.
1235 Reification implemented.
1236
1237 2008-03-21 Wakaba <wakaba@suika.fam.cx>
1238
1239 * RDFXML.pm: s/id/ID/ for attribute name.
1240 The |node| arguments are added for |ontriple| calls.
1241 Too many "attribute not allowed" errors were raised.
1242
1243 * ContentChecker.pm: Initial experimental support for rdf:RDF
1244 element.
1245
1246 2008-03-21 Wakaba <wakaba@suika.fam.cx>
1247
1248 * RDFXML.pm: New module.
1249
1250 2008-03-20 Wakaba <wakaba@suika.fam.cx>
1251
1252 * HTML.pm.src (set_inner_html): Line/column number
1253 code was old one yet.
1254
1255 2008-03-20 Wakaba <wakaba@suika.fam.cx>
1256
1257 * HTML.pm.src: Better line/column reporting for "duplicate attribute"
1258 errors. Line/column markings for DOCTYPE, comment, and
1259 character tokens are reintroduced; otherwise,
1260 error location for "not HTML5" error and errors
1261 for implied elements are not attached.
1262
1263 2008-03-20 Wakaba <wakaba@suika.fam.cx>
1264
1265 * HTML.pm.src: Set line/column numbers to attributes.
1266
1267 * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
1268 (value): Setter implemented.
1269
1270 * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
1271
1272 2008-03-20 Wakaba <wakaba@suika.fam.cx>
1273
1274 * HTML.pm.src: Unused line/column markings are removed.
1275
1276 2008-03-20 Wakaba <wakaba@suika.fam.cx>
1277
1278 * HTML.pm.src (_get_next_token): Remove |first_start_tag|
1279 flag, which is no longer used.
1280
1281 2008-03-17 Wakaba <wakaba@suika.fam.cx>
1282
1283 * HTML.pm.src: Set line/column information to element nodes.
1284
1285 * mkhtmlparser.pl (!!!create-element, !!!insert-element,
1286 and !!!insert-element-t): Set line/column information to
1287 element nodes.
1288
1289 2008-03-17 Wakaba <wakaba@suika.fam.cx>
1290
1291 * HTML.pm.src (_get_next_token): The first "<" character
1292 in "<?", "<>", or "</>" should be the error point.
1293
1294 2008-03-16 Wakaba <wakaba@suika.fam.cx>
1295
1296 * HTML.pm.src: Some more fixes on error position reporting.
1297
1298 2008-03-16 Wakaba <wakaba@suika.fam.cx>
1299
1300 * HTML.pm.src: Token-level precious error reporting.
1301
1302 2008-03-16 Wakaba <wakaba@suika.fam.cx>
1303
1304 * HTML.pm.src: Preparation for more precious error point
1305 reporting.
1306
1307 2008-03-11 Wakaba <wakaba@suika.fam.cx>
1308
1309 * HTML.pm.src: Error type revised.
1310
1311 2008-03-11 Wakaba <wakaba@suika.fam.cx>
1312
1313 * HTML.pm.src: Similar codes are merged together, again.
1314
1315 2008-03-11 Wakaba <wakaba@suika.fam.cx>
1316
1317 * HTML.pm.src: Similar codes are merged together.
1318
1319 2008-03-10 Wakaba <wakaba@suika.fam.cx>
1320
1321 * mkhtmlparser.pl: Set "level" parameter to parse errors.
1322
1323 * HTML.pm.src: Code refined.
1324
1325 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1326
1327 * HTML.pm.src: |</body>| treatement has been changed (HTML5
1328 revision 1348). Note that I really don't know this makes
1329 any difference in the black-box behavior of the parser.
1330
1331 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1332
1333 * HTML.pm.src: New end-of-file token implementation (HTML5
1334 revision 1348).
1335
1336 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1337
1338 * HTML.pm.src: |applet| support (HTML5 revision 1347).
1339
1340 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1341
1342 * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
1343
1344 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1345
1346 * HTML.pm.src: Support for |<input>| in the "in select" insertion
1347 mode and support for the "in select in table" insertion mode (HTML5
1348 revision 1342).
1349
1350 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1351
1352 * HTML.pm.src: No foster parenting for <script> and <script>
1353 in non-tainted <table>s (HTML5 revision 1336).
1354
1355 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1356
1357 * HTML.pm.src: Ignore white space characters between <html>
1358 and <head> (HTML5 revision 1332).
1359
1360 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1361
1362 * HTML.pm.src: Treat <input type=hidden> as if it were a
1363 white space (HTML5 revision 1331).
1364
1365 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1366
1367 * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
1368 element (HTML5 revision 1330).
1369
1370 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1371
1372 * HTML.pm.src: <title> is always appended to the current
1373 element (HTML5 revision 1328).
1374
1375 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1376
1377 * HTML.pm.src: White space in tainted tables are moved
1378 into foster parents (HTML5 revision 1326).
1379
1380 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1381
1382 * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
1383 revision 1321).
1384
1385 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1386
1387 * HTML.pm.src: |</h/n/>| case code rearranged to align with
1388 the spec (HTML5 revision 1320). Note that we finally complete
1389 all of HTML5 revision 1320 changes.
1390
1391 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1392
1393 * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
1394 tags (HTML5 revision 1320).
1395
1396 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1397
1398 * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
1399
1400 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1401
1402 * HTML.pm.src: A "generate implied end tags" code (t409.1)
1403 could not be reached so that it is now removed (HTML5 revision
1404 1320).
1405
1406 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1407
1408 * HTML.pm.src: Code for the case of |</div>| and so on are revised
1409 to align with new spec text (HTML5 revision 1320).
1410
1411 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1412
1413 * HTML.pm.src: Remove strange |if| condition; however, it should
1414 have had no harm in theory.
1415
1416 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1417
1418 * HTML.pm.src (_tree_construction_main): '</p>' in body
1419 case is split from other end tags for the preparation
1420 of implementing HTML revision 1320.
1421
1422 2008-03-07 Wakaba <wakaba@suika.fam.cx>
1423
1424 * HTML.pm.src: Simplified "generate implied end tag" (HTML5
1425 revision 1320).
1426
1427 2008-03-07 Wakaba <wakaba@suika.fam.cx>
1428
1429 * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
1430 and "div" (HTML5 revision 1318). Add comments to where
1431 |form| pointer association codes should be inserted (HTML5 revision
1432 1319).
1433
1434 2008-03-06 Wakaba <wakaba@suika.fam.cx>
1435
1436 * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
1437
1438 2008-03-05 Wakaba <wakaba@suika.fam.cx>
1439
1440 * HTML.pm.src: Since the case t268 should never be reached (no
1441 other token type, there are), it is replaced by a |die| statement.
1442
1443 2008-03-05 Wakaba <wakaba@suika.fam.cx>
1444
1445 * HTML.pm.src: Typo fixed.
1446
1447 2008-03-04 Wakaba <wakaba@suika.fam.cx>
1448
1449 * HTML.pm.src (_tree_construction_initial): Some limited quirks
1450 doctypes were not uppercased for comparison.
1451
1452 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1453
1454 * HTML.pm.src (tree construction and set_inner_html): Checkpoints
1455 are added.
1456
1457 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1458
1459 * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
1460 are set. Cases that are unlikely reached are noted as so.
1461
1462 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1463
1464 * HTML.pm.src: Checkpoints for debugging are added.
1465
1466 * mkhtmlparser.pl: Support for |!!!cp| syntax.
1467
1468 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1469
1470 * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
1471 for simplicity.
1472
1473 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1474
1475 * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
1476 revision 1307).
1477
1478 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1479
1480 * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
1481 is set are changed (HTML5 revision 1305).
1482
1483 2008-03-02 Wakaba <wakaba@suika.fam.cx>
1484
1485 * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
1486 1292). Entities are not parsed in comment-like part in RCDATA
1487 elements (HTML5 revision 1294). Allow bare & at the end
1488 of attribute value literals (HTML5 revision 1296). More
1489 quirks mode doctypes (HTML5 revision 1302). Requires spaces
1490 between attributes and ban attribute names or unquoted
1491 attribute values containing single or double quotes (HTML5
1492 revision 1303).
1493
1494 2008-03-02 Wakaba <wakaba@suika.fam.cx>
1495
1496 * ContentChecker.pm: Typo fixed. Don't raise "character encoding"
1497 and related errors unless it is an HTML document (though the spec
1498 is unclear on whether it is applied to XHTML document).
1499
1500 * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
1501 are added.
1502
1503 2008-03-02 Wakaba <wakaba@suika.fam.cx>
1504
1505 * HTML.pm.src: s/local_name/manakai_local_name/g.
1506
1507 2008-03-01 Wakaba <wakaba@suika.fam.cx>
1508
1509 * _NamedEntityList.pm: Updated (HTML5 revision 1286).
1510
1511 * HTML.pm.src: |charset| in |content| attribute is
1512 case-insensitive (HTML5 revision 1270).
1513
1514 2008-02-26 Wakaba <wakaba@suika.fam.cx>
1515
1516 * ContentChecker.pm: New status constants are added.
1517 ($ElementDefault): |status| added.
1518 (check_element): Err for non-standard or deprecated elements.
1519 (_attr_status_info): For non-standard or deprecated attributes.
1520
1521 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1522
1523 * ContentChecker.pm (_attr_status_info): New internal method.
1524
1525 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1526
1527 * ContentChecker.pm (check_element): Element standardized
1528 status information is now dispatched.
1529
1530 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1531
1532 * ContentChecker.pm (check_element): Fix |del|-and-significant
1533 problem by adding some more arguments.
1534
1535 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1536
1537 * ContentChecker.pm (check_element): Use context of
1538 container-for-the-purpose-of-content-model element (not transparent
1539 element) for |check_child_element| calling and significant
1540 text flag marking. This reintroduces |<del>|-and-significant
1541 problem again.
1542
1543 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1544
1545 * ContentChecker.pm (check_element): Make semi-transparent
1546 elements ignored for the purpose of phase changes in
1547 content model checking.
1548
1549 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1550
1551 * ContentChecker.pm (check_element): In-element state
1552 was not properly managed for transparent cases.
1553
1554 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1555
1556 * ContentChecker.pm (check_element): Support for |video|
1557 and |audio| as semi-transparent elements.
1558
1559 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1560
1561 * ContentChecker.pm ($HTMLSemiTransparentElements): New.
1562 (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
1563 Support for |html:object| as a semi-transparent element.
1564
1565 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1566
1567 * ContentChecker.pm (check_element): The way to traverse
1568 the tree is entirely revised to make it easier to track
1569 the state of ancestors/descendants. As a result of this
1570 revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1571 support for content model checking for HTML elements |figure|,
1572 |object|, |video|, and |audio| and checking for XML elements (and
1573 some XMLNS checkings) are dropped for now. They will be
1574 reimplemented in due cource.
1575
1576 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1577
1578 * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1579 token (HTML5 revision 1225).
1580
1581 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1582
1583 * ContentChecker.pm ({unsupported_level}): New value.
1584
1585 * HTML.pm.src: Save whether |meta| |content| attribute
1586 contains character references or not.
1587
1588 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1589
1590 * ContentChecker.pm (_get_children): (Incompleted) attempt to
1591 imlement significant content checking for contents
1592 with |del| elements.
1593
1594 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1595
1596 * ContenteChecker.pm ($HTMLTransparentElements): More
1597 elements are added.
1598 (_get_children): HTML |object| elements are now semi-transparent.
1599
1600 * NanoDOM.pm (manakai_html, manakai_head): New methods.
1601
1602 2008-02-16 Wakaba <wakaba@suika.fam.cx>
1603
1604 * CacheManifest.pm: HTML5 revision 1211 implemented.
1605
1606 * CacheManifest.pod: Updated.
1607
1608 2008-02-10 Wakaba <wakaba@suika.fam.cx>
1609
1610 * ContentChecker.pm (check_document, check_element): Support
1611 for second argument ($onsubdoc).
1612 (_get_css_parser): Removed (now it is part of WDCC).
1613
1614 2008-02-09 Wakaba <wakaba@suika.fam.cx>
1615
1616 * ContentChecker.pm (_get_css_parser): New.
1617
1618 2007-11-25 Wakaba <wakaba@suika.fam.cx>
1619
1620 * ContentChecker.pm ($AnyChecker): Old way to add child elements
1621 for checking had been used.
1622
1623 2007-11-25 Wakaba <wakaba@suika.fam.cx>
1624
1625 * ContentChecker.pm (check_element): New todo item type |descendant|.
1626
1627 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1628
1629 * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1630 specced in RFC 2046 and RFC 4288.
1631 (application/atom+xml): Definition added.
1632
1633 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1634
1635 * URIChecker.pm: Make RFC 3986 should-level errors
1636 warnings (rather than SHOULD-level errors).
1637
1638 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1639
1640 * NanoDOM.pm (get_user_data, set_user_data): New methods.
1641
1642 * HTML.pm.src: A flag for character references in attribute
1643 values are added. Set |manakai_has_reference| user data
1644 to |charset| attribute.
1645
1646 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1647
1648 * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1649 attributes.
1650
1651 * ContentChecker.pm (check_document): Warn if charset requirements
1652 cannot be tested.
1653
1654 2007-11-19 Wakaba <wakaba@suika.fam.cx>
1655
1656 * HTML.pm.src (parse_byte_string): Detect charset
1657 by universalchardet if charset parameter is not specified.
1658
1659 * Makefile (Charset-all, Charset-clean): New rules.
1660
1661 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1662
1663 * ContentChecker.pm (check_document): Check the existence
1664 of character encoding declaration and selection of encoding
1665 for HTML document.
1666
1667 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1668
1669 * ContentType.pm (get_sniffed_type): Return also the official
1670 type in list context.
1671
1672 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1673
1674 * ContentType.pm: Sniffing with leading white space
1675 ignoring (HTML5 revisions 1013 and 1016).
1676
1677 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1678
1679 * ContentType.pm: HTML5 revision 1013 changes, except for leading
1680 white spaces, are implemented.
1681
1682 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1683
1684 * HTML.pm.src (parse_char_string): Set |inner_encoding|
1685 attribute if possible.
1686
1687 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1688
1689 * HTML.pm.src (parse_byte_string): New method.
1690 (parse_char_string): New alias for |parse_string|.
1691 (main phase): Invoking "change the encoding" algorithm if desired.
1692
1693 * HTML.pod: Updated.
1694
1695 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1696
1697 * HTML.pod (get_inner_html): Removed.
1698
1699 * Makefile (HTML-all, HTML-clean): New.
1700
1701 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1702
1703 * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1704
1705 2007-11-08 Wakaba <wakaba@suika.fam.cx>
1706
1707 * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1708
1709 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1710
1711 * Makefile: |CacheManifest.html| is added.
1712
1713 * CacheManifest.pod: New file.
1714
1715 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1716
1717 * CacheManifest.pm: New module.
1718
1719 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1720
1721 * HTML.pm.src: Support for application cache selection algorithm
1722 callback.
1723
1724 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1725
1726 * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1727
1728 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1729
1730 * Makefile (clean): New rule.
1731
1732 * NanoDOM.pm (public_id, system_id): New attributes.
1733
1734 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1735
1736 * Makefile (CSS-all, CSS-clean, clean): New rules.
1737
1738 2007-10-14 Wakaba <wakaba@suika.fam.cx>
1739
1740 * ContentChecker.pm (check_document): Support for
1741 new |is_xml_root| flag.
1742 (check_element): Support for new |pluses| state.
1743 (_add_pluses): New method.
1744 (_remove_minuses): Support for new |minus| item.
1745
1746 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1747
1748 * ContentChecker.pm: Raise specific error for invalid
1749 root element.
1750
1751 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1752
1753 * ContentChecker.pm: Set level values for later uses.
1754
1755 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1756
1757 * ContentChecker.pm: Support for language tag validation.
1758
1759 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1760
1761 * LangTag.pm (check_rfc3066_language_tag): New method.
1762
1763 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1764
1765 * LangTag.pm: New module.
1766
1767 2007-09-04 Wakaba <wakaba@suika.fam.cx>
1768
1769 * HTML.pm.src: Some error types were wrong.
1770
1771 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1772
1773 * CSS/: New directory.
1774
1775 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1776
1777 * ContentChecker.pm (_check_get_children): Support
1778 for |noscript| in |head|.
1779
1780 2007-08-12 Wakaba <wakaba@suika.fam.cx>
1781
1782 * URI/: New directory.
1783
1784 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1785
1786 * HTML.pm.src: Tokenizer's states are now represented in
1787 number.
1788
1789 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1790
1791 * HTML.pm.src: |or|s for insertion modes are replaced
1792 by |&|s.
1793
1794 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1795
1796 * HTML.pm.src: Token types are now represented in number.
1797
1798 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1799
1800 * ContentType.pm (SEE ALSO): Updated.
1801
1802 * HTML.pm.src: Insertion modes are now represented in number.
1803
1804 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1805
1806 * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1807 999) is implemented.
1808
1809 2007-08-08 Wakaba <wakaba@suika.fam.cx>
1810
1811 * ContentType.pm: Sniffing for |<script| (HTML5 revision
1812 983) is implemented.
1813
1814 2007-08-06 Wakaba <wakaba@suika.fam.cx>
1815
1816 * ContentChecker.pod: New documentation.
1817
1818 * Makefile: A rule for |ContentChecker.html| is added.
1819
1820 * ContentChecker.pm: A pod "LICENSE" section is added.
1821
1822 * NanoDOM.pm ($VERSION): New variable.
1823
1824 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1825
1826 * H2H.pm: |b|, |i|, and |sub| are added to the
1827 list of allowed HTML elements.
1828
1829 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1830
1831 * H2H.pm: |samp| is added to the list of allowed
1832 HTML elements.
1833
1834 * URIChecker.pm (check_iri): New.
1835 (check_iri_reference): Error type for IRI reference
1836 syntax error is changed.
1837
1838 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1839
1840 * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1841 is added.
1842 (check_document): Load appropriate module before validation.
1843
1844 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1845
1846 * ContentChecker/: New directory.
1847
1848 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1849
1850 * ContentChecker.pm: HTML |time| element is implemented.
1851
1852 * HTMLTable.pm: Comments are updated as HTML5 is revised.
1853
1854 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1855
1856 * ContentChecker.pm (check_document): Return value
1857 even if no document element is found.
1858
1859 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1860
1861 * HTML.pm.src: |$in_body| is no longer a function.
1862
1863 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1864
1865 * HTML.pm.src: The |$in_body| code has been moved down.
1866
1867 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1868
1869 * HTML.pm.src: The "trailing end" insertion mode
1870 is split into "after html body" and "after html frameset"
1871 insertion modes. Their codes are merged with "after body"
1872 and "after frameset" codes. |$previous_insertion_mode|
1873 has been removed. "after frameset" code is
1874 merged with "in frameset" code.
1875
1876 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1877
1878 * HTML.pm.src: The "before head" insertion mode is
1879 merged with the "in head" insertion mode.
1880
1881 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1882
1883 * HTML.pm.src: Cases in "in head" insertion mode are
1884 reorganized.
1885
1886 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1887
1888 * HTML.pm.src: Some cases in "in table" insertion mode
1889 are merged.
1890
1891 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1892
1893 * HTML.pm.src: The "in row" insertion mode is merged
1894 with "in table" insertion mode.
1895
1896 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1897
1898 * HTML.pm.src: The "in table" and "in table body" insertion
1899 modes are merged.
1900
1901 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1902
1903 * HTML.pm.src: There is no "in table head" or "in table foot"
1904 insertion mode!
1905
1906 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1907
1908 * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
1909 now directly invoke the handler.
1910
1911 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1912
1913 * HTML.pm.src: Codes for "in cell" insertion mode
1914 is merged to the "in body" insertion mode code.
1915
1916 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1917
1918 * HTML.pm.src: Codes for "in body" and "in caption"
1919 insertion modes are merged.
1920
1921 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1922
1923 * HTML.pm.src: Two |!!!next-token|s were missing.
1924
1925 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1926
1927 * HTML.pm.src: Use numeric constant for |{content_mode}|
1928 instead of string constant for |{content_model_flag}|.
1929
1930 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1931
1932 * HTML.pm.src: Add the name of the attribute
1933 to the "duplicate attribute" error.
1934
1935 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1936
1937 * ContentChecker.pm: Return the |class| node list.
1938
1939 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1940
1941 * ContentChecker.pm: Return the |id| node list.
1942
1943 * HTML.pm.src: A typo is fixed.
1944
1945 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1946
1947 * ContentChecker.pm: Drop wrong |level => 'error'| specification
1948 from "in HTML:xml:lang" error. Character position
1949 is now the last part of the error type in the URI error
1950 description. Report "unsupported" status for language
1951 tags, media queries, script codes, and style sheets.
1952
1953 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1954
1955 * ContentChecker.pm: Report error if |xml:lang|
1956 in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
1957 in XML.
1958
1959 * NanoDOM.pm (Attr.owner_document): New attribute.
1960
1961 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1962
1963 * HTML.pm.src: The character immediately following
1964 a bare |hcro| was discarded. Fix handling of
1965 entity references in attribute values.
1966
1967 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1968
1969 * HTML.pm.src (main and trailin end phases): Token
1970 types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
1971 factored out. Error types |in html:#DOCTYPE|
1972 and |after html:#DOCTYPE| are merged
1973 into |DOCTYPE in the middle|. |</frameset>|
1974 in fragment parsing mode changed the
1975 insertion mode.
1976
1977 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1978
1979 * HTML.pm.src: |$phase| has been removed; The |trailing end|
1980 phase is now an insertion mode. Treatments for white
1981 space character tokens were incorrect for some
1982 insertion modes. An old |meta| case was not removed.
1983
1984 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1985
1986 * HTML.pm.src: |meta| charset declaration extraction
1987 implemented (but changing the encoding is not yet:-).
1988
1989 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1990
1991 * Charset/: New directory.
1992
1993 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1994
1995 * H2H.pm: New Perl module (created from
1996 manakai's H2H.dis).
1997
1998 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1999
2000 * XMLSerializer.pm: New Perl module (created from
2001 manakai's SimpleLS.dis).
2002
2003 2007-07-07 Wakaba <wakaba@suika.fam.cx>
2004
2005 * HTML.pm.src: |<!---x-->| was not processed correctly.
2006
2007 2007-07-01 Wakaba <wakaba@suika.fam.cx>
2008
2009 * HTML.pm.src: Report correct error message
2010 for |<body></div></body>|.
2011
2012 2007-07-01 Wakaba <wakaba@suika.fam.cx>
2013
2014 * HTMLTable.pm: An error description was incorrect.
2015
2016 2007-06-30 Wakaba <wakaba@suika.fam.cx>
2017
2018 * ContentChecker.pm: Return |{term}| list.
2019
2020 2007-06-30 Wakaba <wakaba@suika.fam.cx>
2021
2022 * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
2023 nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
2024 and <title> outside of head).
2025
2026 2007-06-30 Wakaba <wakaba@suika.fam.cx>
2027
2028 * IMTChecker.pm: Report warning for unregistered
2029 and private types/subtypes.
2030
2031 * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
2032 URIChecker.pm, HTMLTable.pm: Error messages are now
2033 consistent; they are all listed in
2034 <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
2035
2036 2007-06-25 Wakaba <wakaba@suika.fam.cx>
2037
2038 * ContentChecker.pm: |<img ismap>| not in |<a></a>|
2039 is now erred. |<datalist>| is implemented.
2040 Attribute checker for |<command>| and |<menu>| are
2041 added. Support for |contextmenu| global attribute
2042 is added.
2043
2044 2007-06-25 Wakaba <wakaba@suika.fam.cx>
2045
2046 * HTML.pm.src (_reset_insertion_mode): Interpretation
2047 of Step 3 has been changed.
2048
2049 2007-06-25 Wakaba <wakaba@suika.fam.cx>
2050
2051 * HTML.pm.src: Late |<html>| parse error is implemented.
2052
2053 2007-06-24 Wakaba <wakaba@suika.fam.cx>
2054
2055 * URIChecker.pm (check_iri_reference): A |decode| method name was
2056 incorrect.
2057
2058 * ContentChecker.pm: Support for the |footer| element.
2059 Check URI syntax for space-separated URI attributes.
2060 Support for the |tabindex| attribute. Support
2061 for |datetime| attribute.
2062
2063 2007-06-24 Wakaba <wakaba@suika.fam.cx>
2064
2065 * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
2066 character references). HTML5 revision 1.146 (white space
2067 characters before root start tag). HTML5 revision
2068 1.148 (named character references in attribute values).
2069 HTML5 revision 1.152 (<plaintext>.innerHTML get).
2070
2071 2007-06-24 Wakaba <wakaba@suika.fam.cx>
2072
2073 * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
2074 in <head>).
2075
2076 2007-06-24 Wakaba <wakaba@suika.fam.cx>
2077
2078 * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>
2079 in body).
2080
2081 * ContentChecker.pm: HTML5 revision 938 (scoped="").
2082
2083 2007-06-24 Wakaba <wakaba@suika.fam.cx>
2084
2085 * HTML.pm.src: HTML5 revision 923 (matching end tag in CDATA
2086 or RCDATA in fragment parsing mode). HTML5 revision
2087 924 (<!--> and <!--->). HTML5 revision 926 (hn in hn).
2088
2089 2007-06-23 Wakaba <wakaba@suika.fam.cx>
2090
2091 * HTML.pm.src (get_inner_html): HTML5 revision 922 (inner_html
2092 for <pre> and <textarea>).
2093
2094 2007-06-23 Wakaba <wakaba@suika.fam.cx>
2095
2096 * HTML.pm.src: HTML5 revision 920 (<isindex>).
2097
2098 2007-06-23 Wakaba <wakaba@suika.fam.cx>
2099
2100 * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).
2101
2102 2007-06-23 Wakaba <wakaba@suika.fam.cx>
2103
2104 * HTML.pm.src: HTML5 revision 916 (</body>).
2105 HTML5 revision 917 (conforming bare &).
2106
2107 2007-06-23 Wakaba <wakaba@suika.fam.cx>
2108
2109 * NanoDOM.pm (manakai_is_html): Setting to false did not work.
2110
2111 * HTML.pm.src: HTML5 revision 914 (</ in CDATA, RCDATA).
2112 HTML5 revision 915 (<nobr>).
2113
2114 2007-06-23 Wakaba <wakaba@suika.fam.cx>
2115
2116 * HTML.pm.src: HTML5 revisions 908, 909, 912, and 913 (quirks mode).
2117
2118 * NanoDOM.pm (manakai_is_html, manakai_compat_mode, compat_mode):
2119 New attributes.
2120
2121 2007-06-23 Wakaba <wakaba@suika.fam.cx>
2122
2123 * HTML.pm.src: HTML5 revisions 900, 901, 902, and 911 (<
2124 in tags).
2125
2126 2007-06-23 Wakaba <wakaba@suika.fam.cx>
2127
2128 * .cvsignore: |Entities.html| is added.
2129
2130 * HTML.pm.src: |$entity_char| is removed and
2131 requires |Whatpm::_NamedEntityList| instead.
2132 HTML5 revision 898 (refc), except that lack of refc
2133 is parse error.
2134
2135 * mkentitylist.pl: New script.
2136
2137 * Makefile (all): |_NamedEntityList.pm| is added.
2138 (_NamedEntityList.pm, Entities.html): New rules.
2139
2140 2007-06-23 Wakaba <wakaba@suika.fam.cx>
2141
2142 * HTML.pm.src: Parse errors immediately after U+000D
2143 were ignored and U+000D immediately following another
2144 U+000D was not converted to U+000A.
2145
2146 2007-06-23 Wakaba <wakaba@suika.fam.cx>
2147
2148 * HTML.pm.src (set_inner_html): HTML5 revision 892 (adopt
2149 nodes before appended). Parser was not ready for NULL
2150 parse error and escape flag.
2151
2152 * NanoDOM.pm (adopt_node): New.
2153
2154 2007-06-23 Wakaba <wakaba@suika.fam.cx>
2155
2156 * HTML.pm.src: HTML5 revision 886 (insane comment in
2157 CDATA and RCDATA). Note that current implementation
2158 is simply repeating what the spec says and it is maybe not
2159 a best way to do it.
2160
2161 2007-06-23 Wakaba <wakaba@suika.fam.cx>
2162
2163 * HTML.pm.src: HTML5 revision 884 (</form> don't close
2164 the form element if a descendant element without implied end tag has
2165 still been open).
2166
2167 2007-06-23 Wakaba <wakaba@suika.fam.cx>
2168
2169 * ContentChecker.pm: HTML5 revision 881 (Make |id| attribute with
2170 space characters non-conforming).
2171
2172 2007-06-23 Wakaba <wakaba@suika.fam.cx>
2173
2174 * HTML.pm.src: An error message was incorrect.
2175 HTML5 revision 869 (C1 character references).
2176
2177 2007-06-23 Wakaba <wakaba@suika.fam.cx>
2178
2179 * HTML.pm.src: HTML5 revision 867 (a LF at the beginning of
2180 a |textarea| is removed).
2181
2182 2007-06-05 Wakaba <wakaba@suika.fam.cx>
2183
2184 * NanoDOM.pm (get_attribute_node_ns): New method.
2185
2186 * ContentChecker.pm: |script| |async| and |defer|
2187 no longer require |src|. |async| MUST NOT be
2188 specified if |defer|. (HTML5 revision 858).
2189
2190 2007-05-30 Wakaba <wakaba@suika.fam.cx>
2191
2192 * HTML.pm.src: |<form><form>| went to inifinite loop.
2193
2194 2007-05-27 Wakaba <wakaba@suika.fam.cx>
2195
2196 * ContentChecker.pm (html): Set |is_root| (allowed
2197 as a document element) flag on.
2198 (new): Removed.
2199 (check_document): New method.
2200
2201 2007-05-27 Wakaba <wakaba@suika.fam.cx>
2202
2203 * ContentChecker.pm (thead, tfoot): Checker specifications
2204 were incorrect.
2205
2206 2007-05-27 Wakaba <wakaba@suika.fam.cx>
2207
2208 * ContentChecker.pm ($HTMLURIAttrChecker): Include
2209 error position in the |type| option of the error.
2210
2211 * HTMLTable.pm (form_table): The |$onerror| parameter
2212 is now optional. Some bugs are fixed.
2213
2214 2007-05-27 Wakaba <wakaba@suika.fam.cx>
2215
2216 * HTMLTable.pm: New module.
2217
2218 * ContentChecker.pm (table): Invoke table model error checker.
2219
2220 * NanoDOM.pm (first_child, get_attribute_ns): New.
2221
2222 2007-05-26 Wakaba <wakaba@suika.fam.cx>
2223
2224 * ContentChecker.pm ($HTMLLinkTypesAttrChecker): New checker.
2225 (link/@rel, a/@rel, area/@rel): Use new checker.
2226
2227 * Makefile (_LinkTypeList.pm, RelExtensions.html): New rules.
2228
2229 * _LinkTypeList.pm: New file.
2230
2231 * mklinktypelist.pl: New file.
2232
2233 * .cvsignore: |RelExtensions.html| added.
2234
2235 * NanoDOM.pm (child_nodes): Returns an empty array
2236 for non-child-containing node types.
2237 (text_content): New attribute.
2238
2239 2007-05-26 Wakaba <wakaba@suika.fam.cx>
2240
2241 * IMTChecker.pm: New module.
2242
2243 * ContentChecker.pm ($HTMLIMTAttrChecker): Call IMTChecker
2244 to test parameter value validity.
2245
2246 * HTML.pm.src ($style_start_tag): Attributes were
2247 discarded.
2248
2249 2007-05-25 Wakaba <wakaba@suika.fam.cx>
2250
2251 * ContentChecker.pm ($HTMLURIAttrChecker): Implemented.
2252
2253 2007-05-25 Wakaba <wakaba@suika.fam.cx>
2254
2255 * URIChecker.pm: All recommendations from RFC 3986
2256 and RFC 3987 are listed (not all testable items are checked yet).
2257
2258 2007-05-25 Wakaba <wakaba@suika.fam.cx>
2259
2260 * URIChecker.pm: New module.
2261
2262 2007-05-20 Wakaba <wakaba@suika.fam.cx>
2263
2264 * ContentChecker.pm: Now most attributes are implemented
2265 or associated to some placeholder.
2266 ($ElementDefault): Warn unknown attributes
2267 for unknown elements as "attribute not supported".
2268 ($HTMLLanguageTagAttrChecker, $HTMLMQAttrChecker): New placeholders.
2269 ($HTMLUsemapAttrChecker, $HTMLTargetAttrChecker): New checkers.
2270 (|a| attribute checker): Reimplemented.
2271
2272 2007-05-20 Wakaba <wakaba@suika.fam.cx>
2273
2274 * ContentChecker.pm ($HTMLEventHandlerAttrChecker): New placeholder.
2275 ($HTMLAttrChecker): Event handler content attributes
2276 are added.
2277 (link, embed): Required attribute is now checked.
2278 (embed): Unknown local attributes are no longer warned.
2279
2280 2007-05-20 Wakaba <wakaba@suika.fam.cx>
2281
2282 * ContentChecker.pm ($HTMLSpaceURIsAttrChecker): New placeholder.
2283 ($HTMLIMTAttrChecker): New checker.
2284 (link@rel, link@href, link@type, style@type,
2285 a@href, a@ping, a@ping, a@type, embed@src, embed@type,
2286 object@data, object@type, source@src, source@type, area@alt,
2287 area@shape, area@coords, area@href,
2288 area@ping, area@rel, area@type, script@src,
2289 script@defer, script@async, script@type): Checkers added.
2290
2291 2007-05-20 Wakaba <wakaba@suika.fam.cx>
2292
2293 * ContentChecker.pm: Descendant checking was incorrect.
2294
2295 2007-05-19 Wakaba <wakaba@suika.fam.cx>
2296
2297 * ContentChecker.pm: Support |xml:*| and |xmlns:*|
2298 attributes. Report an error if |Element.prefix|
2299 is |xmlns|.
2300
2301 * NanoDOM.pm (prefix): New attribute.
2302
2303 2007-05-19 Wakaba <wakaba@suika.fam.cx>
2304
2305 * HTML.pm.src: In |main| phase, |in body| insertion
2306 mode, action for |<iframe>| was missing.
2307
2308 2007-05-19 Wakaba <wakaba@suika.fam.cx>
2309
2310 * ContentChecker.pm: Support for many of HTML5 elements.
2311 ($GetHTMLNonNegativeIntegerAttrChecker): New.
2312
2313 2007-05-19 Wakaba <wakaba@suika.fam.cx>
2314
2315 * ContentChecker.pm: Support for most elements up to |progress|.
2316 ($HTMLURIAttrChecker): Placeholder.
2317 ($HTMLIntegerAttrChecker, $GetHTMLFloatingPointNumberAttrChecker):
2318 New.
2319
2320 2007-05-19 Wakaba <wakaba@suika.fam.cx>
2321
2322 * ContentChecker.pm: Attribute checkers
2323 for global attributes, |html|, |base|, |style|, and |meta|.
2324
2325 * NanoDOM.pm (insert_before): Weaken reference
2326 to the parent node.
2327 (Attr::new): Set |owner_element| attribute.
2328 (namespace_uri, manakai_local_name): New attribute implementations.
2329 (owner_element): New attribute.
2330
2331 2007-05-19 Wakaba <wakaba@suika.fam.cx>
2332
2333 * ContentChecker.pm ($AttrChecker, $HTMLAttrChecker,
2334 $AnyChecker->{attr_checker}, $HTMLAttrsChecker,
2335 $Element->{$HTML_NS}->{''}): New.
2336 (check_element): Invoke attrs_checker for each element.
2337
2338 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2339
2340 * ContentChecker.pm: Don't use |manakai_element_type_match|.
2341
2342 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2343
2344 * ContentChecker.pm: Use hashs rather than lists for
2345 element type testings.
2346
2347 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2348
2349 * ContentChecker.pm: Don't generate duplicate
2350 error when an element type is put in the "minus" list
2351 and the element type is not allowed explicitly in the particular
2352 element content model.
2353 (html:a checker): New checker.
2354 (html:details, html:datagrid): New checkers.
2355 (html:legend): New checker.
2356
2357 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2358
2359 * ContentChecker.pm (html:li checker): Implemented.
2360
2361 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2362
2363 * ContentChecker.pm ($HTMLInlineOrStriclyInlineChecker): New
2364 checker.
2365 (html:dd checker): New checker.
2366 (html:q, html:em, html:strong, html:small,
2367 html:m, html:dfn, html:code, html:samp, html:span): New checkers.
2368
2369 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2370
2371 * ContentChecker.pm ($AnyChecker): Renamed
2372 from |$ElementDefault->{checker}|.
2373 ($ElementDefault->{checker}): Throw an error that
2374 the element type is not supported by the checker.
2375 ($HTMLMetadataElement): |html:base| was missing.
2376 ($HTMLEmptyChecker): Don't throw an error
2377 for inter-element whitespace nodes.
2378 (html:html checker): Errors were not
2379 thrown even if |html:head| and/or |html:body|
2380 children were missing.
2381 (html:head checker): An error was not
2382 thrown if <meta charset> appered after other
2383 elements.
2384
2385 2007-05-05 Wakaba <wakaba@suika.fam.cx>
2386
2387 * ContentChecker.pm: |footer|, |video|, |audio|, |script|,
2388 and |noscript| elements are implemented.
2389 (new): New method.
2390
2391 2007-05-04 Wakaba <wakaba@suika.fam.cx>
2392
2393 * ContentChecker.pm: New module.
2394
2395 2007-05-04 Wakaba <wakaba@suika.fam.cx>
2396
2397 * NanoDOM.pm (manakai_parent_element,
2398 document_element, manakai_local_name,
2399 manakai_element_type_match): New method.
2400
2401 2007-05-03 Wakaba <wakaba@suika.fam.cx>
2402
2403 * HTML.pm.src: Replace decimal and hexadecimal numeric
2404 entities in C1 range using Windows-1252 mapping. Bare LF
2405 did not count as new line for error reporting.
2406
2407 2007-05-02 Wakaba <wakaba@suika.fam.cx>
2408
2409 * NanoDOM.pm (DOMImplementation): New class.
2410 (append_child): Weaken the |parent_node| reference.
2411 (create_element_ns, Element new): Set the |owner_document|
2412 reference.
2413 (implementation): New attribute.
2414 (owner_document, local_name, namespace_uri): New attributes.
2415
2416 * HTML.pm.src (parse_string): Line and column numbers
2417 are now provided to error handler.
2418 (!!!parse-error): Short descriptions are added.
2419 (_construct_tree): Split into three methods; support
2420 for innerHTML mode.
2421 (set_inner_html): New method.
2422
2423 2007-05-01 Wakaba <wakaba@suika.fam.cx>
2424
2425 * NanoDOM.html: Documentation is added.
2426
2427 * HTML.pod, ContentType.html: Documentation is revised.
2428
2429 * .cvsignore: Pod2html temporary files are added.
2430
2431 * Makefile: Make |NanoDOM.html|.
2432
2433 2007-05-01 Wakaba <wakaba@suika.fam.cx>
2434
2435 * HTML.pm.src (parse_string): New method.
2436 (get_inner_html): Renamed from |inner_html|.
2437
2438 * Makefile: A rule for |HTML.html| is added.
2439
2440 * HTML.pod: New documentation.
2441
2442 2007-05-01 Wakaba <wakaba@suika.fam.cx>
2443
2444 * NanoDOM.pm (last_child, previous_sibling): New attributes.
2445 (clone_node): Attribute nodes were not completely copied.
2446
2447 * HTML.pm.src: Many bugs are fixed.
2448
2449 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2450
2451 * HTML.pm.src: Some typos are fixed.
2452
2453 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2454
2455 * mkhtmlparser.pl, Makefile: References to the |HTML-consume-entity.src|
2456 are removed.
2457
2458 * HTML.pm.src: Tokenizer's handling on named entities are rewritten.
2459
2460 * HTML-consume-entity.src: Removed.
2461
2462 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2463
2464 * HTML.pm.src: Tokenizer's handling on hexadecimal
2465 numeric entities are rewritten.
2466
2467 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2468
2469 * HTML.pm.src: Some tokenizer bugs are fixed.
2470
2471 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2472
2473 * HTML.pm.src: The tree construction stage is implemented.
2474
2475 * mkhtmlparser.pl: New macros are added.
2476
2477 2007-04-28 Wakaba <wakaba@suika.fam.cx>
2478
2479 * ContentType.pm: A note on bug in the specification
2480 is removed since it's been now fixed.
2481
2482 * .cvsignore: New file.
2483
2484 2007-04-28 Wakaba <wakaba@suika.fam.cx>
2485
2486 * HTML.pm.src, HTML-consume-entity.src: New files.
2487
2488 * Makefile (HTML.pm): New rule.
2489
2490 * mkhtmlparser.pl: New script.
2491
2492 2007-04-25 Wakaba <wakaba@suika.fam.cx>
2493
2494 * Makefile: New file.
2495
2496 2007-04-24 Wakaba <wakaba@suika.fam.cx>
2497
2498 * ContentType.pm: An error in pod is fixed.
2499
2500 2007-04-24 Wakaba <wakaba@suika.fam.cx>
2501
2502 * ContentType.pm: Documentation is added.
2503
2504 2007-04-24 Wakaba <wakaba@suika.fam.cx>
2505
2506 * ContentType.pm: New Perl module.
2507
2508 * ChangeLog: New file.
2509
2510

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24