/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Contents of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.183 - (show annotations) (download)
Mon Mar 10 10:55:56 2008 UTC (17 years, 4 months ago) by wakaba
Branch: MAIN
Changes since 1.182: +6 -0 lines
++ whatpm/Whatpm/ChangeLog	10 Mar 2008 10:55:36 -0000
2008-03-10  Wakaba  <wakaba@suika.fam.cx>

	* mkhtmlparser.pl: Set "level" parameter to parse errors.

	* HTML.pm.src: Code refined.

1 2008-03-10 Wakaba <wakaba@suika.fam.cx>
2
3 * mkhtmlparser.pl: Set "level" parameter to parse errors.
4
5 * HTML.pm.src: Code refined.
6
7 2008-03-09 Wakaba <wakaba@suika.fam.cx>
8
9 * HTML.pm.src: |</body>| treatement has been changed (HTML5
10 revision 1348). Note that I really don't know this makes
11 any difference in the black-box behavior of the parser.
12
13 2008-03-09 Wakaba <wakaba@suika.fam.cx>
14
15 * HTML.pm.src: New end-of-file token implementation (HTML5
16 revision 1348).
17
18 2008-03-09 Wakaba <wakaba@suika.fam.cx>
19
20 * HTML.pm.src: |applet| support (HTML5 revision 1347).
21
22 2008-03-09 Wakaba <wakaba@suika.fam.cx>
23
24 * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
25
26 2008-03-09 Wakaba <wakaba@suika.fam.cx>
27
28 * HTML.pm.src: Support for |<input>| in the "in select" insertion
29 mode and support for the "in select in table" insertion mode (HTML5
30 revision 1342).
31
32 2008-03-09 Wakaba <wakaba@suika.fam.cx>
33
34 * HTML.pm.src: No foster parenting for <script> and <script>
35 in non-tainted <table>s (HTML5 revision 1336).
36
37 2008-03-09 Wakaba <wakaba@suika.fam.cx>
38
39 * HTML.pm.src: Ignore white space characters between <html>
40 and <head> (HTML5 revision 1332).
41
42 2008-03-09 Wakaba <wakaba@suika.fam.cx>
43
44 * HTML.pm.src: Treat <input type=hidden> as if it were a
45 white space (HTML5 revision 1331).
46
47 2008-03-08 Wakaba <wakaba@suika.fam.cx>
48
49 * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
50 element (HTML5 revision 1330).
51
52 2008-03-08 Wakaba <wakaba@suika.fam.cx>
53
54 * HTML.pm.src: <title> is always appended to the current
55 element (HTML5 revision 1328).
56
57 2008-03-08 Wakaba <wakaba@suika.fam.cx>
58
59 * HTML.pm.src: White space in tainted tables are moved
60 into foster parents (HTML5 revision 1326).
61
62 2008-03-08 Wakaba <wakaba@suika.fam.cx>
63
64 * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
65 revision 1321).
66
67 2008-03-08 Wakaba <wakaba@suika.fam.cx>
68
69 * HTML.pm.src: |</h/n/>| case code rearranged to align with
70 the spec (HTML5 revision 1320). Note that we finally complete
71 all of HTML5 revision 1320 changes.
72
73 2008-03-08 Wakaba <wakaba@suika.fam.cx>
74
75 * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
76 tags (HTML5 revision 1320).
77
78 2008-03-08 Wakaba <wakaba@suika.fam.cx>
79
80 * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
81
82 2008-03-08 Wakaba <wakaba@suika.fam.cx>
83
84 * HTML.pm.src: A "generate implied end tags" code (t409.1)
85 could not be reached so that it is now removed (HTML5 revision
86 1320).
87
88 2008-03-08 Wakaba <wakaba@suika.fam.cx>
89
90 * HTML.pm.src: Code for the case of |</div>| and so on are revised
91 to align with new spec text (HTML5 revision 1320).
92
93 2008-03-08 Wakaba <wakaba@suika.fam.cx>
94
95 * HTML.pm.src: Remove strange |if| condition; however, it should
96 have had no harm in theory.
97
98 2008-03-08 Wakaba <wakaba@suika.fam.cx>
99
100 * HTML.pm.src (_tree_construction_main): '</p>' in body
101 case is split from other end tags for the preparation
102 of implementing HTML revision 1320.
103
104 2008-03-07 Wakaba <wakaba@suika.fam.cx>
105
106 * HTML.pm.src: Simplified "generate implied end tag" (HTML5
107 revision 1320).
108
109 2008-03-07 Wakaba <wakaba@suika.fam.cx>
110
111 * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
112 and "div" (HTML5 revision 1318). Add comments to where
113 |form| pointer association codes should be inserted (HTML5 revision
114 1319).
115
116 2008-03-06 Wakaba <wakaba@suika.fam.cx>
117
118 * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
119
120 2008-03-05 Wakaba <wakaba@suika.fam.cx>
121
122 * HTML.pm.src: Since the case t268 should never be reached (no
123 other token type, there are), it is replaced by a |die| statement.
124
125 2008-03-05 Wakaba <wakaba@suika.fam.cx>
126
127 * HTML.pm.src: Typo fixed.
128
129 2008-03-04 Wakaba <wakaba@suika.fam.cx>
130
131 * HTML.pm.src (_tree_construction_initial): Some limited quirks
132 doctypes were not uppercased for comparison.
133
134 2008-03-03 Wakaba <wakaba@suika.fam.cx>
135
136 * HTML.pm.src (tree construction and set_inner_html): Checkpoints
137 are added.
138
139 2008-03-03 Wakaba <wakaba@suika.fam.cx>
140
141 * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
142 are set. Cases that are unlikely reached are noted as so.
143
144 2008-03-03 Wakaba <wakaba@suika.fam.cx>
145
146 * HTML.pm.src: Checkpoints for debugging are added.
147
148 * mkhtmlparser.pl: Support for |!!!cp| syntax.
149
150 2008-03-03 Wakaba <wakaba@suika.fam.cx>
151
152 * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
153 for simplicity.
154
155 2008-03-03 Wakaba <wakaba@suika.fam.cx>
156
157 * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
158 revision 1307).
159
160 2008-03-03 Wakaba <wakaba@suika.fam.cx>
161
162 * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
163 is set are changed (HTML5 revision 1305).
164
165 2008-03-02 Wakaba <wakaba@suika.fam.cx>
166
167 * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
168 1292). Entities are not parsed in comment-like part in RCDATA
169 elements (HTML5 revision 1294). Allow bare & at the end
170 of attribute value literals (HTML5 revision 1296). More
171 quirks mode doctypes (HTML5 revision 1302). Requires spaces
172 between attributes and ban attribute names or unquoted
173 attribute values containing single or double quotes (HTML5
174 revision 1303).
175
176 2008-03-02 Wakaba <wakaba@suika.fam.cx>
177
178 * ContentChecker.pm: Typo fixed. Don't raise "character encoding"
179 and related errors unless it is an HTML document (though the spec
180 is unclear on whether it is applied to XHTML document).
181
182 * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
183 are added.
184
185 2008-03-02 Wakaba <wakaba@suika.fam.cx>
186
187 * HTML.pm.src: s/local_name/manakai_local_name/g.
188
189 2008-03-01 Wakaba <wakaba@suika.fam.cx>
190
191 * _NamedEntityList.pm: Updated (HTML5 revision 1286).
192
193 * HTML.pm.src: |charset| in |content| attribute is
194 case-insensitive (HTML5 revision 1270).
195
196 2008-02-26 Wakaba <wakaba@suika.fam.cx>
197
198 * ContentChecker.pm: New status constants are added.
199 ($ElementDefault): |status| added.
200 (check_element): Err for non-standard or deprecated elements.
201 (_attr_status_info): For non-standard or deprecated attributes.
202
203 2008-02-24 Wakaba <wakaba@suika.fam.cx>
204
205 * ContentChecker.pm (_attr_status_info): New internal method.
206
207 2008-02-24 Wakaba <wakaba@suika.fam.cx>
208
209 * ContentChecker.pm (check_element): Element standardized
210 status information is now dispatched.
211
212 2008-02-24 Wakaba <wakaba@suika.fam.cx>
213
214 * ContentChecker.pm (check_element): Fix |del|-and-significant
215 problem by adding some more arguments.
216
217 2008-02-24 Wakaba <wakaba@suika.fam.cx>
218
219 * ContentChecker.pm (check_element): Use context of
220 container-for-the-purpose-of-content-model element (not transparent
221 element) for |check_child_element| calling and significant
222 text flag marking. This reintroduces |<del>|-and-significant
223 problem again.
224
225 2008-02-24 Wakaba <wakaba@suika.fam.cx>
226
227 * ContentChecker.pm (check_element): Make semi-transparent
228 elements ignored for the purpose of phase changes in
229 content model checking.
230
231 2008-02-23 Wakaba <wakaba@suika.fam.cx>
232
233 * ContentChecker.pm (check_element): In-element state
234 was not properly managed for transparent cases.
235
236 2008-02-23 Wakaba <wakaba@suika.fam.cx>
237
238 * ContentChecker.pm (check_element): Support for |video|
239 and |audio| as semi-transparent elements.
240
241 2008-02-23 Wakaba <wakaba@suika.fam.cx>
242
243 * ContentChecker.pm ($HTMLSemiTransparentElements): New.
244 (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
245 Support for |html:object| as a semi-transparent element.
246
247 2008-02-23 Wakaba <wakaba@suika.fam.cx>
248
249 * ContentChecker.pm (check_element): The way to traverse
250 the tree is entirely revised to make it easier to track
251 the state of ancestors/descendants. As a result of this
252 revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
253 support for content model checking for HTML elements |figure|,
254 |object|, |video|, and |audio| and checking for XML elements (and
255 some XMLNS checkings) are dropped for now. They will be
256 reimplemented in due cource.
257
258 2008-02-17 Wakaba <wakaba@suika.fam.cx>
259
260 * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
261 token (HTML5 revision 1225).
262
263 2008-02-17 Wakaba <wakaba@suika.fam.cx>
264
265 * ContentChecker.pm ({unsupported_level}): New value.
266
267 * HTML.pm.src: Save whether |meta| |content| attribute
268 contains character references or not.
269
270 2008-02-17 Wakaba <wakaba@suika.fam.cx>
271
272 * ContentChecker.pm (_get_children): (Incompleted) attempt to
273 imlement significant content checking for contents
274 with |del| elements.
275
276 2008-02-17 Wakaba <wakaba@suika.fam.cx>
277
278 * ContenteChecker.pm ($HTMLTransparentElements): More
279 elements are added.
280 (_get_children): HTML |object| elements are now semi-transparent.
281
282 * NanoDOM.pm (manakai_html, manakai_head): New methods.
283
284 2008-02-16 Wakaba <wakaba@suika.fam.cx>
285
286 * CacheManifest.pm: HTML5 revision 1211 implemented.
287
288 * CacheManifest.pod: Updated.
289
290 2008-02-10 Wakaba <wakaba@suika.fam.cx>
291
292 * ContentChecker.pm (check_document, check_element): Support
293 for second argument ($onsubdoc).
294 (_get_css_parser): Removed (now it is part of WDCC).
295
296 2008-02-09 Wakaba <wakaba@suika.fam.cx>
297
298 * ContentChecker.pm (_get_css_parser): New.
299
300 2007-11-25 Wakaba <wakaba@suika.fam.cx>
301
302 * ContentChecker.pm ($AnyChecker): Old way to add child elements
303 for checking had been used.
304
305 2007-11-25 Wakaba <wakaba@suika.fam.cx>
306
307 * ContentChecker.pm (check_element): New todo item type |descendant|.
308
309 2007-11-23 Wakaba <wakaba@suika.fam.cx>
310
311 * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
312 specced in RFC 2046 and RFC 4288.
313 (application/atom+xml): Definition added.
314
315 2007-11-23 Wakaba <wakaba@suika.fam.cx>
316
317 * URIChecker.pm: Make RFC 3986 should-level errors
318 warnings (rather than SHOULD-level errors).
319
320 2007-11-23 Wakaba <wakaba@suika.fam.cx>
321
322 * NanoDOM.pm (get_user_data, set_user_data): New methods.
323
324 * HTML.pm.src: A flag for character references in attribute
325 values are added. Set |manakai_has_reference| user data
326 to |charset| attribute.
327
328 2007-11-23 Wakaba <wakaba@suika.fam.cx>
329
330 * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
331 attributes.
332
333 * ContentChecker.pm (check_document): Warn if charset requirements
334 cannot be tested.
335
336 2007-11-19 Wakaba <wakaba@suika.fam.cx>
337
338 * HTML.pm.src (parse_byte_string): Detect charset
339 by universalchardet if charset parameter is not specified.
340
341 * Makefile (Charset-all, Charset-clean): New rules.
342
343 2007-11-18 Wakaba <wakaba@suika.fam.cx>
344
345 * ContentChecker.pm (check_document): Check the existence
346 of character encoding declaration and selection of encoding
347 for HTML document.
348
349 2007-11-18 Wakaba <wakaba@suika.fam.cx>
350
351 * ContentType.pm (get_sniffed_type): Return also the official
352 type in list context.
353
354 2007-11-18 Wakaba <wakaba@suika.fam.cx>
355
356 * ContentType.pm: Sniffing with leading white space
357 ignoring (HTML5 revisions 1013 and 1016).
358
359 2007-11-18 Wakaba <wakaba@suika.fam.cx>
360
361 * ContentType.pm: HTML5 revision 1013 changes, except for leading
362 white spaces, are implemented.
363
364 2007-11-11 Wakaba <wakaba@suika.fam.cx>
365
366 * HTML.pm.src (parse_char_string): Set |inner_encoding|
367 attribute if possible.
368
369 2007-11-11 Wakaba <wakaba@suika.fam.cx>
370
371 * HTML.pm.src (parse_byte_string): New method.
372 (parse_char_string): New alias for |parse_string|.
373 (main phase): Invoking "change the encoding" algorithm if desired.
374
375 * HTML.pod: Updated.
376
377 2007-11-11 Wakaba <wakaba@suika.fam.cx>
378
379 * HTML.pod (get_inner_html): Removed.
380
381 * Makefile (HTML-all, HTML-clean): New.
382
383 2007-11-11 Wakaba <wakaba@suika.fam.cx>
384
385 * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
386
387 2007-11-08 Wakaba <wakaba@suika.fam.cx>
388
389 * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
390
391 2007-11-04 Wakaba <wakaba@suika.fam.cx>
392
393 * Makefile: |CacheManifest.html| is added.
394
395 * CacheManifest.pod: New file.
396
397 2007-11-04 Wakaba <wakaba@suika.fam.cx>
398
399 * CacheManifest.pm: New module.
400
401 2007-11-04 Wakaba <wakaba@suika.fam.cx>
402
403 * HTML.pm.src: Support for application cache selection algorithm
404 callback.
405
406 2007-11-04 Wakaba <wakaba@suika.fam.cx>
407
408 * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
409
410 2007-10-17 Wakaba <wakaba@suika.fam.cx>
411
412 * Makefile (clean): New rule.
413
414 * NanoDOM.pm (public_id, system_id): New attributes.
415
416 2007-10-17 Wakaba <wakaba@suika.fam.cx>
417
418 * Makefile (CSS-all, CSS-clean, clean): New rules.
419
420 2007-10-14 Wakaba <wakaba@suika.fam.cx>
421
422 * ContentChecker.pm (check_document): Support for
423 new |is_xml_root| flag.
424 (check_element): Support for new |pluses| state.
425 (_add_pluses): New method.
426 (_remove_minuses): Support for new |minus| item.
427
428 2007-09-24 Wakaba <wakaba@suika.fam.cx>
429
430 * ContentChecker.pm: Raise specific error for invalid
431 root element.
432
433 2007-09-24 Wakaba <wakaba@suika.fam.cx>
434
435 * ContentChecker.pm: Set level values for later uses.
436
437 2007-09-09 Wakaba <wakaba@suika.fam.cx>
438
439 * ContentChecker.pm: Support for language tag validation.
440
441 2007-09-09 Wakaba <wakaba@suika.fam.cx>
442
443 * LangTag.pm (check_rfc3066_language_tag): New method.
444
445 2007-09-09 Wakaba <wakaba@suika.fam.cx>
446
447 * LangTag.pm: New module.
448
449 2007-09-04 Wakaba <wakaba@suika.fam.cx>
450
451 * HTML.pm.src: Some error types were wrong.
452
453 2007-08-17 Wakaba <wakaba@suika.fam.cx>
454
455 * CSS/: New directory.
456
457 2007-08-17 Wakaba <wakaba@suika.fam.cx>
458
459 * ContentChecker.pm (_check_get_children): Support
460 for |noscript| in |head|.
461
462 2007-08-12 Wakaba <wakaba@suika.fam.cx>
463
464 * URI/: New directory.
465
466 2007-08-11 Wakaba <wakaba@suika.fam.cx>
467
468 * HTML.pm.src: Tokenizer's states are now represented in
469 number.
470
471 2007-08-11 Wakaba <wakaba@suika.fam.cx>
472
473 * HTML.pm.src: |or|s for insertion modes are replaced
474 by |&|s.
475
476 2007-08-11 Wakaba <wakaba@suika.fam.cx>
477
478 * HTML.pm.src: Token types are now represented in number.
479
480 2007-08-11 Wakaba <wakaba@suika.fam.cx>
481
482 * ContentType.pm (SEE ALSO): Updated.
483
484 * HTML.pm.src: Insertion modes are now represented in number.
485
486 2007-08-11 Wakaba <wakaba@suika.fam.cx>
487
488 * ContentType.pm: Sniffing for bitmap images (HTML5 revision
489 999) is implemented.
490
491 2007-08-08 Wakaba <wakaba@suika.fam.cx>
492
493 * ContentType.pm: Sniffing for |<script| (HTML5 revision
494 983) is implemented.
495
496 2007-08-06 Wakaba <wakaba@suika.fam.cx>
497
498 * ContentChecker.pod: New documentation.
499
500 * Makefile: A rule for |ContentChecker.html| is added.
501
502 * ContentChecker.pm: A pod "LICENSE" section is added.
503
504 * NanoDOM.pm ($VERSION): New variable.
505
506 2007-08-05 Wakaba <wakaba@suika.fam.cx>
507
508 * H2H.pm: |b|, |i|, and |sub| are added to the
509 list of allowed HTML elements.
510
511 2007-08-05 Wakaba <wakaba@suika.fam.cx>
512
513 * H2H.pm: |samp| is added to the list of allowed
514 HTML elements.
515
516 * URIChecker.pm (check_iri): New.
517 (check_iri_reference): Error type for IRI reference
518 syntax error is changed.
519
520 2007-08-04 Wakaba <wakaba@suika.fam.cx>
521
522 * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
523 is added.
524 (check_document): Load appropriate module before validation.
525
526 2007-08-04 Wakaba <wakaba@suika.fam.cx>
527
528 * ContentChecker/: New directory.
529
530 2007-08-04 Wakaba <wakaba@suika.fam.cx>
531
532 * ContentChecker.pm: HTML |time| element is implemented.
533
534 * HTMLTable.pm: Comments are updated as HTML5 is revised.
535
536 2007-07-21 Wakaba <wakaba@suika.fam.cx>
537
538 * ContentChecker.pm (check_document): Return value
539 even if no document element is found.
540
541 2007-07-21 Wakaba <wakaba@suika.fam.cx>
542
543 * HTML.pm.src: |$in_body| is no longer a function.
544
545 2007-07-21 Wakaba <wakaba@suika.fam.cx>
546
547 * HTML.pm.src: The |$in_body| code has been moved down.
548
549 2007-07-21 Wakaba <wakaba@suika.fam.cx>
550
551 * HTML.pm.src: The "trailing end" insertion mode
552 is split into "after html body" and "after html frameset"
553 insertion modes. Their codes are merged with "after body"
554 and "after frameset" codes. |$previous_insertion_mode|
555 has been removed. "after frameset" code is
556 merged with "in frameset" code.
557
558 2007-07-21 Wakaba <wakaba@suika.fam.cx>
559
560 * HTML.pm.src: The "before head" insertion mode is
561 merged with the "in head" insertion mode.
562
563 2007-07-21 Wakaba <wakaba@suika.fam.cx>
564
565 * HTML.pm.src: Cases in "in head" insertion mode are
566 reorganized.
567
568 2007-07-21 Wakaba <wakaba@suika.fam.cx>
569
570 * HTML.pm.src: Some cases in "in table" insertion mode
571 are merged.
572
573 2007-07-21 Wakaba <wakaba@suika.fam.cx>
574
575 * HTML.pm.src: The "in row" insertion mode is merged
576 with "in table" insertion mode.
577
578 2007-07-21 Wakaba <wakaba@suika.fam.cx>
579
580 * HTML.pm.src: The "in table" and "in table body" insertion
581 modes are merged.
582
583 2007-07-21 Wakaba <wakaba@suika.fam.cx>
584
585 * HTML.pm.src: There is no "in table head" or "in table foot"
586 insertion mode!
587
588 2007-07-21 Wakaba <wakaba@suika.fam.cx>
589
590 * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
591 now directly invoke the handler.
592
593 2007-07-21 Wakaba <wakaba@suika.fam.cx>
594
595 * HTML.pm.src: Codes for "in cell" insertion mode
596 is merged to the "in body" insertion mode code.
597
598 2007-07-21 Wakaba <wakaba@suika.fam.cx>
599
600 * HTML.pm.src: Codes for "in body" and "in caption"
601 insertion modes are merged.
602
603 2007-07-21 Wakaba <wakaba@suika.fam.cx>
604
605 * HTML.pm.src: Two |!!!next-token|s were missing.
606
607 2007-07-21 Wakaba <wakaba@suika.fam.cx>
608
609 * HTML.pm.src: Use numeric constant for |{content_mode}|
610 instead of string constant for |{content_model_flag}|.
611
612 2007-07-21 Wakaba <wakaba@suika.fam.cx>
613
614 * HTML.pm.src: Add the name of the attribute
615 to the "duplicate attribute" error.
616
617 2007-07-17 Wakaba <wakaba@suika.fam.cx>
618
619 * ContentChecker.pm: Return the |class| node list.
620
621 2007-07-17 Wakaba <wakaba@suika.fam.cx>
622
623 * ContentChecker.pm: Return the |id| node list.
624
625 * HTML.pm.src: A typo is fixed.
626
627 2007-07-16 Wakaba <wakaba@suika.fam.cx>
628
629 * ContentChecker.pm: Drop wrong |level => 'error'| specification
630 from "in HTML:xml:lang" error. Character position
631 is now the last part of the error type in the URI error
632 description. Report "unsupported" status for language
633 tags, media queries, script codes, and style sheets.
634
635 2007-07-16 Wakaba <wakaba@suika.fam.cx>
636
637 * ContentChecker.pm: Report error if |xml:lang|
638 in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
639 in XML.
640
641 * NanoDOM.pm (Attr.owner_document): New attribute.
642
643 2007-07-16 Wakaba <wakaba@suika.fam.cx>
644
645 * HTML.pm.src: The character immediately following
646 a bare |hcro| was discarded. Fix handling of
647 entity references in attribute values.
648
649 2007-07-16 Wakaba <wakaba@suika.fam.cx>
650
651 * HTML.pm.src (main and trailin end phases): Token
652 types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
653 factored out. Error types |in html:#DOCTYPE|
654 and |after html:#DOCTYPE| are merged
655 into |DOCTYPE in the middle|. |</frameset>|
656 in fragment parsing mode changed the
657 insertion mode.
658
659 2007-07-16 Wakaba <wakaba@suika.fam.cx>
660
661 * HTML.pm.src: |$phase| has been removed; The |trailing end|
662 phase is now an insertion mode. Treatments for white
663 space character tokens were incorrect for some
664 insertion modes. An old |meta| case was not removed.
665
666 2007-07-16 Wakaba <wakaba@suika.fam.cx>
667
668 * HTML.pm.src: |meta| charset declaration extraction
669 implemented (but changing the encoding is not yet:-).
670
671 2007-07-15 Wakaba <wakaba@suika.fam.cx>
672
673 * Charset/: New directory.
674
675 2007-07-15 Wakaba <wakaba@suika.fam.cx>
676
677 * H2H.pm: New Perl module (created from
678 manakai's H2H.dis).
679
680 2007-07-15 Wakaba <wakaba@suika.fam.cx>
681
682 * XMLSerializer.pm: New Perl module (created from
683 manakai's SimpleLS.dis).
684
685 2007-07-07 Wakaba <wakaba@suika.fam.cx>
686
687 * HTML.pm.src: |<!---x-->| was not processed correctly.
688
689 2007-07-01 Wakaba <wakaba@suika.fam.cx>
690
691 * HTML.pm.src: Report correct error message
692 for |<body></div></body>|.
693
694 2007-07-01 Wakaba <wakaba@suika.fam.cx>
695
696 * HTMLTable.pm: An error description was incorrect.
697
698 2007-06-30 Wakaba <wakaba@suika.fam.cx>
699
700 * ContentChecker.pm: Return |{term}| list.
701
702 2007-06-30 Wakaba <wakaba@suika.fam.cx>
703
704 * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
705 nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
706 and <title> outside of head).
707
708 2007-06-30 Wakaba <wakaba@suika.fam.cx>
709
710 * IMTChecker.pm: Report warning for unregistered
711 and private types/subtypes.
712
713 * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
714 URIChecker.pm, HTMLTable.pm: Error messages are now
715 consistent; they are all listed in
716 <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
717
718 2007-06-25 Wakaba <wakaba@suika.fam.cx>
719
720 * ContentChecker.pm: |<img ismap>| not in |<a></a>|
721 is now erred. |<datalist>| is implemented.
722 Attribute checker for |<command>| and |<menu>| are
723 added. Support for |contextmenu| global attribute
724 is added.
725
726 2007-06-25 Wakaba <wakaba@suika.fam.cx>
727
728 * HTML.pm.src (_reset_insertion_mode): Interpretation
729 of Step 3 has been changed.
730
731 2007-06-25 Wakaba <wakaba@suika.fam.cx>
732
733 * HTML.pm.src: Late |<html>| parse error is implemented.
734
735 2007-06-24 Wakaba <wakaba@suika.fam.cx>
736
737 * URIChecker.pm (check_iri_reference): A |decode| method name was
738 incorrect.
739
740 * ContentChecker.pm: Support for the |footer| element.
741 Check URI syntax for space-separated URI attributes.
742 Support for the |tabindex| attribute. Support
743 for |datetime| attribute.
744
745 2007-06-24 Wakaba <wakaba@suika.fam.cx>
746
747 * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
748 character references). HTML5 revision 1.146 (white space
749 characters before root start tag). HTML5 revision
750 1.148 (named character references in attribute values).
751 HTML5 revision 1.152 (<plaintext>.innerHTML get).
752
753 2007-06-24 Wakaba <wakaba@suika.fam.cx>
754
755 * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
756 in <head>).
757
758 2007-06-24 Wakaba <wakaba@suika.fam.cx>
759
760 * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>
761 in body).
762
763 * ContentChecker.pm: HTML5 revision 938 (scoped="").
764
765 2007-06-24 Wakaba <wakaba@suika.fam.cx>
766
767 * HTML.pm.src: HTML5 revision 923 (matching end tag in CDATA
768 or RCDATA in fragment parsing mode). HTML5 revision
769 924 (<!--> and <!--->). HTML5 revision 926 (hn in hn).
770
771 2007-06-23 Wakaba <wakaba@suika.fam.cx>
772
773 * HTML.pm.src (get_inner_html): HTML5 revision 922 (inner_html
774 for <pre> and <textarea>).
775
776 2007-06-23 Wakaba <wakaba@suika.fam.cx>
777
778 * HTML.pm.src: HTML5 revision 920 (<isindex>).
779
780 2007-06-23 Wakaba <wakaba@suika.fam.cx>
781
782 * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).
783
784 2007-06-23 Wakaba <wakaba@suika.fam.cx>
785
786 * HTML.pm.src: HTML5 revision 916 (</body>).
787 HTML5 revision 917 (conforming bare &).
788
789 2007-06-23 Wakaba <wakaba@suika.fam.cx>
790
791 * NanoDOM.pm (manakai_is_html): Setting to false did not work.
792
793 * HTML.pm.src: HTML5 revision 914 (</ in CDATA, RCDATA).
794 HTML5 revision 915 (<nobr>).
795
796 2007-06-23 Wakaba <wakaba@suika.fam.cx>
797
798 * HTML.pm.src: HTML5 revisions 908, 909, 912, and 913 (quirks mode).
799
800 * NanoDOM.pm (manakai_is_html, manakai_compat_mode, compat_mode):
801 New attributes.
802
803 2007-06-23 Wakaba <wakaba@suika.fam.cx>
804
805 * HTML.pm.src: HTML5 revisions 900, 901, 902, and 911 (<
806 in tags).
807
808 2007-06-23 Wakaba <wakaba@suika.fam.cx>
809
810 * .cvsignore: |Entities.html| is added.
811
812 * HTML.pm.src: |$entity_char| is removed and
813 requires |Whatpm::_NamedEntityList| instead.
814 HTML5 revision 898 (refc), except that lack of refc
815 is parse error.
816
817 * mkentitylist.pl: New script.
818
819 * Makefile (all): |_NamedEntityList.pm| is added.
820 (_NamedEntityList.pm, Entities.html): New rules.
821
822 2007-06-23 Wakaba <wakaba@suika.fam.cx>
823
824 * HTML.pm.src: Parse errors immediately after U+000D
825 were ignored and U+000D immediately following another
826 U+000D was not converted to U+000A.
827
828 2007-06-23 Wakaba <wakaba@suika.fam.cx>
829
830 * HTML.pm.src (set_inner_html): HTML5 revision 892 (adopt
831 nodes before appended). Parser was not ready for NULL
832 parse error and escape flag.
833
834 * NanoDOM.pm (adopt_node): New.
835
836 2007-06-23 Wakaba <wakaba@suika.fam.cx>
837
838 * HTML.pm.src: HTML5 revision 886 (insane comment in
839 CDATA and RCDATA). Note that current implementation
840 is simply repeating what the spec says and it is maybe not
841 a best way to do it.
842
843 2007-06-23 Wakaba <wakaba@suika.fam.cx>
844
845 * HTML.pm.src: HTML5 revision 884 (</form> don't close
846 the form element if a descendant element without implied end tag has
847 still been open).
848
849 2007-06-23 Wakaba <wakaba@suika.fam.cx>
850
851 * ContentChecker.pm: HTML5 revision 881 (Make |id| attribute with
852 space characters non-conforming).
853
854 2007-06-23 Wakaba <wakaba@suika.fam.cx>
855
856 * HTML.pm.src: An error message was incorrect.
857 HTML5 revision 869 (C1 character references).
858
859 2007-06-23 Wakaba <wakaba@suika.fam.cx>
860
861 * HTML.pm.src: HTML5 revision 867 (a LF at the beginning of
862 a |textarea| is removed).
863
864 2007-06-05 Wakaba <wakaba@suika.fam.cx>
865
866 * NanoDOM.pm (get_attribute_node_ns): New method.
867
868 * ContentChecker.pm: |script| |async| and |defer|
869 no longer require |src|. |async| MUST NOT be
870 specified if |defer|. (HTML5 revision 858).
871
872 2007-05-30 Wakaba <wakaba@suika.fam.cx>
873
874 * HTML.pm.src: |<form><form>| went to inifinite loop.
875
876 2007-05-27 Wakaba <wakaba@suika.fam.cx>
877
878 * ContentChecker.pm (html): Set |is_root| (allowed
879 as a document element) flag on.
880 (new): Removed.
881 (check_document): New method.
882
883 2007-05-27 Wakaba <wakaba@suika.fam.cx>
884
885 * ContentChecker.pm (thead, tfoot): Checker specifications
886 were incorrect.
887
888 2007-05-27 Wakaba <wakaba@suika.fam.cx>
889
890 * ContentChecker.pm ($HTMLURIAttrChecker): Include
891 error position in the |type| option of the error.
892
893 * HTMLTable.pm (form_table): The |$onerror| parameter
894 is now optional. Some bugs are fixed.
895
896 2007-05-27 Wakaba <wakaba@suika.fam.cx>
897
898 * HTMLTable.pm: New module.
899
900 * ContentChecker.pm (table): Invoke table model error checker.
901
902 * NanoDOM.pm (first_child, get_attribute_ns): New.
903
904 2007-05-26 Wakaba <wakaba@suika.fam.cx>
905
906 * ContentChecker.pm ($HTMLLinkTypesAttrChecker): New checker.
907 (link/@rel, a/@rel, area/@rel): Use new checker.
908
909 * Makefile (_LinkTypeList.pm, RelExtensions.html): New rules.
910
911 * _LinkTypeList.pm: New file.
912
913 * mklinktypelist.pl: New file.
914
915 * .cvsignore: |RelExtensions.html| added.
916
917 * NanoDOM.pm (child_nodes): Returns an empty array
918 for non-child-containing node types.
919 (text_content): New attribute.
920
921 2007-05-26 Wakaba <wakaba@suika.fam.cx>
922
923 * IMTChecker.pm: New module.
924
925 * ContentChecker.pm ($HTMLIMTAttrChecker): Call IMTChecker
926 to test parameter value validity.
927
928 * HTML.pm.src ($style_start_tag): Attributes were
929 discarded.
930
931 2007-05-25 Wakaba <wakaba@suika.fam.cx>
932
933 * ContentChecker.pm ($HTMLURIAttrChecker): Implemented.
934
935 2007-05-25 Wakaba <wakaba@suika.fam.cx>
936
937 * URIChecker.pm: All recommendations from RFC 3986
938 and RFC 3987 are listed (not all testable items are checked yet).
939
940 2007-05-25 Wakaba <wakaba@suika.fam.cx>
941
942 * URIChecker.pm: New module.
943
944 2007-05-20 Wakaba <wakaba@suika.fam.cx>
945
946 * ContentChecker.pm: Now most attributes are implemented
947 or associated to some placeholder.
948 ($ElementDefault): Warn unknown attributes
949 for unknown elements as "attribute not supported".
950 ($HTMLLanguageTagAttrChecker, $HTMLMQAttrChecker): New placeholders.
951 ($HTMLUsemapAttrChecker, $HTMLTargetAttrChecker): New checkers.
952 (|a| attribute checker): Reimplemented.
953
954 2007-05-20 Wakaba <wakaba@suika.fam.cx>
955
956 * ContentChecker.pm ($HTMLEventHandlerAttrChecker): New placeholder.
957 ($HTMLAttrChecker): Event handler content attributes
958 are added.
959 (link, embed): Required attribute is now checked.
960 (embed): Unknown local attributes are no longer warned.
961
962 2007-05-20 Wakaba <wakaba@suika.fam.cx>
963
964 * ContentChecker.pm ($HTMLSpaceURIsAttrChecker): New placeholder.
965 ($HTMLIMTAttrChecker): New checker.
966 (link@rel, link@href, link@type, style@type,
967 a@href, a@ping, a@ping, a@type, embed@src, embed@type,
968 object@data, object@type, source@src, source@type, area@alt,
969 area@shape, area@coords, area@href,
970 area@ping, area@rel, area@type, script@src,
971 script@defer, script@async, script@type): Checkers added.
972
973 2007-05-20 Wakaba <wakaba@suika.fam.cx>
974
975 * ContentChecker.pm: Descendant checking was incorrect.
976
977 2007-05-19 Wakaba <wakaba@suika.fam.cx>
978
979 * ContentChecker.pm: Support |xml:*| and |xmlns:*|
980 attributes. Report an error if |Element.prefix|
981 is |xmlns|.
982
983 * NanoDOM.pm (prefix): New attribute.
984
985 2007-05-19 Wakaba <wakaba@suika.fam.cx>
986
987 * HTML.pm.src: In |main| phase, |in body| insertion
988 mode, action for |<iframe>| was missing.
989
990 2007-05-19 Wakaba <wakaba@suika.fam.cx>
991
992 * ContentChecker.pm: Support for many of HTML5 elements.
993 ($GetHTMLNonNegativeIntegerAttrChecker): New.
994
995 2007-05-19 Wakaba <wakaba@suika.fam.cx>
996
997 * ContentChecker.pm: Support for most elements up to |progress|.
998 ($HTMLURIAttrChecker): Placeholder.
999 ($HTMLIntegerAttrChecker, $GetHTMLFloatingPointNumberAttrChecker):
1000 New.
1001
1002 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1003
1004 * ContentChecker.pm: Attribute checkers
1005 for global attributes, |html|, |base|, |style|, and |meta|.
1006
1007 * NanoDOM.pm (insert_before): Weaken reference
1008 to the parent node.
1009 (Attr::new): Set |owner_element| attribute.
1010 (namespace_uri, manakai_local_name): New attribute implementations.
1011 (owner_element): New attribute.
1012
1013 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1014
1015 * ContentChecker.pm ($AttrChecker, $HTMLAttrChecker,
1016 $AnyChecker->{attr_checker}, $HTMLAttrsChecker,
1017 $Element->{$HTML_NS}->{''}): New.
1018 (check_element): Invoke attrs_checker for each element.
1019
1020 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1021
1022 * ContentChecker.pm: Don't use |manakai_element_type_match|.
1023
1024 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1025
1026 * ContentChecker.pm: Use hashs rather than lists for
1027 element type testings.
1028
1029 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1030
1031 * ContentChecker.pm: Don't generate duplicate
1032 error when an element type is put in the "minus" list
1033 and the element type is not allowed explicitly in the particular
1034 element content model.
1035 (html:a checker): New checker.
1036 (html:details, html:datagrid): New checkers.
1037 (html:legend): New checker.
1038
1039 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1040
1041 * ContentChecker.pm (html:li checker): Implemented.
1042
1043 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1044
1045 * ContentChecker.pm ($HTMLInlineOrStriclyInlineChecker): New
1046 checker.
1047 (html:dd checker): New checker.
1048 (html:q, html:em, html:strong, html:small,
1049 html:m, html:dfn, html:code, html:samp, html:span): New checkers.
1050
1051 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1052
1053 * ContentChecker.pm ($AnyChecker): Renamed
1054 from |$ElementDefault->{checker}|.
1055 ($ElementDefault->{checker}): Throw an error that
1056 the element type is not supported by the checker.
1057 ($HTMLMetadataElement): |html:base| was missing.
1058 ($HTMLEmptyChecker): Don't throw an error
1059 for inter-element whitespace nodes.
1060 (html:html checker): Errors were not
1061 thrown even if |html:head| and/or |html:body|
1062 children were missing.
1063 (html:head checker): An error was not
1064 thrown if <meta charset> appered after other
1065 elements.
1066
1067 2007-05-05 Wakaba <wakaba@suika.fam.cx>
1068
1069 * ContentChecker.pm: |footer|, |video|, |audio|, |script|,
1070 and |noscript| elements are implemented.
1071 (new): New method.
1072
1073 2007-05-04 Wakaba <wakaba@suika.fam.cx>
1074
1075 * ContentChecker.pm: New module.
1076
1077 2007-05-04 Wakaba <wakaba@suika.fam.cx>
1078
1079 * NanoDOM.pm (manakai_parent_element,
1080 document_element, manakai_local_name,
1081 manakai_element_type_match): New method.
1082
1083 2007-05-03 Wakaba <wakaba@suika.fam.cx>
1084
1085 * HTML.pm.src: Replace decimal and hexadecimal numeric
1086 entities in C1 range using Windows-1252 mapping. Bare LF
1087 did not count as new line for error reporting.
1088
1089 2007-05-02 Wakaba <wakaba@suika.fam.cx>
1090
1091 * NanoDOM.pm (DOMImplementation): New class.
1092 (append_child): Weaken the |parent_node| reference.
1093 (create_element_ns, Element new): Set the |owner_document|
1094 reference.
1095 (implementation): New attribute.
1096 (owner_document, local_name, namespace_uri): New attributes.
1097
1098 * HTML.pm.src (parse_string): Line and column numbers
1099 are now provided to error handler.
1100 (!!!parse-error): Short descriptions are added.
1101 (_construct_tree): Split into three methods; support
1102 for innerHTML mode.
1103 (set_inner_html): New method.
1104
1105 2007-05-01 Wakaba <wakaba@suika.fam.cx>
1106
1107 * NanoDOM.html: Documentation is added.
1108
1109 * HTML.pod, ContentType.html: Documentation is revised.
1110
1111 * .cvsignore: Pod2html temporary files are added.
1112
1113 * Makefile: Make |NanoDOM.html|.
1114
1115 2007-05-01 Wakaba <wakaba@suika.fam.cx>
1116
1117 * HTML.pm.src (parse_string): New method.
1118 (get_inner_html): Renamed from |inner_html|.
1119
1120 * Makefile: A rule for |HTML.html| is added.
1121
1122 * HTML.pod: New documentation.
1123
1124 2007-05-01 Wakaba <wakaba@suika.fam.cx>
1125
1126 * NanoDOM.pm (last_child, previous_sibling): New attributes.
1127 (clone_node): Attribute nodes were not completely copied.
1128
1129 * HTML.pm.src: Many bugs are fixed.
1130
1131 2007-04-30 Wakaba <wakaba@suika.fam.cx>
1132
1133 * HTML.pm.src: Some typos are fixed.
1134
1135 2007-04-30 Wakaba <wakaba@suika.fam.cx>
1136
1137 * mkhtmlparser.pl, Makefile: References to the |HTML-consume-entity.src|
1138 are removed.
1139
1140 * HTML.pm.src: Tokenizer's handling on named entities are rewritten.
1141
1142 * HTML-consume-entity.src: Removed.
1143
1144 2007-04-30 Wakaba <wakaba@suika.fam.cx>
1145
1146 * HTML.pm.src: Tokenizer's handling on hexadecimal
1147 numeric entities are rewritten.
1148
1149 2007-04-30 Wakaba <wakaba@suika.fam.cx>
1150
1151 * HTML.pm.src: Some tokenizer bugs are fixed.
1152
1153 2007-04-30 Wakaba <wakaba@suika.fam.cx>
1154
1155 * HTML.pm.src: The tree construction stage is implemented.
1156
1157 * mkhtmlparser.pl: New macros are added.
1158
1159 2007-04-28 Wakaba <wakaba@suika.fam.cx>
1160
1161 * ContentType.pm: A note on bug in the specification
1162 is removed since it's been now fixed.
1163
1164 * .cvsignore: New file.
1165
1166 2007-04-28 Wakaba <wakaba@suika.fam.cx>
1167
1168 * HTML.pm.src, HTML-consume-entity.src: New files.
1169
1170 * Makefile (HTML.pm): New rule.
1171
1172 * mkhtmlparser.pl: New script.
1173
1174 2007-04-25 Wakaba <wakaba@suika.fam.cx>
1175
1176 * Makefile: New file.
1177
1178 2007-04-24 Wakaba <wakaba@suika.fam.cx>
1179
1180 * ContentType.pm: An error in pod is fixed.
1181
1182 2007-04-24 Wakaba <wakaba@suika.fam.cx>
1183
1184 * ContentType.pm: Documentation is added.
1185
1186 2007-04-24 Wakaba <wakaba@suika.fam.cx>
1187
1188 * ContentType.pm: New Perl module.
1189
1190 * ChangeLog: New file.
1191
1192

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24