Contents of /markup/html/whatpm/Whatpm/ChangeLog

Revision 1.182
Sun Mar 9 10:49:13 2008 UTC by wakaba
Branch: MAIN
Changes since 1.181: +6 -0 lines

1 2008-03-09 Wakaba <wakaba@suika.fam.cx>
2
3 * HTML.pm.src: |</body>| treatment has been changed (HTML5
4 revision 1348). Note that I really don't know whether this makes
5 any difference in the black-box behavior of the parser.
6
7 2008-03-09 Wakaba <wakaba@suika.fam.cx>
8
9 * HTML.pm.src: New end-of-file token implementation (HTML5
10 revision 1348).
11
12 2008-03-09 Wakaba <wakaba@suika.fam.cx>
13
14 * HTML.pm.src: |applet| support (HTML5 revision 1347).
15
16 2008-03-09 Wakaba <wakaba@suika.fam.cx>
17
18 * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
19
20 2008-03-09 Wakaba <wakaba@suika.fam.cx>
21
22 * HTML.pm.src: Support for |<input>| in the "in select" insertion
23 mode and support for the "in select in table" insertion mode (HTML5
24 revision 1342).
25
26 2008-03-09 Wakaba <wakaba@suika.fam.cx>
27
28 * HTML.pm.src: No foster parenting for <style> and <script>
29 in non-tainted <table>s (HTML5 revision 1336).
30
31 2008-03-09 Wakaba <wakaba@suika.fam.cx>
32
33 * HTML.pm.src: Ignore white space characters between <html>
34 and <head> (HTML5 revision 1332).
35
36 2008-03-09 Wakaba <wakaba@suika.fam.cx>
37
38 * HTML.pm.src: Treat <input type=hidden> as if it were a
39 white space (HTML5 revision 1331).
40
41 2008-03-08 Wakaba <wakaba@suika.fam.cx>
42
43 * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
44 element (HTML5 revision 1330).
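
A minimal sketch of the rule above, written in Python rather than the
parser's own Perl. The function name and the set of element names are
illustrative assumptions, not Whatpm identifiers: a single U+000A that
immediately follows the start tag of certain elements is dropped.

```python
# Elements whose start tag swallows one immediately following LF
# (assumed set for this illustration).
NEWLINE_EATING_ELEMENTS = {"pre", "listing", "textarea"}

def drop_initial_newline(tag_name: str, text: str) -> str:
    """Return text with one leading U+000A removed for the listed elements."""
    if tag_name in NEWLINE_EATING_ELEMENTS and text.startswith("\n"):
        return text[1:]
    return text
```

Only the first line feed is removed; a second one is kept as content.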
45
46 2008-03-08 Wakaba <wakaba@suika.fam.cx>
47
48 * HTML.pm.src: <title> is always appended to the current
49 element (HTML5 revision 1328).
50
51 2008-03-08 Wakaba <wakaba@suika.fam.cx>
52
53 * HTML.pm.src: White space in tainted tables is moved
54 into foster parents (HTML5 revision 1326).
55
56 2008-03-08 Wakaba <wakaba@suika.fam.cx>
57
58 * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
59 revision 1321).
60
61 2008-03-08 Wakaba <wakaba@suika.fam.cx>
62
63 * HTML.pm.src: |</h/n/>| case code rearranged to align with
64 the spec (HTML5 revision 1320). Note that we finally complete
65 all of HTML5 revision 1320 changes.
66
67 2008-03-08 Wakaba <wakaba@suika.fam.cx>
68
69 * HTML.pm.src: |</form>| now works similarly to |</div>| for unclosed
70 tags (HTML5 revision 1320).
71
72 2008-03-08 Wakaba <wakaba@suika.fam.cx>
73
74 * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
75
76 2008-03-08 Wakaba <wakaba@suika.fam.cx>
77
78 * HTML.pm.src: A "generate implied end tags" code (t409.1)
79 could not be reached, so it is now removed (HTML5 revision
80 1320).
81
82 2008-03-08 Wakaba <wakaba@suika.fam.cx>
83
84 * HTML.pm.src: Code for the case of |</div>| and so on is revised
85 to align with new spec text (HTML5 revision 1320).
86
87 2008-03-08 Wakaba <wakaba@suika.fam.cx>
88
89 * HTML.pm.src: Remove strange |if| condition; however, it should
90 have done no harm in theory.
91
92 2008-03-08 Wakaba <wakaba@suika.fam.cx>
93
94 * HTML.pm.src (_tree_construction_main): '</p>' in body
95 case is split from other end tags for the preparation
96 of implementing HTML5 revision 1320.
97
98 2008-03-07 Wakaba <wakaba@suika.fam.cx>
99
100 * HTML.pm.src: Simplified "generate implied end tag" (HTML5
101 revision 1320).
102
103 2008-03-07 Wakaba <wakaba@suika.fam.cx>
104
105 * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
106 and "div" (HTML5 revision 1318). Add comments to where
107 |form| pointer association codes should be inserted (HTML5 revision
108 1319).
109
110 2008-03-06 Wakaba <wakaba@suika.fam.cx>
111
112 * HTML.pm.src: <html> treatment refined (HTML5 revision 1314).
113
114 2008-03-05 Wakaba <wakaba@suika.fam.cx>
115
116 * HTML.pm.src: Since the case t268 should never be reached (there
117 is no other token type), it is replaced by a |die| statement.
118
119 2008-03-05 Wakaba <wakaba@suika.fam.cx>
120
121 * HTML.pm.src: Typo fixed.
122
123 2008-03-04 Wakaba <wakaba@suika.fam.cx>
124
125 * HTML.pm.src (_tree_construction_initial): Some limited quirks
126 doctypes were not uppercased for comparison.
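
A minimal sketch of the fix described above, in Python rather than the
parser's own Perl. The constant and function names are hypothetical, and
the two identifiers are illustrative, not the parser's full table: the
public identifier is uppercased before it is compared against the
(uppercase) limited-quirks list.

```python
# Illustrative limited-quirks public identifiers, stored in uppercase.
LIMITED_QUIRKS_PUBLIC_IDS = {
    "-//W3C//DTD XHTML 1.0 FRAMESET//EN",
    "-//W3C//DTD XHTML 1.0 TRANSITIONAL//EN",
}

def is_limited_quirks(public_id: str) -> bool:
    """Compare case-insensitively by uppercasing the input first."""
    return public_id.upper() in LIMITED_QUIRKS_PUBLIC_IDS
```

Without the `upper()` call, a document using the usual mixed-case
spelling would never match the uppercase table.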
127
128 2008-03-03 Wakaba <wakaba@suika.fam.cx>
129
130 * HTML.pm.src (tree construction and set_inner_html): Checkpoints
131 are added.
132
133 2008-03-03 Wakaba <wakaba@suika.fam.cx>
134
135 * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
136 are set. Cases that are unlikely to be reached are noted as such.
137
138 2008-03-03 Wakaba <wakaba@suika.fam.cx>
139
140 * HTML.pm.src: Checkpoints for debugging are added.
141
142 * mkhtmlparser.pl: Support for |!!!cp| syntax.
143
144 2008-03-03 Wakaba <wakaba@suika.fam.cx>
145
146 * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
147 for simplicity.
148
149 2008-03-03 Wakaba <wakaba@suika.fam.cx>
150
151 * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
152 revision 1307).
153
154 2008-03-03 Wakaba <wakaba@suika.fam.cx>
155
156 * HTML.pm.src (_get_next_token): The places where the /incorrect/
157 flag is set are changed (HTML5 revision 1305).
158
159 2008-03-02 Wakaba <wakaba@suika.fam.cx>
160
161 * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
162 1292). Entities are not parsed in comment-like part in RCDATA
163 elements (HTML5 revision 1294). Allow bare & at the end
164 of attribute value literals (HTML5 revision 1296). More
165 quirks mode doctypes (HTML5 revision 1302). Require spaces
166 between attributes and ban attribute names or unquoted
167 attribute values containing single or double quotes (HTML5
168 revision 1303).
169
170 2008-03-02 Wakaba <wakaba@suika.fam.cx>
171
172 * ContentChecker.pm: Typo fixed. Don't raise "character encoding"
173 and related errors unless it is an HTML document (though the spec
174 is unclear on whether it is applied to XHTML documents).
175
176 * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
177 are added.
178
179 2008-03-02 Wakaba <wakaba@suika.fam.cx>
180
181 * HTML.pm.src: s/local_name/manakai_local_name/g.
182
183 2008-03-01 Wakaba <wakaba@suika.fam.cx>
184
185 * _NamedEntityList.pm: Updated (HTML5 revision 1286).
186
187 * HTML.pm.src: |charset| in |content| attribute is
188 case-insensitive (HTML5 revision 1270).
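
A simplified sketch of case-insensitive |charset| matching, in Python
rather than the parser's own Perl. The regular expression and the
function name are assumptions for illustration; the algorithm in the
HTML5 draft has more steps than this.

```python
import re

# Match "charset", optional white space, "=", then a quoted or bare value.
# re.IGNORECASE makes "CHARSET" and "Charset" match as well.
_CHARSET_RE = re.compile(
    r"charset\s*=\s*(\"[^\"]*\"|'[^']*'|[^\s;]+)", re.IGNORECASE
)

def extract_charset(content: str):
    """Return the charset named in a meta content value, or None."""
    match = _CHARSET_RE.search(content)
    if match is None:
        return None
    # Remove surrounding quotes, if any.
    return match.group(1).strip("\"'")
```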
189
190 2008-02-26 Wakaba <wakaba@suika.fam.cx>
191
192 * ContentChecker.pm: New status constants are added.
193 ($ElementDefault): |status| added.
194 (check_element): Err for non-standard or deprecated elements.
195 (_attr_status_info): For non-standard or deprecated attributes.
196
197 2008-02-24 Wakaba <wakaba@suika.fam.cx>
198
199 * ContentChecker.pm (_attr_status_info): New internal method.
200
201 2008-02-24 Wakaba <wakaba@suika.fam.cx>
202
203 * ContentChecker.pm (check_element): Element standardized
204 status information is now dispatched.
205
206 2008-02-24 Wakaba <wakaba@suika.fam.cx>
207
208 * ContentChecker.pm (check_element): Fix |del|-and-significant
209 problem by adding some more arguments.
210
211 2008-02-24 Wakaba <wakaba@suika.fam.cx>
212
213 * ContentChecker.pm (check_element): Use context of
214 container-for-the-purpose-of-content-model element (not transparent
215 element) for |check_child_element| calling and significant
216 text flag marking. This reintroduces the |<del>|-and-significant
217 problem.
218
219 2008-02-24 Wakaba <wakaba@suika.fam.cx>
220
221 * ContentChecker.pm (check_element): Make semi-transparent
222 elements ignored for the purpose of phase changes in
223 content model checking.
224
225 2008-02-23 Wakaba <wakaba@suika.fam.cx>
226
227 * ContentChecker.pm (check_element): In-element state
228 was not properly managed for transparent cases.
229
230 2008-02-23 Wakaba <wakaba@suika.fam.cx>
231
232 * ContentChecker.pm (check_element): Support for |video|
233 and |audio| as semi-transparent elements.
234
235 2008-02-23 Wakaba <wakaba@suika.fam.cx>
236
237 * ContentChecker.pm ($HTMLSemiTransparentElements): New.
238 (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
239 Support for |html:object| as a semi-transparent element.
240
241 2008-02-23 Wakaba <wakaba@suika.fam.cx>
242
243 * ContentChecker.pm (check_element): The way to traverse
244 the tree is entirely revised to make it easier to track
245 the state of ancestors/descendants. As a result of this
246 revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
247 support for content model checking for HTML elements |figure|,
248 |object|, |video|, and |audio| and checking for XML elements (and
249 some XMLNS checkings) are dropped for now. They will be
250 reimplemented in due course.
251
252 2008-02-17 Wakaba <wakaba@suika.fam.cx>
253
254 * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
255 token (HTML5 revision 1225).
256
257 2008-02-17 Wakaba <wakaba@suika.fam.cx>
258
259 * ContentChecker.pm ({unsupported_level}): New value.
260
261 * HTML.pm.src: Save whether |meta| |content| attribute
262 contains character references or not.
263
264 2008-02-17 Wakaba <wakaba@suika.fam.cx>
265
266 * ContentChecker.pm (_get_children): (Incomplete) attempt to
267 implement significant content checking for contents
268 with |del| elements.
269
270 2008-02-17 Wakaba <wakaba@suika.fam.cx>
271
272 * ContentChecker.pm ($HTMLTransparentElements): More
273 elements are added.
274 (_get_children): HTML |object| elements are now semi-transparent.
275
276 * NanoDOM.pm (manakai_html, manakai_head): New methods.
277
278 2008-02-16 Wakaba <wakaba@suika.fam.cx>
279
280 * CacheManifest.pm: HTML5 revision 1211 implemented.
281
282 * CacheManifest.pod: Updated.
283
284 2008-02-10 Wakaba <wakaba@suika.fam.cx>
285
286 * ContentChecker.pm (check_document, check_element): Support
287 for second argument ($onsubdoc).
288 (_get_css_parser): Removed (now it is part of WDCC).
289
290 2008-02-09 Wakaba <wakaba@suika.fam.cx>
291
292 * ContentChecker.pm (_get_css_parser): New.
293
294 2007-11-25 Wakaba <wakaba@suika.fam.cx>
295
296 * ContentChecker.pm ($AnyChecker): Old way to add child elements
297 for checking had been used.
298
299 2007-11-25 Wakaba <wakaba@suika.fam.cx>
300
301 * ContentChecker.pm (check_element): New todo item type |descendant|.
302
303 2007-11-23 Wakaba <wakaba@suika.fam.cx>
304
305 * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
306 specced in RFC 2046 and RFC 4288.
307 (application/atom+xml): Definition added.
308
309 2007-11-23 Wakaba <wakaba@suika.fam.cx>
310
311 * URIChecker.pm: Make RFC 3986 should-level errors
312 warnings (rather than SHOULD-level errors).
313
314 2007-11-23 Wakaba <wakaba@suika.fam.cx>
315
316 * NanoDOM.pm (get_user_data, set_user_data): New methods.
317
318 * HTML.pm.src: A flag for character references in attribute
319 values is added. Set |manakai_has_reference| user data
320 to |charset| attribute.
321
322 2007-11-23 Wakaba <wakaba@suika.fam.cx>
323
324 * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
325 attributes.
326
327 * ContentChecker.pm (check_document): Warn if charset requirements
328 cannot be tested.
329
330 2007-11-19 Wakaba <wakaba@suika.fam.cx>
331
332 * HTML.pm.src (parse_byte_string): Detect charset
333 by universalchardet if charset parameter is not specified.
334
335 * Makefile (Charset-all, Charset-clean): New rules.
336
337 2007-11-18 Wakaba <wakaba@suika.fam.cx>
338
339 * ContentChecker.pm (check_document): Check the existence
340 of character encoding declaration and selection of encoding
341 for HTML document.
342
343 2007-11-18 Wakaba <wakaba@suika.fam.cx>
344
345 * ContentType.pm (get_sniffed_type): Return also the official
346 type in list context.
347
348 2007-11-18 Wakaba <wakaba@suika.fam.cx>
349
350 * ContentType.pm: Sniffing with leading white space
351 ignoring (HTML5 revisions 1013 and 1016).
352
353 2007-11-18 Wakaba <wakaba@suika.fam.cx>
354
355 * ContentType.pm: HTML5 revision 1013 changes, except for leading
356 white spaces, are implemented.
357
358 2007-11-11 Wakaba <wakaba@suika.fam.cx>
359
360 * HTML.pm.src (parse_char_string): Set |inner_encoding|
361 attribute if possible.
362
363 2007-11-11 Wakaba <wakaba@suika.fam.cx>
364
365 * HTML.pm.src (parse_byte_string): New method.
366 (parse_char_string): New alias for |parse_string|.
367 (main phase): Invoking "change the encoding" algorithm if desired.
368
369 * HTML.pod: Updated.
370
371 2007-11-11 Wakaba <wakaba@suika.fam.cx>
372
373 * HTML.pod (get_inner_html): Removed.
374
375 * Makefile (HTML-all, HTML-clean): New.
376
377 2007-11-11 Wakaba <wakaba@suika.fam.cx>
378
379 * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
380
381 2007-11-08 Wakaba <wakaba@suika.fam.cx>
382
383 * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
384
385 2007-11-04 Wakaba <wakaba@suika.fam.cx>
386
387 * Makefile: |CacheManifest.html| is added.
388
389 * CacheManifest.pod: New file.
390
391 2007-11-04 Wakaba <wakaba@suika.fam.cx>
392
393 * CacheManifest.pm: New module.
394
395 2007-11-04 Wakaba <wakaba@suika.fam.cx>
396
397 * HTML.pm.src: Support for application cache selection algorithm
398 callback.
399
400 2007-11-04 Wakaba <wakaba@suika.fam.cx>
401
402 * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
403
404 2007-10-17 Wakaba <wakaba@suika.fam.cx>
405
406 * Makefile (clean): New rule.
407
408 * NanoDOM.pm (public_id, system_id): New attributes.
409
410 2007-10-17 Wakaba <wakaba@suika.fam.cx>
411
412 * Makefile (CSS-all, CSS-clean, clean): New rules.
413
414 2007-10-14 Wakaba <wakaba@suika.fam.cx>
415
416 * ContentChecker.pm (check_document): Support for
417 new |is_xml_root| flag.
418 (check_element): Support for new |pluses| state.
419 (_add_pluses): New method.
420 (_remove_minuses): Support for new |minus| item.
421
422 2007-09-24 Wakaba <wakaba@suika.fam.cx>
423
424 * ContentChecker.pm: Raise specific error for invalid
425 root element.
426
427 2007-09-24 Wakaba <wakaba@suika.fam.cx>
428
429 * ContentChecker.pm: Set level values for later uses.
430
431 2007-09-09 Wakaba <wakaba@suika.fam.cx>
432
433 * ContentChecker.pm: Support for language tag validation.
434
435 2007-09-09 Wakaba <wakaba@suika.fam.cx>
436
437 * LangTag.pm (check_rfc3066_language_tag): New method.
438
439 2007-09-09 Wakaba <wakaba@suika.fam.cx>
440
441 * LangTag.pm: New module.
442
443 2007-09-04 Wakaba <wakaba@suika.fam.cx>
444
445 * HTML.pm.src: Some error types were wrong.
446
447 2007-08-17 Wakaba <wakaba@suika.fam.cx>
448
449 * CSS/: New directory.
450
451 2007-08-17 Wakaba <wakaba@suika.fam.cx>
452
453 * ContentChecker.pm (_check_get_children): Support
454 for |noscript| in |head|.
455
456 2007-08-12 Wakaba <wakaba@suika.fam.cx>
457
458 * URI/: New directory.
459
460 2007-08-11 Wakaba <wakaba@suika.fam.cx>
461
462 * HTML.pm.src: Tokenizer's states are now represented in
463 number.
464
465 2007-08-11 Wakaba <wakaba@suika.fam.cx>
466
467 * HTML.pm.src: |or|s for insertion modes are replaced
468 by |&|s.
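
One way to read the change above, sketched in Python rather than the
parser's own Perl. All constant names and values are hypothetical: once
insertion modes are numbers, a chain of |or| comparisons can become a
single bitwise |&| test against a group bit.

```python
# Hypothetical group bits shared by related insertion modes.
BODY_IMS = 0x100
TABLE_IMS = 0x200

# Hypothetical mode values: group bit plus a small distinguishing number.
IN_BODY_IM = BODY_IMS | 0x01
IN_CELL_IM = BODY_IMS | 0x02
IN_CAPTION_IM = BODY_IMS | 0x03
IN_TABLE_IM = TABLE_IMS | 0x01
IN_ROW_IM = TABLE_IMS | 0x02

def is_body_like(mode: int) -> bool:
    """Replace `mode == A or mode == B or mode == C` with one & test."""
    return bool(mode & BODY_IMS)
```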
469
470 2007-08-11 Wakaba <wakaba@suika.fam.cx>
471
472 * HTML.pm.src: Token types are now represented in number.
473
474 2007-08-11 Wakaba <wakaba@suika.fam.cx>
475
476 * ContentType.pm (SEE ALSO): Updated.
477
478 * HTML.pm.src: Insertion modes are now represented in number.
479
480 2007-08-11 Wakaba <wakaba@suika.fam.cx>
481
482 * ContentType.pm: Sniffing for bitmap images (HTML5 revision
483 999) is implemented.
484
485 2007-08-08 Wakaba <wakaba@suika.fam.cx>
486
487 * ContentType.pm: Sniffing for |<script| (HTML5 revision
488 983) is implemented.
489
490 2007-08-06 Wakaba <wakaba@suika.fam.cx>
491
492 * ContentChecker.pod: New documentation.
493
494 * Makefile: A rule for |ContentChecker.html| is added.
495
496 * ContentChecker.pm: A pod "LICENSE" section is added.
497
498 * NanoDOM.pm ($VERSION): New variable.
499
500 2007-08-05 Wakaba <wakaba@suika.fam.cx>
501
502 * H2H.pm: |b|, |i|, and |sub| are added to the
503 list of allowed HTML elements.
504
505 2007-08-05 Wakaba <wakaba@suika.fam.cx>
506
507 * H2H.pm: |samp| is added to the list of allowed
508 HTML elements.
509
510 * URIChecker.pm (check_iri): New.
511 (check_iri_reference): Error type for IRI reference
512 syntax error is changed.
513
514 2007-08-04 Wakaba <wakaba@suika.fam.cx>
515
516 * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
517 is added.
518 (check_document): Load appropriate module before validation.
519
520 2007-08-04 Wakaba <wakaba@suika.fam.cx>
521
522 * ContentChecker/: New directory.
523
524 2007-08-04 Wakaba <wakaba@suika.fam.cx>
525
526 * ContentChecker.pm: HTML |time| element is implemented.
527
528 * HTMLTable.pm: Comments are updated as HTML5 is revised.
529
530 2007-07-21 Wakaba <wakaba@suika.fam.cx>
531
532 * ContentChecker.pm (check_document): Return value
533 even if no document element is found.
534
535 2007-07-21 Wakaba <wakaba@suika.fam.cx>
536
537 * HTML.pm.src: |$in_body| is no longer a function.
538
539 2007-07-21 Wakaba <wakaba@suika.fam.cx>
540
541 * HTML.pm.src: The |$in_body| code has been moved down.
542
543 2007-07-21 Wakaba <wakaba@suika.fam.cx>
544
545 * HTML.pm.src: The "trailing end" insertion mode
546 is split into "after html body" and "after html frameset"
547 insertion modes. Their codes are merged with "after body"
548 and "after frameset" codes. |$previous_insertion_mode|
549 has been removed. "after frameset" code is
550 merged with "in frameset" code.
551
552 2007-07-21 Wakaba <wakaba@suika.fam.cx>
553
554 * HTML.pm.src: The "before head" insertion mode is
555 merged with the "in head" insertion mode.
556
557 2007-07-21 Wakaba <wakaba@suika.fam.cx>
558
559 * HTML.pm.src: Cases in "in head" insertion mode are
560 reorganized.
561
562 2007-07-21 Wakaba <wakaba@suika.fam.cx>
563
564 * HTML.pm.src: Some cases in "in table" insertion mode
565 are merged.
566
567 2007-07-21 Wakaba <wakaba@suika.fam.cx>
568
569 * HTML.pm.src: The "in row" insertion mode is merged
570 with "in table" insertion mode.
571
572 2007-07-21 Wakaba <wakaba@suika.fam.cx>
573
574 * HTML.pm.src: The "in table" and "in table body" insertion
575 modes are merged.
576
577 2007-07-21 Wakaba <wakaba@suika.fam.cx>
578
579 * HTML.pm.src: There is no "in table head" or "in table foot"
580 insertion mode!
581
582 2007-07-21 Wakaba <wakaba@suika.fam.cx>
583
584 * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
585 now directly invoke the handler.
586
587 2007-07-21 Wakaba <wakaba@suika.fam.cx>
588
589 * HTML.pm.src: Code for the "in cell" insertion mode
590 is merged into the "in body" insertion mode code.
591
592 2007-07-21 Wakaba <wakaba@suika.fam.cx>
593
594 * HTML.pm.src: Code for the "in body" and "in caption"
595 insertion modes is merged.
596
597 2007-07-21 Wakaba <wakaba@suika.fam.cx>
598
599 * HTML.pm.src: Two |!!!next-token|s were missing.
600
601 2007-07-21 Wakaba <wakaba@suika.fam.cx>
602
603 * HTML.pm.src: Use numeric constant for |{content_mode}|
604 instead of string constant for |{content_model_flag}|.
605
606 2007-07-21 Wakaba <wakaba@suika.fam.cx>
607
608 * HTML.pm.src: Add the name of the attribute
609 to the "duplicate attribute" error.
610
611 2007-07-17 Wakaba <wakaba@suika.fam.cx>
612
613 * ContentChecker.pm: Return the |class| node list.
614
615 2007-07-17 Wakaba <wakaba@suika.fam.cx>
616
617 * ContentChecker.pm: Return the |id| node list.
618
619 * HTML.pm.src: A typo is fixed.
620
621 2007-07-16 Wakaba <wakaba@suika.fam.cx>
622
623 * ContentChecker.pm: Drop wrong |level => 'error'| specification
624 from "in HTML:xml:lang" error. Character position
625 is now the last part of the error type in the URI error
626 description. Report "unsupported" status for language
627 tags, media queries, script codes, and style sheets.
628
629 2007-07-16 Wakaba <wakaba@suika.fam.cx>
630
631 * ContentChecker.pm: Report error if |xml:lang|
632 in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
633 in XML.
634
635 * NanoDOM.pm (Attr.owner_document): New attribute.
636
637 2007-07-16 Wakaba <wakaba@suika.fam.cx>
638
639 * HTML.pm.src: The character immediately following
640 a bare |hcro| was discarded. Fix handling of
641 entity references in attribute values.
642
643 2007-07-16 Wakaba <wakaba@suika.fam.cx>
644
645 * HTML.pm.src (main and trailing end phases): Token
646 types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
647 factored out. Error types |in html:#DOCTYPE|
648 and |after html:#DOCTYPE| are merged
649 into |DOCTYPE in the middle|. |</frameset>|
650 in fragment parsing mode changed the
651 insertion mode.
652
653 2007-07-16 Wakaba <wakaba@suika.fam.cx>
654
655 * HTML.pm.src: |$phase| has been removed; The |trailing end|
656 phase is now an insertion mode. Treatments for white
657 space character tokens were incorrect for some
658 insertion modes. An old |meta| case was not removed.
659
660 2007-07-16 Wakaba <wakaba@suika.fam.cx>
661
662 * HTML.pm.src: |meta| charset declaration extraction
663 implemented (but changing the encoding is not yet:-).
664
665 2007-07-15 Wakaba <wakaba@suika.fam.cx>
666
667 * Charset/: New directory.
668
669 2007-07-15 Wakaba <wakaba@suika.fam.cx>
670
671 * H2H.pm: New Perl module (created from
672 manakai's H2H.dis).
673
674 2007-07-15 Wakaba <wakaba@suika.fam.cx>
675
676 * XMLSerializer.pm: New Perl module (created from
677 manakai's SimpleLS.dis).
678
679 2007-07-07 Wakaba <wakaba@suika.fam.cx>
680
681 * HTML.pm.src: |<!---x-->| was not processed correctly.
682
683 2007-07-01 Wakaba <wakaba@suika.fam.cx>
684
685 * HTML.pm.src: Report correct error message
686 for |<body></div></body>|.
687
688 2007-07-01 Wakaba <wakaba@suika.fam.cx>
689
690 * HTMLTable.pm: An error description was incorrect.
691
692 2007-06-30 Wakaba <wakaba@suika.fam.cx>
693
694 * ContentChecker.pm: Return |{term}| list.
695
696 2007-06-30 Wakaba <wakaba@suika.fam.cx>
697
698 * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
699 nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
700 and <title> outside of head).
701
702 2007-06-30 Wakaba <wakaba@suika.fam.cx>
703
704 * IMTChecker.pm: Report warning for unregistered
705 and private types/subtypes.
706
707 * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
708 URIChecker.pm, HTMLTable.pm: Error messages are now
709 consistent; they are all listed in
710 <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
711
712 2007-06-25 Wakaba <wakaba@suika.fam.cx>
713
714 * ContentChecker.pm: |<img ismap>| not in |<a></a>|
715 is now reported as an error. |<datalist>| is implemented.
716 Attribute checkers for |<command>| and |<menu>| are
717 added. Support for |contextmenu| global attribute
718 is added.
719
720 2007-06-25 Wakaba <wakaba@suika.fam.cx>
721
722 * HTML.pm.src (_reset_insertion_mode): Interpretation
723 of Step 3 has been changed.
724
725 2007-06-25 Wakaba <wakaba@suika.fam.cx>
726
727 * HTML.pm.src: Late |<html>| parse error is implemented.
728
729 2007-06-24 Wakaba <wakaba@suika.fam.cx>
730
731 * URIChecker.pm (check_iri_reference): A |decode| method name was
732 incorrect.
733
734 * ContentChecker.pm: Support for the |footer| element.
735 Check URI syntax for space-separated URI attributes.
736 Support for the |tabindex| attribute. Support
737 for |datetime| attribute.
738
739 2007-06-24 Wakaba <wakaba@suika.fam.cx>
740
741 * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
742 character references). HTML5 revision 1.146 (white space
743 characters before root start tag). HTML5 revision
744 1.148 (named character references in attribute values).
745 HTML5 revision 1.152 (<plaintext>.innerHTML get).
746
747 2007-06-24 Wakaba <wakaba@suika.fam.cx>
748
749 * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
750 in <head>).
751
752 2007-06-24 Wakaba <wakaba@suika.fam.cx>
753
754 * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>
755 in body).
756
757 * ContentChecker.pm: HTML5 revision 938 (scoped="").
758
759 2007-06-24 Wakaba <wakaba@suika.fam.cx>
760
761 * HTML.pm.src: HTML5 revision 923 (matching end tag in CDATA
762 or RCDATA in fragment parsing mode). HTML5 revision
763 924 (<!--> and <!--->). HTML5 revision 926 (hn in hn).
764
765 2007-06-23 Wakaba <wakaba@suika.fam.cx>
766
767 * HTML.pm.src (get_inner_html): HTML5 revision 922 (inner_html
768 for <pre> and <textarea>).
769
770 2007-06-23 Wakaba <wakaba@suika.fam.cx>
771
772 * HTML.pm.src: HTML5 revision 920 (<isindex>).
773
774 2007-06-23 Wakaba <wakaba@suika.fam.cx>
775
776 * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).
777
778 2007-06-23 Wakaba <wakaba@suika.fam.cx>
779
780 * HTML.pm.src: HTML5 revision 916 (</body>).
781 HTML5 revision 917 (conforming bare &).
782
783 2007-06-23 Wakaba <wakaba@suika.fam.cx>
784
785 * NanoDOM.pm (manakai_is_html): Setting to false did not work.
786
787 * HTML.pm.src: HTML5 revision 914 (</ in CDATA, RCDATA).
788 HTML5 revision 915 (<nobr>).
789
790 2007-06-23 Wakaba <wakaba@suika.fam.cx>
791
792 * HTML.pm.src: HTML5 revisions 908, 909, 912, and 913 (quirks mode).
793
794 * NanoDOM.pm (manakai_is_html, manakai_compat_mode, compat_mode):
795 New attributes.
796
797 2007-06-23 Wakaba <wakaba@suika.fam.cx>
798
799 * HTML.pm.src: HTML5 revisions 900, 901, 902, and 911 (<
800 in tags).
801
802 2007-06-23 Wakaba <wakaba@suika.fam.cx>
803
804 * .cvsignore: |Entities.html| is added.
805
806 * HTML.pm.src: |$entity_char| is removed and
807 requires |Whatpm::_NamedEntityList| instead.
808 HTML5 revision 898 (refc), except that lack of refc
809 is parse error.
810
811 * mkentitylist.pl: New script.
812
813 * Makefile (all): |_NamedEntityList.pm| is added.
814 (_NamedEntityList.pm, Entities.html): New rules.
815
816 2007-06-23 Wakaba <wakaba@suika.fam.cx>
817
818 * HTML.pm.src: Parse errors immediately after U+000D
819 were ignored and U+000D immediately following another
820 U+000D was not converted to U+000A.
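
A minimal sketch of the newline normalization that the fix above
concerns, in Python rather than the parser's own Perl. The function name
is an assumption: each U+000D becomes U+000A, a U+000D U+000A pair
collapses into one U+000A, and two consecutive U+000D characters yield
two U+000A characters.

```python
def normalize_newlines(text: str) -> str:
    """Convert CR LF pairs and lone CRs to LF, one character at a time."""
    out = []
    i = 0
    while i < len(text):
        ch = text[i]
        if ch == "\r":
            out.append("\n")
            # A CR LF pair collapses into a single LF. A CR followed by
            # another CR does not; the second CR is handled on the next pass.
            if i + 1 < len(text) and text[i + 1] == "\n":
                i += 1
        else:
            out.append(ch)
        i += 1
    return "".join(out)
```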
821
822 2007-06-23 Wakaba <wakaba@suika.fam.cx>
823
824 * HTML.pm.src (set_inner_html): HTML5 revision 892 (adopt
825 nodes before appended). Parser was not ready for NULL
826 parse error and escape flag.
827
828 * NanoDOM.pm (adopt_node): New.
829
830 2007-06-23 Wakaba <wakaba@suika.fam.cx>
831
832 * HTML.pm.src: HTML5 revision 886 (insane comment in
833 CDATA and RCDATA). Note that the current implementation
834 simply repeats what the spec says, which may not be
835 the best way to do it.
836
837 2007-06-23 Wakaba <wakaba@suika.fam.cx>
838
839 * HTML.pm.src: HTML5 revision 884 (</form> doesn't close
840 the form element if a descendant element without implied end tag is
841 still open).
842
843 2007-06-23 Wakaba <wakaba@suika.fam.cx>
844
845 * ContentChecker.pm: HTML5 revision 881 (Make |id| attribute with
846 space characters non-conforming).
847
848 2007-06-23 Wakaba <wakaba@suika.fam.cx>
849
850 * HTML.pm.src: An error message was incorrect.
851 HTML5 revision 869 (C1 character references).
852
853 2007-06-23 Wakaba <wakaba@suika.fam.cx>
854
855 * HTML.pm.src: HTML5 revision 867 (a LF at the beginning of
856 a |textarea| is removed).
857
858 2007-06-05 Wakaba <wakaba@suika.fam.cx>
859
860 * NanoDOM.pm (get_attribute_node_ns): New method.
861
862 * ContentChecker.pm: |script| |async| and |defer|
863 no longer require |src|. |async| MUST NOT be
864 specified if |defer| is specified (HTML5 revision 858).
865
866 2007-05-30 Wakaba <wakaba@suika.fam.cx>
867
868 * HTML.pm.src: |<form><form>| caused an infinite loop.
869
870 2007-05-27 Wakaba <wakaba@suika.fam.cx>
871
872 * ContentChecker.pm (html): Set |is_root| (allowed
873 as a document element) flag on.
874 (new): Removed.
875 (check_document): New method.
876
877 2007-05-27 Wakaba <wakaba@suika.fam.cx>
878
879 * ContentChecker.pm (thead, tfoot): Checker specifications
880 were incorrect.
881
882 2007-05-27 Wakaba <wakaba@suika.fam.cx>
883
884 * ContentChecker.pm ($HTMLURIAttrChecker): Include
885 error position in the |type| option of the error.
886
887 * HTMLTable.pm (form_table): The |$onerror| parameter
888 is now optional. Some bugs are fixed.
889
890 2007-05-27 Wakaba <wakaba@suika.fam.cx>
891
892 * HTMLTable.pm: New module.
893
894 * ContentChecker.pm (table): Invoke table model error checker.
895
896 * NanoDOM.pm (first_child, get_attribute_ns): New.
897
898 2007-05-26 Wakaba <wakaba@suika.fam.cx>
899
900 * ContentChecker.pm ($HTMLLinkTypesAttrChecker): New checker.
901 (link/@rel, a/@rel, area/@rel): Use new checker.
902
903 * Makefile (_LinkTypeList.pm, RelExtensions.html): New rules.
904
905 * _LinkTypeList.pm: New file.
906
907 * mklinktypelist.pl: New file.
908
909 * .cvsignore: |RelExtensions.html| added.
910
911 * NanoDOM.pm (child_nodes): Returns an empty array
912 for non-child-containing node types.
913 (text_content): New attribute.
914
915 2007-05-26 Wakaba <wakaba@suika.fam.cx>
916
917 * IMTChecker.pm: New module.
918
919 * ContentChecker.pm ($HTMLIMTAttrChecker): Call IMTChecker
920 to test parameter value validity.
921
922 * HTML.pm.src ($style_start_tag): Attributes were
923 discarded.
924
925 2007-05-25 Wakaba <wakaba@suika.fam.cx>
926
927 * ContentChecker.pm ($HTMLURIAttrChecker): Implemented.
928
929 2007-05-25 Wakaba <wakaba@suika.fam.cx>
930
931 * URIChecker.pm: All recommendations from RFC 3986
932 and RFC 3987 are listed (not all testable items are checked yet).
933
934 2007-05-25 Wakaba <wakaba@suika.fam.cx>
935
936 * URIChecker.pm: New module.
937
938 2007-05-20 Wakaba <wakaba@suika.fam.cx>
939
940 * ContentChecker.pm: Now most attributes are implemented
941 or associated to some placeholder.
942 ($ElementDefault): Warn unknown attributes
943 for unknown elements as "attribute not supported".
944 ($HTMLLanguageTagAttrChecker, $HTMLMQAttrChecker): New placeholders.
945 ($HTMLUsemapAttrChecker, $HTMLTargetAttrChecker): New checkers.
946 (|a| attribute checker): Reimplemented.
947
948 2007-05-20 Wakaba <wakaba@suika.fam.cx>
949
950 * ContentChecker.pm ($HTMLEventHandlerAttrChecker): New placeholder.
951 ($HTMLAttrChecker): Event handler content attributes
952 are added.
953 (link, embed): Required attribute is now checked.
954 (embed): Unknown local attributes are no longer warned.
955
956 2007-05-20 Wakaba <wakaba@suika.fam.cx>
957
958 * ContentChecker.pm ($HTMLSpaceURIsAttrChecker): New placeholder.
959 ($HTMLIMTAttrChecker): New checker.
960 (link@rel, link@href, link@type, style@type,
961 a@href, a@ping, a@ping, a@type, embed@src, embed@type,
962 object@data, object@type, source@src, source@type, area@alt,
963 area@shape, area@coords, area@href,
964 area@ping, area@rel, area@type, script@src,
965 script@defer, script@async, script@type): Checkers added.
966
967 2007-05-20 Wakaba <wakaba@suika.fam.cx>
968
969 * ContentChecker.pm: Descendant checking was incorrect.
970
971 2007-05-19 Wakaba <wakaba@suika.fam.cx>
972
973 * ContentChecker.pm: Support |xml:*| and |xmlns:*|
974 attributes. Report an error if |Element.prefix|
975 is |xmlns|.
976
977 * NanoDOM.pm (prefix): New attribute.
978
979 2007-05-19 Wakaba <wakaba@suika.fam.cx>
980
981 * HTML.pm.src: In |main| phase, |in body| insertion
982 mode, action for |<iframe>| was missing.
983
984 2007-05-19 Wakaba <wakaba@suika.fam.cx>
985
986 * ContentChecker.pm: Support for many of HTML5 elements.
987 ($GetHTMLNonNegativeIntegerAttrChecker): New.
988
989 2007-05-19 Wakaba <wakaba@suika.fam.cx>
990
991 * ContentChecker.pm: Support for most elements up to |progress|.
992 ($HTMLURIAttrChecker): Placeholder.
993 ($HTMLIntegerAttrChecker, $GetHTMLFloatingPointNumberAttrChecker):
994 New.
995
996 2007-05-19 Wakaba <wakaba@suika.fam.cx>
997
998 * ContentChecker.pm: Attribute checkers
999 for global attributes, |html|, |base|, |style|, and |meta|.
1000
1001 * NanoDOM.pm (insert_before): Weaken reference
1002 to the parent node.
1003 (Attr::new): Set |owner_element| attribute.
1004 (namespace_uri, manakai_local_name): New attribute implementations.
1005 (owner_element): New attribute.
1006
1007 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1008
1009 * ContentChecker.pm ($AttrChecker, $HTMLAttrChecker,
1010 $AnyChecker->{attr_checker}, $HTMLAttrsChecker,
1011 $Element->{$HTML_NS}->{''}): New.
1012 (check_element): Invoke attrs_checker for each element.
1013
1014 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1015
1016 * ContentChecker.pm: Don't use |manakai_element_type_match|.
1017
1018 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1019
1020 * ContentChecker.pm: Use hashes rather than lists for
1021 element type tests.
1022
1023 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1024
1025 * ContentChecker.pm: Don't generate a duplicate
1026 error when an element type is put in the "minus" list
1027 and the element type is not allowed explicitly in the particular
1028 element content model.
1029 (html:a checker): New checker.
1030 (html:details, html:datagrid): New checkers.
1031 (html:legend): New checker.
1032
1033 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1034
1035 * ContentChecker.pm (html:li checker): Implemented.
1036
1037 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1038
1039 * ContentChecker.pm ($HTMLInlineOrStriclyInlineChecker): New
1040 checker.
1041 (html:dd checker): New checker.
1042 (html:q, html:em, html:strong, html:small,
1043 html:m, html:dfn, html:code, html:samp, html:span): New checkers.
1044
1045 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1046
1047 * ContentChecker.pm ($AnyChecker): Renamed
1048 from |$ElementDefault->{checker}|.
1049 ($ElementDefault->{checker}): Throw an error that
1050 the element type is not supported by the checker.
1051 ($HTMLMetadataElement): |html:base| was missing.
1052 ($HTMLEmptyChecker): Don't throw an error
1053 for inter-element whitespace nodes.
1054 (html:html checker): Errors were not
1055 thrown even if |html:head| and/or |html:body|
1056 children were missing.
1057 (html:head checker): An error was not
1058 thrown if <meta charset> appeared after other
1059 elements.
1060
1061 2007-05-05 Wakaba <wakaba@suika.fam.cx>
1062
1063 * ContentChecker.pm: |footer|, |video|, |audio|, |script|,
1064 and |noscript| elements are implemented.
1065 (new): New method.
1066
1067 2007-05-04 Wakaba <wakaba@suika.fam.cx>
1068
1069 * ContentChecker.pm: New module.
1070
1071 2007-05-04 Wakaba <wakaba@suika.fam.cx>
1072
1073 * NanoDOM.pm (manakai_parent_element,
1074 document_element, manakai_local_name,
1075 manakai_element_type_match): New methods.
1076
1077 2007-05-03 Wakaba <wakaba@suika.fam.cx>
1078
1079 * HTML.pm.src: Replace decimal and hexadecimal numeric
1080 entities in C1 range using Windows-1252 mapping. Bare LF
1081 did not count as new line for error reporting.
1082
1083 2007-05-02 Wakaba <wakaba@suika.fam.cx>
1084
1085 * NanoDOM.pm (DOMImplementation): New class.
1086 (append_child): Weaken the |parent_node| reference.
1087 (create_element_ns, Element new): Set the |owner_document|
1088 reference.
1089 (implementation): New attribute.
1090 (owner_document, local_name, namespace_uri): New attributes.
1091
1092 * HTML.pm.src (parse_string): Line and column numbers
1093 are now provided to the error handler.
1094 (!!!parse-error): Short descriptions are added.
1095 (_construct_tree): Split into three methods; support
1096 for innerHTML mode.
1097 (set_inner_html): New method.
1098
1099 2007-05-01 Wakaba <wakaba@suika.fam.cx>
1100
1101 * NanoDOM.html: Documentation is added.
1102
1103 * HTML.pod, ContentType.html: Documentation is revised.
1104
1105 * .cvsignore: Pod2html temporary files are added.
1106
1107 * Makefile: Make |NanoDOM.html|.
1108
1109 2007-05-01 Wakaba <wakaba@suika.fam.cx>
1110
1111 * HTML.pm.src (parse_string): New method.
1112 (get_inner_html): Renamed from |inner_html|.
1113
1114 * Makefile: A rule for |HTML.html| is added.
1115
1116 * HTML.pod: New documentation.
1117
1118 2007-05-01 Wakaba <wakaba@suika.fam.cx>
1119
1120 * NanoDOM.pm (last_child, previous_sibling): New attributes.
1121 (clone_node): Attribute nodes were not completely copied.
1122
1123 * HTML.pm.src: Many bugs are fixed.
1124
1125 2007-04-30 Wakaba <wakaba@suika.fam.cx>
1126
1127 * HTML.pm.src: Some typos are fixed.
1128
1129 2007-04-30 Wakaba <wakaba@suika.fam.cx>
1130
1131 * mkhtmlparser.pl, Makefile: References to the |HTML-consume-entity.src|
1132 are removed.
1133
1134 * HTML.pm.src: Tokenizer's handling of named entities is rewritten.
1135
1136 * HTML-consume-entity.src: Removed.
1137
1138 2007-04-30 Wakaba <wakaba@suika.fam.cx>
1139
1140 * HTML.pm.src: Tokenizer's handling of hexadecimal
1141 numeric entities is rewritten.
1142
1143 2007-04-30 Wakaba <wakaba@suika.fam.cx>
1144
1145 * HTML.pm.src: Some tokenizer bugs are fixed.
1146
1147 2007-04-30 Wakaba <wakaba@suika.fam.cx>
1148
1149 * HTML.pm.src: The tree construction stage is implemented.
1150
1151 * mkhtmlparser.pl: New macros are added.
1152
1153 2007-04-28 Wakaba <wakaba@suika.fam.cx>
1154
1155 * ContentType.pm: A note on a bug in the specification
1156 is removed since it has now been fixed.
1157
1158 * .cvsignore: New file.
1159
1160 2007-04-28 Wakaba <wakaba@suika.fam.cx>
1161
1162 * HTML.pm.src, HTML-consume-entity.src: New files.
1163
1164 * Makefile (HTML.pm): New rule.
1165
1166 * mkhtmlparser.pl: New script.
1167
1168 2007-04-25 Wakaba <wakaba@suika.fam.cx>
1169
1170 * Makefile: New file.
1171
1172 2007-04-24 Wakaba <wakaba@suika.fam.cx>
1173
1174 * ContentType.pm: An error in pod is fixed.
1175
1176 2007-04-24 Wakaba <wakaba@suika.fam.cx>
1177
1178 * ContentType.pm: Documentation is added.
1179
1180 2007-04-24 Wakaba <wakaba@suika.fam.cx>
1181
1182 * ContentType.pm: New Perl module.
1183
1184 * ChangeLog: New file.
1185
1186
