/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Contents of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.191 - (show annotations) (download)
Mon Mar 17 13:23:38 2008 UTC (17 years, 4 months ago) by wakaba
Branch: MAIN
Changes since 1.190: +8 -0 lines
++ whatpm/Whatpm/ChangeLog	17 Mar 2008 13:23:31 -0000
	* HTML.pm.src: Set line/column information to element nodes.

	* mkhtmlparser.pl (!!!create-element, !!!insert-element,
	and !!!insert-element-t): Set line/column information to
	element nodes.

2008-03-17  Wakaba  <wakaba@suika.fam.cx>

1 2008-03-17 Wakaba <wakaba@suika.fam.cx>
2
3 * HTML.pm.src: Set line/column information to element nodes.
4
5 * mkhtmlparser.pl (!!!create-element, !!!insert-element,
6 and !!!insert-element-t): Set line/column information to
7 element nodes.
8
9 2008-03-17 Wakaba <wakaba@suika.fam.cx>
10
11 * HTML.pm.src (_get_next_token): The first "<" character
12 in "<?", "<>", or "</>" should be the error point.
13
14 2008-03-16 Wakaba <wakaba@suika.fam.cx>
15
16 * HTML.pm.src: Some more fixes on error position reporting.
17
18 2008-03-16 Wakaba <wakaba@suika.fam.cx>
19
20 * HTML.pm.src: Token-level precious error reporting.
21
22 2008-03-16 Wakaba <wakaba@suika.fam.cx>
23
24 * HTML.pm.src: Preparation for more precious error point
25 reporting.
26
27 2008-03-11 Wakaba <wakaba@suika.fam.cx>
28
29 * HTML.pm.src: Error type revised.
30
31 2008-03-11 Wakaba <wakaba@suika.fam.cx>
32
33 * HTML.pm.src: Similar codes are merged together, again.
34
35 2008-03-11 Wakaba <wakaba@suika.fam.cx>
36
37 * HTML.pm.src: Similar codes are merged together.
38
39 2008-03-10 Wakaba <wakaba@suika.fam.cx>
40
41 * mkhtmlparser.pl: Set "level" parameter to parse errors.
42
43 * HTML.pm.src: Code refined.
44
45 2008-03-09 Wakaba <wakaba@suika.fam.cx>
46
47 * HTML.pm.src: |</body>| treatement has been changed (HTML5
48 revision 1348). Note that I really don't know this makes
49 any difference in the black-box behavior of the parser.
50
51 2008-03-09 Wakaba <wakaba@suika.fam.cx>
52
53 * HTML.pm.src: New end-of-file token implementation (HTML5
54 revision 1348).
55
56 2008-03-09 Wakaba <wakaba@suika.fam.cx>
57
58 * HTML.pm.src: |applet| support (HTML5 revision 1347).
59
60 2008-03-09 Wakaba <wakaba@suika.fam.cx>
61
62 * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
63
64 2008-03-09 Wakaba <wakaba@suika.fam.cx>
65
66 * HTML.pm.src: Support for |<input>| in the "in select" insertion
67 mode and support for the "in select in table" insertion mode (HTML5
68 revision 1342).
69
70 2008-03-09 Wakaba <wakaba@suika.fam.cx>
71
72 * HTML.pm.src: No foster parenting for <script> and <script>
73 in non-tainted <table>s (HTML5 revision 1336).
74
75 2008-03-09 Wakaba <wakaba@suika.fam.cx>
76
77 * HTML.pm.src: Ignore white space characters between <html>
78 and <head> (HTML5 revision 1332).
79
80 2008-03-09 Wakaba <wakaba@suika.fam.cx>
81
82 * HTML.pm.src: Treat <input type=hidden> as if it were a
83 white space (HTML5 revision 1331).
84
85 2008-03-08 Wakaba <wakaba@suika.fam.cx>
86
87 * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
88 element (HTML5 revision 1330).
89
90 2008-03-08 Wakaba <wakaba@suika.fam.cx>
91
92 * HTML.pm.src: <title> is always appended to the current
93 element (HTML5 revision 1328).
94
95 2008-03-08 Wakaba <wakaba@suika.fam.cx>
96
97 * HTML.pm.src: White space in tainted tables are moved
98 into foster parents (HTML5 revision 1326).
99
100 2008-03-08 Wakaba <wakaba@suika.fam.cx>
101
102 * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
103 revision 1321).
104
105 2008-03-08 Wakaba <wakaba@suika.fam.cx>
106
107 * HTML.pm.src: |</h/n/>| case code rearranged to align with
108 the spec (HTML5 revision 1320). Note that we finally complete
109 all of HTML5 revision 1320 changes.
110
111 2008-03-08 Wakaba <wakaba@suika.fam.cx>
112
113 * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
114 tags (HTML5 revision 1320).
115
116 2008-03-08 Wakaba <wakaba@suika.fam.cx>
117
118 * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
119
120 2008-03-08 Wakaba <wakaba@suika.fam.cx>
121
122 * HTML.pm.src: A "generate implied end tags" code (t409.1)
123 could not be reached so that it is now removed (HTML5 revision
124 1320).
125
126 2008-03-08 Wakaba <wakaba@suika.fam.cx>
127
128 * HTML.pm.src: Code for the case of |</div>| and so on are revised
129 to align with new spec text (HTML5 revision 1320).
130
131 2008-03-08 Wakaba <wakaba@suika.fam.cx>
132
133 * HTML.pm.src: Remove strange |if| condition; however, it should
134 have had no harm in theory.
135
136 2008-03-08 Wakaba <wakaba@suika.fam.cx>
137
138 * HTML.pm.src (_tree_construction_main): '</p>' in body
139 case is split from other end tags for the preparation
140 of implementing HTML revision 1320.
141
142 2008-03-07 Wakaba <wakaba@suika.fam.cx>
143
144 * HTML.pm.src: Simplified "generate implied end tag" (HTML5
145 revision 1320).
146
147 2008-03-07 Wakaba <wakaba@suika.fam.cx>
148
149 * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
150 and "div" (HTML5 revision 1318). Add comments to where
151 |form| pointer association codes should be inserted (HTML5 revision
152 1319).
153
154 2008-03-06 Wakaba <wakaba@suika.fam.cx>
155
156 * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
157
158 2008-03-05 Wakaba <wakaba@suika.fam.cx>
159
160 * HTML.pm.src: Since the case t268 should never be reached (no
161 other token type, there are), it is replaced by a |die| statement.
162
163 2008-03-05 Wakaba <wakaba@suika.fam.cx>
164
165 * HTML.pm.src: Typo fixed.
166
167 2008-03-04 Wakaba <wakaba@suika.fam.cx>
168
169 * HTML.pm.src (_tree_construction_initial): Some limited quirks
170 doctypes were not uppercased for comparison.
171
172 2008-03-03 Wakaba <wakaba@suika.fam.cx>
173
174 * HTML.pm.src (tree construction and set_inner_html): Checkpoints
175 are added.
176
177 2008-03-03 Wakaba <wakaba@suika.fam.cx>
178
179 * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
180 are set. Cases that are unlikely reached are noted as so.
181
182 2008-03-03 Wakaba <wakaba@suika.fam.cx>
183
184 * HTML.pm.src: Checkpoints for debugging are added.
185
186 * mkhtmlparser.pl: Support for |!!!cp| syntax.
187
188 2008-03-03 Wakaba <wakaba@suika.fam.cx>
189
190 * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
191 for simplicity.
192
193 2008-03-03 Wakaba <wakaba@suika.fam.cx>
194
195 * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
196 revision 1307).
197
198 2008-03-03 Wakaba <wakaba@suika.fam.cx>
199
200 * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
201 is set are changed (HTML5 revision 1305).
202
203 2008-03-02 Wakaba <wakaba@suika.fam.cx>
204
205 * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
206 1292). Entities are not parsed in comment-like part in RCDATA
207 elements (HTML5 revision 1294). Allow bare & at the end
208 of attribute value literals (HTML5 revision 1296). More
209 quirks mode doctypes (HTML5 revision 1302). Requires spaces
210 between attributes and ban attribute names or unquoted
211 attribute values containing single or double quotes (HTML5
212 revision 1303).
213
214 2008-03-02 Wakaba <wakaba@suika.fam.cx>
215
216 * ContentChecker.pm: Typo fixed. Don't raise "character encoding"
217 and related errors unless it is an HTML document (though the spec
218 is unclear on whether it is applied to XHTML document).
219
220 * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
221 are added.
222
223 2008-03-02 Wakaba <wakaba@suika.fam.cx>
224
225 * HTML.pm.src: s/local_name/manakai_local_name/g.
226
227 2008-03-01 Wakaba <wakaba@suika.fam.cx>
228
229 * _NamedEntityList.pm: Updated (HTML5 revision 1286).
230
231 * HTML.pm.src: |charset| in |content| attribute is
232 case-insensitive (HTML5 revision 1270).
233
234 2008-02-26 Wakaba <wakaba@suika.fam.cx>
235
236 * ContentChecker.pm: New status constants are added.
237 ($ElementDefault): |status| added.
238 (check_element): Err for non-standard or deprecated elements.
239 (_attr_status_info): For non-standard or deprecated attributes.
240
241 2008-02-24 Wakaba <wakaba@suika.fam.cx>
242
243 * ContentChecker.pm (_attr_status_info): New internal method.
244
245 2008-02-24 Wakaba <wakaba@suika.fam.cx>
246
247 * ContentChecker.pm (check_element): Element standardized
248 status information is now dispatched.
249
250 2008-02-24 Wakaba <wakaba@suika.fam.cx>
251
252 * ContentChecker.pm (check_element): Fix |del|-and-significant
253 problem by adding some more arguments.
254
255 2008-02-24 Wakaba <wakaba@suika.fam.cx>
256
257 * ContentChecker.pm (check_element): Use context of
258 container-for-the-purpose-of-content-model element (not transparent
259 element) for |check_child_element| calling and significant
260 text flag marking. This reintroduces |<del>|-and-significant
261 problem again.
262
263 2008-02-24 Wakaba <wakaba@suika.fam.cx>
264
265 * ContentChecker.pm (check_element): Make semi-transparent
266 elements ignored for the purpose of phase changes in
267 content model checking.
268
269 2008-02-23 Wakaba <wakaba@suika.fam.cx>
270
271 * ContentChecker.pm (check_element): In-element state
272 was not properly managed for transparent cases.
273
274 2008-02-23 Wakaba <wakaba@suika.fam.cx>
275
276 * ContentChecker.pm (check_element): Support for |video|
277 and |audio| as semi-transparent elements.
278
279 2008-02-23 Wakaba <wakaba@suika.fam.cx>
280
281 * ContentChecker.pm ($HTMLSemiTransparentElements): New.
282 (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
283 Support for |html:object| as a semi-transparent element.
284
285 2008-02-23 Wakaba <wakaba@suika.fam.cx>
286
287 * ContentChecker.pm (check_element): The way to traverse
288 the tree is entirely revised to make it easier to track
289 the state of ancestors/descendants. As a result of this
290 revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
291 support for content model checking for HTML elements |figure|,
292 |object|, |video|, and |audio| and checking for XML elements (and
293 some XMLNS checkings) are dropped for now. They will be
294 reimplemented in due cource.
295
296 2008-02-17 Wakaba <wakaba@suika.fam.cx>
297
298 * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
299 token (HTML5 revision 1225).
300
301 2008-02-17 Wakaba <wakaba@suika.fam.cx>
302
303 * ContentChecker.pm ({unsupported_level}): New value.
304
305 * HTML.pm.src: Save whether |meta| |content| attribute
306 contains character references or not.
307
308 2008-02-17 Wakaba <wakaba@suika.fam.cx>
309
310 * ContentChecker.pm (_get_children): (Incompleted) attempt to
311 imlement significant content checking for contents
312 with |del| elements.
313
314 2008-02-17 Wakaba <wakaba@suika.fam.cx>
315
316 * ContenteChecker.pm ($HTMLTransparentElements): More
317 elements are added.
318 (_get_children): HTML |object| elements are now semi-transparent.
319
320 * NanoDOM.pm (manakai_html, manakai_head): New methods.
321
322 2008-02-16 Wakaba <wakaba@suika.fam.cx>
323
324 * CacheManifest.pm: HTML5 revision 1211 implemented.
325
326 * CacheManifest.pod: Updated.
327
328 2008-02-10 Wakaba <wakaba@suika.fam.cx>
329
330 * ContentChecker.pm (check_document, check_element): Support
331 for second argument ($onsubdoc).
332 (_get_css_parser): Removed (now it is part of WDCC).
333
334 2008-02-09 Wakaba <wakaba@suika.fam.cx>
335
336 * ContentChecker.pm (_get_css_parser): New.
337
338 2007-11-25 Wakaba <wakaba@suika.fam.cx>
339
340 * ContentChecker.pm ($AnyChecker): Old way to add child elements
341 for checking had been used.
342
343 2007-11-25 Wakaba <wakaba@suika.fam.cx>
344
345 * ContentChecker.pm (check_element): New todo item type |descendant|.
346
347 2007-11-23 Wakaba <wakaba@suika.fam.cx>
348
349 * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
350 specced in RFC 2046 and RFC 4288.
351 (application/atom+xml): Definition added.
352
353 2007-11-23 Wakaba <wakaba@suika.fam.cx>
354
355 * URIChecker.pm: Make RFC 3986 should-level errors
356 warnings (rather than SHOULD-level errors).
357
358 2007-11-23 Wakaba <wakaba@suika.fam.cx>
359
360 * NanoDOM.pm (get_user_data, set_user_data): New methods.
361
362 * HTML.pm.src: A flag for character references in attribute
363 values are added. Set |manakai_has_reference| user data
364 to |charset| attribute.
365
366 2007-11-23 Wakaba <wakaba@suika.fam.cx>
367
368 * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
369 attributes.
370
371 * ContentChecker.pm (check_document): Warn if charset requirements
372 cannot be tested.
373
374 2007-11-19 Wakaba <wakaba@suika.fam.cx>
375
376 * HTML.pm.src (parse_byte_string): Detect charset
377 by universalchardet if charset parameter is not specified.
378
379 * Makefile (Charset-all, Charset-clean): New rules.
380
381 2007-11-18 Wakaba <wakaba@suika.fam.cx>
382
383 * ContentChecker.pm (check_document): Check the existence
384 of character encoding declaration and selection of encoding
385 for HTML document.
386
387 2007-11-18 Wakaba <wakaba@suika.fam.cx>
388
389 * ContentType.pm (get_sniffed_type): Return also the official
390 type in list context.
391
392 2007-11-18 Wakaba <wakaba@suika.fam.cx>
393
394 * ContentType.pm: Sniffing with leading white space
395 ignoring (HTML5 revisions 1013 and 1016).
396
397 2007-11-18 Wakaba <wakaba@suika.fam.cx>
398
399 * ContentType.pm: HTML5 revision 1013 changes, except for leading
400 white spaces, are implemented.
401
402 2007-11-11 Wakaba <wakaba@suika.fam.cx>
403
404 * HTML.pm.src (parse_char_string): Set |inner_encoding|
405 attribute if possible.
406
407 2007-11-11 Wakaba <wakaba@suika.fam.cx>
408
409 * HTML.pm.src (parse_byte_string): New method.
410 (parse_char_string): New alias for |parse_string|.
411 (main phase): Invoking "change the encoding" algorithm if desired.
412
413 * HTML.pod: Updated.
414
415 2007-11-11 Wakaba <wakaba@suika.fam.cx>
416
417 * HTML.pod (get_inner_html): Removed.
418
419 * Makefile (HTML-all, HTML-clean): New.
420
421 2007-11-11 Wakaba <wakaba@suika.fam.cx>
422
423 * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
424
425 2007-11-08 Wakaba <wakaba@suika.fam.cx>
426
427 * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
428
429 2007-11-04 Wakaba <wakaba@suika.fam.cx>
430
431 * Makefile: |CacheManifest.html| is added.
432
433 * CacheManifest.pod: New file.
434
435 2007-11-04 Wakaba <wakaba@suika.fam.cx>
436
437 * CacheManifest.pm: New module.
438
439 2007-11-04 Wakaba <wakaba@suika.fam.cx>
440
441 * HTML.pm.src: Support for application cache selection algorithm
442 callback.
443
444 2007-11-04 Wakaba <wakaba@suika.fam.cx>
445
446 * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
447
448 2007-10-17 Wakaba <wakaba@suika.fam.cx>
449
450 * Makefile (clean): New rule.
451
452 * NanoDOM.pm (public_id, system_id): New attributes.
453
454 2007-10-17 Wakaba <wakaba@suika.fam.cx>
455
456 * Makefile (CSS-all, CSS-clean, clean): New rules.
457
458 2007-10-14 Wakaba <wakaba@suika.fam.cx>
459
460 * ContentChecker.pm (check_document): Support for
461 new |is_xml_root| flag.
462 (check_element): Support for new |pluses| state.
463 (_add_pluses): New method.
464 (_remove_minuses): Support for new |minus| item.
465
466 2007-09-24 Wakaba <wakaba@suika.fam.cx>
467
468 * ContentChecker.pm: Raise specific error for invalid
469 root element.
470
471 2007-09-24 Wakaba <wakaba@suika.fam.cx>
472
473 * ContentChecker.pm: Set level values for later uses.
474
475 2007-09-09 Wakaba <wakaba@suika.fam.cx>
476
477 * ContentChecker.pm: Support for language tag validation.
478
479 2007-09-09 Wakaba <wakaba@suika.fam.cx>
480
481 * LangTag.pm (check_rfc3066_language_tag): New method.
482
483 2007-09-09 Wakaba <wakaba@suika.fam.cx>
484
485 * LangTag.pm: New module.
486
487 2007-09-04 Wakaba <wakaba@suika.fam.cx>
488
489 * HTML.pm.src: Some error types were wrong.
490
491 2007-08-17 Wakaba <wakaba@suika.fam.cx>
492
493 * CSS/: New directory.
494
495 2007-08-17 Wakaba <wakaba@suika.fam.cx>
496
497 * ContentChecker.pm (_check_get_children): Support
498 for |noscript| in |head|.
499
500 2007-08-12 Wakaba <wakaba@suika.fam.cx>
501
502 * URI/: New directory.
503
504 2007-08-11 Wakaba <wakaba@suika.fam.cx>
505
506 * HTML.pm.src: Tokenizer's states are now represented in
507 number.
508
509 2007-08-11 Wakaba <wakaba@suika.fam.cx>
510
511 * HTML.pm.src: |or|s for insertion modes are replaced
512 by |&|s.
513
514 2007-08-11 Wakaba <wakaba@suika.fam.cx>
515
516 * HTML.pm.src: Token types are now represented in number.
517
518 2007-08-11 Wakaba <wakaba@suika.fam.cx>
519
520 * ContentType.pm (SEE ALSO): Updated.
521
522 * HTML.pm.src: Insertion modes are now represented in number.
523
524 2007-08-11 Wakaba <wakaba@suika.fam.cx>
525
526 * ContentType.pm: Sniffing for bitmap images (HTML5 revision
527 999) is implemented.
528
529 2007-08-08 Wakaba <wakaba@suika.fam.cx>
530
531 * ContentType.pm: Sniffing for |<script| (HTML5 revision
532 983) is implemented.
533
534 2007-08-06 Wakaba <wakaba@suika.fam.cx>
535
536 * ContentChecker.pod: New documentation.
537
538 * Makefile: A rule for |ContentChecker.html| is added.
539
540 * ContentChecker.pm: A pod "LICENSE" section is added.
541
542 * NanoDOM.pm ($VERSION): New variable.
543
544 2007-08-05 Wakaba <wakaba@suika.fam.cx>
545
546 * H2H.pm: |b|, |i|, and |sub| are added to the
547 list of allowed HTML elements.
548
549 2007-08-05 Wakaba <wakaba@suika.fam.cx>
550
551 * H2H.pm: |samp| is added to the list of allowed
552 HTML elements.
553
554 * URIChecker.pm (check_iri): New.
555 (check_iri_reference): Error type for IRI reference
556 syntax error is changed.
557
558 2007-08-04 Wakaba <wakaba@suika.fam.cx>
559
560 * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
561 is added.
562 (check_document): Load appropriate module before validation.
563
564 2007-08-04 Wakaba <wakaba@suika.fam.cx>
565
566 * ContentChecker/: New directory.
567
568 2007-08-04 Wakaba <wakaba@suika.fam.cx>
569
570 * ContentChecker.pm: HTML |time| element is implemented.
571
572 * HTMLTable.pm: Comments are updated as HTML5 is revised.
573
574 2007-07-21 Wakaba <wakaba@suika.fam.cx>
575
576 * ContentChecker.pm (check_document): Return value
577 even if no document element is found.
578
579 2007-07-21 Wakaba <wakaba@suika.fam.cx>
580
581 * HTML.pm.src: |$in_body| is no longer a function.
582
583 2007-07-21 Wakaba <wakaba@suika.fam.cx>
584
585 * HTML.pm.src: The |$in_body| code has been moved down.
586
587 2007-07-21 Wakaba <wakaba@suika.fam.cx>
588
589 * HTML.pm.src: The "trailing end" insertion mode
590 is split into "after html body" and "after html frameset"
591 insertion modes. Their codes are merged with "after body"
592 and "after frameset" codes. |$previous_insertion_mode|
593 has been removed. "after frameset" code is
594 merged with "in frameset" code.
595
596 2007-07-21 Wakaba <wakaba@suika.fam.cx>
597
598 * HTML.pm.src: The "before head" insertion mode is
599 merged with the "in head" insertion mode.
600
601 2007-07-21 Wakaba <wakaba@suika.fam.cx>
602
603 * HTML.pm.src: Cases in "in head" insertion mode are
604 reorganized.
605
606 2007-07-21 Wakaba <wakaba@suika.fam.cx>
607
608 * HTML.pm.src: Some cases in "in table" insertion mode
609 are merged.
610
611 2007-07-21 Wakaba <wakaba@suika.fam.cx>
612
613 * HTML.pm.src: The "in row" insertion mode is merged
614 with "in table" insertion mode.
615
616 2007-07-21 Wakaba <wakaba@suika.fam.cx>
617
618 * HTML.pm.src: The "in table" and "in table body" insertion
619 modes are merged.
620
621 2007-07-21 Wakaba <wakaba@suika.fam.cx>
622
623 * HTML.pm.src: There is no "in table head" or "in table foot"
624 insertion mode!
625
626 2007-07-21 Wakaba <wakaba@suika.fam.cx>
627
628 * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
629 now directly invoke the handler.
630
631 2007-07-21 Wakaba <wakaba@suika.fam.cx>
632
633 * HTML.pm.src: Codes for "in cell" insertion mode
634 is merged to the "in body" insertion mode code.
635
636 2007-07-21 Wakaba <wakaba@suika.fam.cx>
637
638 * HTML.pm.src: Codes for "in body" and "in caption"
639 insertion modes are merged.
640
641 2007-07-21 Wakaba <wakaba@suika.fam.cx>
642
643 * HTML.pm.src: Two |!!!next-token|s were missing.
644
645 2007-07-21 Wakaba <wakaba@suika.fam.cx>
646
647 * HTML.pm.src: Use numeric constant for |{content_mode}|
648 instead of string constant for |{content_model_flag}|.
649
650 2007-07-21 Wakaba <wakaba@suika.fam.cx>
651
652 * HTML.pm.src: Add the name of the attribute
653 to the "duplicate attribute" error.
654
655 2007-07-17 Wakaba <wakaba@suika.fam.cx>
656
657 * ContentChecker.pm: Return the |class| node list.
658
659 2007-07-17 Wakaba <wakaba@suika.fam.cx>
660
661 * ContentChecker.pm: Return the |id| node list.
662
663 * HTML.pm.src: A typo is fixed.
664
665 2007-07-16 Wakaba <wakaba@suika.fam.cx>
666
667 * ContentChecker.pm: Drop wrong |level => 'error'| specification
668 from "in HTML:xml:lang" error. Character position
669 is now the last part of the error type in the URI error
670 description. Report "unsupported" status for language
671 tags, media queries, script codes, and style sheets.
672
673 2007-07-16 Wakaba <wakaba@suika.fam.cx>
674
675 * ContentChecker.pm: Report error if |xml:lang|
676 in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
677 in XML.
678
679 * NanoDOM.pm (Attr.owner_document): New attribute.
680
681 2007-07-16 Wakaba <wakaba@suika.fam.cx>
682
683 * HTML.pm.src: The character immediately following
684 a bare |hcro| was discarded. Fix handling of
685 entity references in attribute values.
686
687 2007-07-16 Wakaba <wakaba@suika.fam.cx>
688
689 * HTML.pm.src (main and trailin end phases): Token
690 types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
691 factored out. Error types |in html:#DOCTYPE|
692 and |after html:#DOCTYPE| are merged
693 into |DOCTYPE in the middle|. |</frameset>|
694 in fragment parsing mode changed the
695 insertion mode.
696
697 2007-07-16 Wakaba <wakaba@suika.fam.cx>
698
699 * HTML.pm.src: |$phase| has been removed; The |trailing end|
700 phase is now an insertion mode. Treatments for white
701 space character tokens were incorrect for some
702 insertion modes. An old |meta| case was not removed.
703
704 2007-07-16 Wakaba <wakaba@suika.fam.cx>
705
706 * HTML.pm.src: |meta| charset declaration extraction
707 implemented (but changing the encoding is not yet:-).
708
709 2007-07-15 Wakaba <wakaba@suika.fam.cx>
710
711 * Charset/: New directory.
712
713 2007-07-15 Wakaba <wakaba@suika.fam.cx>
714
715 * H2H.pm: New Perl module (created from
716 manakai's H2H.dis).
717
718 2007-07-15 Wakaba <wakaba@suika.fam.cx>
719
720 * XMLSerializer.pm: New Perl module (created from
721 manakai's SimpleLS.dis).
722
723 2007-07-07 Wakaba <wakaba@suika.fam.cx>
724
725 * HTML.pm.src: |<!---x-->| was not processed correctly.
726
727 2007-07-01 Wakaba <wakaba@suika.fam.cx>
728
729 * HTML.pm.src: Report correct error message
730 for |<body></div></body>|.
731
732 2007-07-01 Wakaba <wakaba@suika.fam.cx>
733
734 * HTMLTable.pm: An error description was incorrect.
735
736 2007-06-30 Wakaba <wakaba@suika.fam.cx>
737
738 * ContentChecker.pm: Return |{term}| list.
739
740 2007-06-30 Wakaba <wakaba@suika.fam.cx>
741
742 * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
743 nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
744 and <title> outside of head).
745
746 2007-06-30 Wakaba <wakaba@suika.fam.cx>
747
748 * IMTChecker.pm: Report warning for unregistered
749 and private types/subtypes.
750
751 * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
752 URIChecker.pm, HTMLTable.pm: Error messages are now
753 consistent; they are all listed in
754 <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
755
756 2007-06-25 Wakaba <wakaba@suika.fam.cx>
757
758 * ContentChecker.pm: |<img ismap>| not in |<a></a>|
759 is now erred. |<datalist>| is implemented.
760 Attribute checker for |<command>| and |<menu>| are
761 added. Support for |contextmenu| global attribute
762 is added.
763
764 2007-06-25 Wakaba <wakaba@suika.fam.cx>
765
766 * HTML.pm.src (_reset_insertion_mode): Interpretation
767 of Step 3 has been changed.
768
769 2007-06-25 Wakaba <wakaba@suika.fam.cx>
770
771 * HTML.pm.src: Late |<html>| parse error is implemented.
772
773 2007-06-24 Wakaba <wakaba@suika.fam.cx>
774
775 * URIChecker.pm (check_iri_reference): A |decode| method name was
776 incorrect.
777
778 * ContentChecker.pm: Support for the |footer| element.
779 Check URI syntax for space-separated URI attributes.
780 Support for the |tabindex| attribute. Support
781 for |datetime| attribute.
782
783 2007-06-24 Wakaba <wakaba@suika.fam.cx>
784
785 * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
786 character references). HTML5 revision 1.146 (white space
787 characters before root start tag). HTML5 revision
788 1.148 (named character references in attribute values).
789 HTML5 revision 1.152 (<plaintext>.innerHTML get).
790
791 2007-06-24 Wakaba <wakaba@suika.fam.cx>
792
793 * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
794 in <head>).
795
796 2007-06-24 Wakaba <wakaba@suika.fam.cx>
797
798 * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>
799 in body).
800
801 * ContentChecker.pm: HTML5 revision 938 (scoped="").
802
803 2007-06-24 Wakaba <wakaba@suika.fam.cx>
804
805 * HTML.pm.src: HTML5 revision 923 (matching end tag in CDATA
806 or RCDATA in fragment parsing mode). HTML5 revision
807 924 (<!--> and <!--->). HTML5 revision 926 (hn in hn).
808
809 2007-06-23 Wakaba <wakaba@suika.fam.cx>
810
811 * HTML.pm.src (get_inner_html): HTML5 revision 922 (inner_html
812 for <pre> and <textarea>).
813
814 2007-06-23 Wakaba <wakaba@suika.fam.cx>
815
816 * HTML.pm.src: HTML5 revision 920 (<isindex>).
817
818 2007-06-23 Wakaba <wakaba@suika.fam.cx>
819
820 * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).
821
822 2007-06-23 Wakaba <wakaba@suika.fam.cx>
823
824 * HTML.pm.src: HTML5 revision 916 (</body>).
825 HTML5 revision 917 (conforming bare &).
826
827 2007-06-23 Wakaba <wakaba@suika.fam.cx>
828
829 * NanoDOM.pm (manakai_is_html): Setting to false did not work.
830
831 * HTML.pm.src: HTML5 revision 914 (</ in CDATA, RCDATA).
832 HTML5 revision 915 (<nobr>).
833
834 2007-06-23 Wakaba <wakaba@suika.fam.cx>
835
836 * HTML.pm.src: HTML5 revisions 908, 909, 912, and 913 (quirks mode).
837
838 * NanoDOM.pm (manakai_is_html, manakai_compat_mode, compat_mode):
839 New attributes.
840
841 2007-06-23 Wakaba <wakaba@suika.fam.cx>
842
843 * HTML.pm.src: HTML5 revisions 900, 901, 902, and 911 (<
844 in tags).
845
846 2007-06-23 Wakaba <wakaba@suika.fam.cx>
847
848 * .cvsignore: |Entities.html| is added.
849
850 * HTML.pm.src: |$entity_char| is removed and
851 requires |Whatpm::_NamedEntityList| instead.
852 HTML5 revision 898 (refc), except that lack of refc
853 is parse error.
854
855 * mkentitylist.pl: New script.
856
857 * Makefile (all): |_NamedEntityList.pm| is added.
858 (_NamedEntityList.pm, Entities.html): New rules.
859
860 2007-06-23 Wakaba <wakaba@suika.fam.cx>
861
862 * HTML.pm.src: Parse errors immediately after U+000D
863 were ignored and U+000D immediately following another
864 U+000D was not converted to U+000A.
865
866 2007-06-23 Wakaba <wakaba@suika.fam.cx>
867
868 * HTML.pm.src (set_inner_html): HTML5 revision 892 (adopt
869 nodes before appended). Parser was not ready for NULL
870 parse error and escape flag.
871
872 * NanoDOM.pm (adopt_node): New.
873
874 2007-06-23 Wakaba <wakaba@suika.fam.cx>
875
876 * HTML.pm.src: HTML5 revision 886 (insane comment in
877 CDATA and RCDATA). Note that current implementation
878 is simply repeating what the spec says and it is maybe not
879 a best way to do it.
880
881 2007-06-23 Wakaba <wakaba@suika.fam.cx>
882
883 * HTML.pm.src: HTML5 revision 884 (</form> don't close
884 the form element if a descendant element without implied end tag has
885 still been open).
886
887 2007-06-23 Wakaba <wakaba@suika.fam.cx>
888
889 * ContentChecker.pm: HTML5 revision 881 (Make |id| attribute with
890 space characters non-conforming).
891
892 2007-06-23 Wakaba <wakaba@suika.fam.cx>
893
894 * HTML.pm.src: An error message was incorrect.
895 HTML5 revision 869 (C1 character references).
896
897 2007-06-23 Wakaba <wakaba@suika.fam.cx>
898
899 * HTML.pm.src: HTML5 revision 867 (a LF at the beginning of
900 a |textarea| is removed).
901
902 2007-06-05 Wakaba <wakaba@suika.fam.cx>
903
904 * NanoDOM.pm (get_attribute_node_ns): New method.
905
906 * ContentChecker.pm: |script| |async| and |defer|
907 no longer require |src|. |async| MUST NOT be
908 specified if |defer|. (HTML5 revision 858).
909
910 2007-05-30 Wakaba <wakaba@suika.fam.cx>
911
912 * HTML.pm.src: |<form><form>| went to inifinite loop.
913
914 2007-05-27 Wakaba <wakaba@suika.fam.cx>
915
916 * ContentChecker.pm (html): Set |is_root| (allowed
917 as a document element) flag on.
918 (new): Removed.
919 (check_document): New method.
920
921 2007-05-27 Wakaba <wakaba@suika.fam.cx>
922
923 * ContentChecker.pm (thead, tfoot): Checker specifications
924 were incorrect.
925
926 2007-05-27 Wakaba <wakaba@suika.fam.cx>
927
928 * ContentChecker.pm ($HTMLURIAttrChecker): Include
929 error position in the |type| option of the error.
930
931 * HTMLTable.pm (form_table): The |$onerror| parameter
932 is now optional. Some bugs are fixed.
933
934 2007-05-27 Wakaba <wakaba@suika.fam.cx>
935
936 * HTMLTable.pm: New module.
937
938 * ContentChecker.pm (table): Invoke table model error checker.
939
940 * NanoDOM.pm (first_child, get_attribute_ns): New.
941
942 2007-05-26 Wakaba <wakaba@suika.fam.cx>
943
944 * ContentChecker.pm ($HTMLLinkTypesAttrChecker): New checker.
945 (link/@rel, a/@rel, area/@rel): Use new checker.
946
947 * Makefile (_LinkTypeList.pm, RelExtensions.html): New rules.
948
949 * _LinkTypeList.pm: New file.
950
951 * mklinktypelist.pl: New file.
952
953 * .cvsignore: |RelExtensions.html| added.
954
955 * NanoDOM.pm (child_nodes): Returns an empty array
956 for non-child-containing node types.
957 (text_content): New attribute.
958
959 2007-05-26 Wakaba <wakaba@suika.fam.cx>
960
961 * IMTChecker.pm: New module.
962
963 * ContentChecker.pm ($HTMLIMTAttrChecker): Call IMTChecker
964 to test parameter value validity.
965
966 * HTML.pm.src ($style_start_tag): Attributes were
967 discarded.
968
969 2007-05-25 Wakaba <wakaba@suika.fam.cx>
970
971 * ContentChecker.pm ($HTMLURIAttrChecker): Implemented.
972
973 2007-05-25 Wakaba <wakaba@suika.fam.cx>
974
975 * URIChecker.pm: All recommendations from RFC 3986
976 and RFC 3987 are listed (not all testable items are checked yet).
977
978 2007-05-25 Wakaba <wakaba@suika.fam.cx>
979
980 * URIChecker.pm: New module.
981
982 2007-05-20 Wakaba <wakaba@suika.fam.cx>
983
984 * ContentChecker.pm: Now most attributes are implemented
985 or associated to some placeholder.
986 ($ElementDefault): Warn unknown attributes
987 for unknown elements as "attribute not supported".
988 ($HTMLLanguageTagAttrChecker, $HTMLMQAttrChecker): New placeholders.
989 ($HTMLUsemapAttrChecker, $HTMLTargetAttrChecker): New checkers.
990 (|a| attribute checker): Reimplemented.
991
992 2007-05-20 Wakaba <wakaba@suika.fam.cx>
993
994 * ContentChecker.pm ($HTMLEventHandlerAttrChecker): New placeholder.
995 ($HTMLAttrChecker): Event handler content attributes
996 are added.
997 (link, embed): Required attribute is now checked.
998 (embed): Unknown local attributes are no longer warned.
999
1000 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1001
1002 * ContentChecker.pm ($HTMLSpaceURIsAttrChecker): New placeholder.
1003 ($HTMLIMTAttrChecker): New checker.
1004 (link@rel, link@href, link@type, style@type,
1005 a@href, a@ping, a@ping, a@type, embed@src, embed@type,
1006 object@data, object@type, source@src, source@type, area@alt,
1007 area@shape, area@coords, area@href,
1008 area@ping, area@rel, area@type, script@src,
1009 script@defer, script@async, script@type): Checkers added.
1010
1011 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1012
1013 * ContentChecker.pm: Descendant checking was incorrect.
1014
1015 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1016
1017 * ContentChecker.pm: Support |xml:*| and |xmlns:*|
1018 attributes. Report an error if |Element.prefix|
1019 is |xmlns|.
1020
1021 * NanoDOM.pm (prefix): New attribute.
1022
1023 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1024
1025 * HTML.pm.src: In |main| phase, |in body| insertion
1026 mode, action for |<iframe>| was missing.
1027
1028 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1029
1030 * ContentChecker.pm: Support for many of HTML5 elements.
1031 ($GetHTMLNonNegativeIntegerAttrChecker): New.
1032
1033 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1034
1035 * ContentChecker.pm: Support for most elements up to |progress|.
1036 ($HTMLURIAttrChecker): Placeholder.
1037 ($HTMLIntegerAttrChecker, $GetHTMLFloatingPointNumberAttrChecker):
1038 New.
1039
1040 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1041
1042 * ContentChecker.pm: Attribute checkers
1043 for global attributes, |html|, |base|, |style|, and |meta|.
1044
1045 * NanoDOM.pm (insert_before): Weaken reference
1046 to the parent node.
1047 (Attr::new): Set |owner_element| attribute.
1048 (namespace_uri, manakai_local_name): New attribute implementations.
1049 (owner_element): New attribute.
1050
1051 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1052
1053 * ContentChecker.pm ($AttrChecker, $HTMLAttrChecker,
1054 $AnyChecker->{attr_checker}, $HTMLAttrsChecker,
1055 $Element->{$HTML_NS}->{''}): New.
1056 (check_element): Invoke attrs_checker for each element.
1057
1058 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1059
1060 * ContentChecker.pm: Don't use |manakai_element_type_match|.
1061
1062 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1063
1064 * ContentChecker.pm: Use hashs rather than lists for
1065 element type testings.
1066
1067 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1068
1069 * ContentChecker.pm: Don't generate duplicate
1070 error when an element type is put in the "minus" list
1071 and the element type is not allowed explicitly in the particular
1072 element content model.
1073 (html:a checker): New checker.
1074 (html:details, html:datagrid): New checkers.
1075 (html:legend): New checker.
1076
1077 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1078
1079 * ContentChecker.pm (html:li checker): Implemented.
1080
1081 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1082
1083 * ContentChecker.pm ($HTMLInlineOrStriclyInlineChecker): New
1084 checker.
1085 (html:dd checker): New checker.
1086 (html:q, html:em, html:strong, html:small,
1087 html:m, html:dfn, html:code, html:samp, html:span): New checkers.
1088
1089 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1090
1091 * ContentChecker.pm ($AnyChecker): Renamed
1092 from |$ElementDefault->{checker}|.
1093 ($ElementDefault->{checker}): Throw an error that
1094 the element type is not supported by the checker.
1095 ($HTMLMetadataElement): |html:base| was missing.
1096 ($HTMLEmptyChecker): Don't throw an error
1097 for inter-element whitespace nodes.
1098 (html:html checker): Errors were not
1099 thrown even if |html:head| and/or |html:body|
1100 children were missing.
1101 (html:head checker): An error was not
1102 thrown if <meta charset> appered after other
1103 elements.
1104
1105 2007-05-05 Wakaba <wakaba@suika.fam.cx>
1106
1107 * ContentChecker.pm: |footer|, |video|, |audio|, |script|,
1108 and |noscript| elements are implemented.
1109 (new): New method.
1110
1111 2007-05-04 Wakaba <wakaba@suika.fam.cx>
1112
1113 * ContentChecker.pm: New module.
1114
1115 2007-05-04 Wakaba <wakaba@suika.fam.cx>
1116
1117 * NanoDOM.pm (manakai_parent_element,
1118 document_element, manakai_local_name,
1119 manakai_element_type_match): New method.
1120
1121 2007-05-03 Wakaba <wakaba@suika.fam.cx>
1122
1123 * HTML.pm.src: Replace decimal and hexadecimal numeric
1124 entities in C1 range using Windows-1252 mapping. Bare LF
1125 did not count as new line for error reporting.
1126
1127 2007-05-02 Wakaba <wakaba@suika.fam.cx>
1128
1129 * NanoDOM.pm (DOMImplementation): New class.
1130 (append_child): Weaken the |parent_node| reference.
1131 (create_element_ns, Element new): Set the |owner_document|
1132 reference.
1133 (implementation): New attribute.
1134 (owner_document, local_name, namespace_uri): New attributes.
1135
1136 * HTML.pm.src (parse_string): Line and column numbers
1137 are now provided to error handler.
1138 (!!!parse-error): Short descriptions are added.
1139 (_construct_tree): Split into three methods; support
1140 for innerHTML mode.
1141 (set_inner_html): New method.
1142
1143 2007-05-01 Wakaba <wakaba@suika.fam.cx>
1144
1145 * NanoDOM.html: Documentation is added.
1146
1147 * HTML.pod, ContentType.html: Documentation is revised.
1148
1149 * .cvsignore: Pod2html temporary files are added.
1150
1151 * Makefile: Make |NanoDOM.html|.
1152
1153 2007-05-01 Wakaba <wakaba@suika.fam.cx>
1154
1155 * HTML.pm.src (parse_string): New method.
1156 (get_inner_html): Renamed from |inner_html|.
1157
1158 * Makefile: A rule for |HTML.html| is added.
1159
1160 * HTML.pod: New documentation.
1161
1162 2007-05-01 Wakaba <wakaba@suika.fam.cx>
1163
1164 * NanoDOM.pm (last_child, previous_sibling): New attributes.
1165 (clone_node): Attribute nodes were not completely copied.
1166
1167 * HTML.pm.src: Many bugs are fixed.
1168
1169 2007-04-30 Wakaba <wakaba@suika.fam.cx>
1170
1171 * HTML.pm.src: Some typos are fixed.
1172
1173 2007-04-30 Wakaba <wakaba@suika.fam.cx>
1174
1175 * mkhtmlparser.pl, Makefile: References to the |HTML-consume-entity.src|
1176 are removed.
1177
1178 * HTML.pm.src: Tokenizer's handling on named entities are rewritten.
1179
1180 * HTML-consume-entity.src: Removed.
1181
1182 2007-04-30 Wakaba <wakaba@suika.fam.cx>
1183
1184 * HTML.pm.src: Tokenizer's handling on hexadecimal
1185 numeric entities are rewritten.
1186
1187 2007-04-30 Wakaba <wakaba@suika.fam.cx>
1188
1189 * HTML.pm.src: Some tokenizer bugs are fixed.
1190
1191 2007-04-30 Wakaba <wakaba@suika.fam.cx>
1192
1193 * HTML.pm.src: The tree construction stage is implemented.
1194
1195 * mkhtmlparser.pl: New macros are added.
1196
1197 2007-04-28 Wakaba <wakaba@suika.fam.cx>
1198
1199 * ContentType.pm: A note on bug in the specification
1200 is removed since it's been now fixed.
1201
1202 * .cvsignore: New file.
1203
1204 2007-04-28 Wakaba <wakaba@suika.fam.cx>
1205
1206 * HTML.pm.src, HTML-consume-entity.src: New files.
1207
1208 * Makefile (HTML.pm): New rule.
1209
1210 * mkhtmlparser.pl: New script.
1211
1212 2007-04-25 Wakaba <wakaba@suika.fam.cx>
1213
1214 * Makefile: New file.
1215
1216 2007-04-24 Wakaba <wakaba@suika.fam.cx>
1217
1218 * ContentType.pm: An error in pod is fixed.
1219
1220 2007-04-24 Wakaba <wakaba@suika.fam.cx>
1221
1222 * ContentType.pm: Documentation is added.
1223
1224 2007-04-24 Wakaba <wakaba@suika.fam.cx>
1225
1226 * ContentType.pm: New Perl module.
1227
1228 * ChangeLog: New file.
1229
1230

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24