Contents of /markup/html/whatpm/Whatpm/ChangeLog

Revision 1.288
Fri Sep 5 17:57:47 2008 UTC by wakaba
Branch: MAIN
Changes since 1.287: +4 -0 lines
++ whatpm/t/ChangeLog	5 Sep 2008 17:40:06 -0000
	* content-model-1.dat: Test data for |XSLT-compat|
	added (HTML5 revision 2141).  A redundant test
	entry is removed.

	* HTML-tree.t: Support for should-level errors.

2008-09-06  Wakaba  <wakaba@suika.fam.cx>

++ whatpm/Whatpm/ChangeLog	5 Sep 2008 17:28:08 -0000
2008-09-06  Wakaba  <wakaba@suika.fam.cx>

	* HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).

1 2008-09-06 Wakaba <wakaba@suika.fam.cx>
2
3 * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
4
5 2008-08-31 Wakaba <wakaba@suika.fam.cx>
6
7 * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
8
9 2008-08-31 Wakaba <wakaba@suika.fam.cx>
10
11 * HTML.pm.src: Bug fix and sync with the spec with regard
12 to after after frameset insertion mode processing (HTML5
13 revision 1909). Note that the implementation was wrong
14 per the old spec before the r1909 changes.
15
16 2008-08-30 Wakaba <wakaba@suika.fam.cx>
17
18 * HTMLTable.pm: scope=auto algorithm fix synced with the
19 spec (HTML5 revision 2093).
20 ($process_row): Algorithm step numbers synced with the
21 spec (HTML5 revision 2092).
22
23 2008-08-30 Wakaba <wakaba@suika.fam.cx>
24
25 * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
26 revision 2094).
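The difference between the Zs general category and the White_Space property can be illustrated with a small Python sketch. This is not the HTMLTable.pm code (the entry does not say where the check is used); the names WHITE_SPACE, is_zs, and is_white_space are hypothetical, and the code point list is copied from the Unicode White_Space property.

```python
import unicodedata

# Code points carrying the Unicode White_Space property.
WHITE_SPACE = frozenset(
    [0x0020, 0x0085, 0x00A0, 0x1680, 0x2028, 0x2029, 0x202F, 0x205F, 0x3000]
    + list(range(0x0009, 0x000E))  # TAB, LF, VT, FF, CR
    + list(range(0x2000, 0x200B))  # EN QUAD .. HAIR SPACE
)


def is_zs(ch):
    """True if ch is in general category Zs (space separator)."""
    return unicodedata.category(ch) == "Zs"


def is_white_space(ch):
    """True if ch has the White_Space property."""
    return ord(ch) in WHITE_SPACE


# TAB and LF are control characters (Cc), so a Zs test misses them.
for ch in ["\t", "\n", " ", "\u2028", "\u3000"]:
    print("U+%04X Zs=%s White_Space=%s" % (ord(ch), is_zs(ch), is_white_space(ch)))
```

A test based on Zs alone treats tabs and line breaks as non-space characters, which is presumably the mismatch this entry addresses.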
27
28 2008-08-30 Wakaba <wakaba@suika.fam.cx>
29
30 * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
31
32 2008-08-30 Wakaba <wakaba@suika.fam.cx>
33
34 * HTML.pm.src: '"' and "'" at the end of attribute
35 name (after another attribute) now raise parse error (HTML5
36 revision 2123). Empty unquoted attribute values are no
37 longer allowed (HTML5 revision 2122).
38
39 2008-08-30 Wakaba <wakaba@suika.fam.cx>
40
41 * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
42 revision 2130).
43
44 2008-08-30 Wakaba <wakaba@suika.fam.cx>
45
46 * ContentChecker.pm: |xml:lang| attribute value must be the same
47 as |lang| attribute value for HTML elements (HTML5 revision 2062
48 and so on).
49
50 2008-08-30 Wakaba <wakaba@suika.fam.cx>
51
52 * ContentChecker.pm: Error level definition for |xml_id_error|
53 was missing.
54
55 * URIChecker.pm: The end of the URL should be marked as the
56 error location for an empty path error. The position
57 between the userinfo and the port components should be
58 marked as the error location for an empty host error.
59
60 2008-08-30 Wakaba <wakaba@suika.fam.cx>
61
62 * URIChecker.pm: Set parameters representing where in the
63 value the error occurs for errors. Report unknown
64 address format error in warning level, since address
65 formats are rarely added. Path segments starting with "/.."
66 were misinterpreted as a dot-segment.
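The dot-segment problem above can be sketched in Python. The entry does not show the original faulty check, so naive_has_dot_segment below is a hypothetical reconstruction of that kind of mistake, and dot_segments is an illustrative helper, not a URIChecker.pm function.

```python
def dot_segments(path):
    """Return the path segments that are exactly "." or ".."."""
    return [seg for seg in path.split("/") if seg in (".", "..")]


def naive_has_dot_segment(path):
    # Hypothetical faulty check: a substring test also fires on a
    # segment that merely starts with "..", such as "..foo".
    return "/.." in path or "/./" in path


print(dot_segments("/a/../b"))             # one real dot-segment
print(dot_segments("/a/..foo/b"))          # no dot-segment here
print(naive_has_dot_segment("/a/..foo/b"))  # the naive test still fires
```

Comparing whole segments, rather than prefixes, is what keeps "/..foo" from being reported.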
67
68 2008-08-30 Wakaba <wakaba@suika.fam.cx>
69
70 * URIChecker.pm (check_iri_reference): Requires
71 |Message::DOM::DOMImplementation|.
72
73 2008-08-29 Wakaba <wakaba@suika.fam.cx>
74
75 * IMTChecker.pm: Updated for the new error reporting architecture.
76
77 * ContentChecker.pm: Error levels for IMTs are added.
78
79 2008-08-17 Wakaba <wakaba@suika.fam.cx>
80
81 * H2H.pm (_shift_token): Support for unquoted HTML attribute
82 values.
83
84 2008-08-16 Wakaba <wakaba@suika.fam.cx>
85
86 * CacheManifest.pm: Support for new style of error
87 reports.
88
89 * HTML.pm.src: Set line=1, column=1 to the document node.
90
91 2008-08-16 Wakaba <wakaba@suika.fam.cx>
92
93 * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
94 and URL checkers. Support for more error levels for bogus
95 language tag and URL "standards".
96
97 * LangTag.pm, URIChecker.pm: Support for new style error
98 level reporting.
99
100 2008-08-15 Wakaba <wakaba@suika.fam.cx>
101
102 * ContentChecker.pm: Support for RDF/XML error levels.
103
104 * HTMLTable.pm, RDFXML.pm: Support for new style of error level
105 specifying. Error types are revised.
106
107 2008-08-15 Wakaba <wakaba@suika.fam.cx>
108
109 * ContentChecker.pm: All error reporting method calls are
110 renewed.
111
112 2008-08-15 Wakaba <wakaba@suika.fam.cx>
113
114 * HTML.pm.src: All error type names and "text" parameters
115 are revised. Use new style for "level" specification.
116
117 * mkhtmlparser.pl: Use new style for "level" specification.
118
119 2008-08-03 Wakaba <wakaba@suika.fam.cx>
120
121 * WebIDL.pm (parse_char_string): Simplified error
122 reporting process for broken ignored valuetype definition.
123 (Valuetype idl_text): Support for special "DOMString" name.
124
125 2008-08-03 Wakaba <wakaba@suika.fam.cx>
126
127 * WebIDL.pm ($get_scoped_name): Append "::::" if the last
128 terminal of the ScopedName is "DOMString", so that whether
129 the last part of the scoped name is "DOMString" or "_DOMString"
130 can be distinguished later. This is necessary to determine whether a |typedef|
131 definition should be ignored or not.
132 (parse_char_string): Unescape the identifier of
133 exception members.
134 ($resolve): Return undef for builtin types and sequence<T>
135 types (we might not have to do this, however...).
136 (check): Support checking for Exceptions, Valuetypes,
137 and Typedefs.
138 ($serialize_type): Support for "DOMString::::" syntax.
139 (Typedef idl_text): Output Type as "DOMString" if it
140 is really "DOMString" (i.e. its internal representation
141 is "::DOMString::").
142
143 2008-08-03 Wakaba <wakaba@suika.fam.cx>
144
145 * WebIDL.pm ($resolve): New code, based on resolve code
146 for constant types in the |check| method.
147 (check): Support for checking of attributes, operations, and
148 arguments.
149 (Attribute/Operation idl_text): Exception names in getraises,
150 setraises, and raises clauses are serialized by |$serialize_type|
151 code.
152
153 2008-08-02 Wakaba <wakaba@suika.fam.cx>
154
155 * WebIDL.pm ($integer): Order of alternatives is changed to match
156 hexadecimal numbers (the original pattern, taken from the spec,
157 did not work for hexadecimal numbers, because the "0" prefix
158 matched the [0-7]* part (as an empty string) and therefore
159 the pattern did not match the remaining "x..." part of a "0x..." integer
160 literal).
161 ($get_type): It now returns a string, not an array reference,
162 for regular types and |sequence| types (i.e. it in any case
163 returns a string).
164 ($get_next_token): The second item in the array that represents
165 an integer or float token is now a Perl number value, not the
166 original string representation of the number.
167 (check): Support for const value consistency checking.
168 No extended attribute is defined for constants.
169 (Node subclasses): Use simple strings rather than array references
170 for default data type values.
171 ($serialize_type): Type values are now simple strings.
172 (value): If the new attribute value is a false value, then
173 a FALSE value is set to the attribute.
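The alternation-order problem described for $integer can be reproduced with Python's re module, which tries alternatives left to right in the same way as Perl. The two patterns below are an assumed reconstruction of the shape the entry describes, not text copied from WebIDL.pm or the spec; leading_integer is a hypothetical helper.

```python
import re

# Octal alternative first: "0" followed by an empty [0-7]* already
# succeeds, so the "x1F" of "0x1F" is left unconsumed.
SPEC_ORDER = re.compile(r"-?0([0-7]*|[Xx][0-9A-Fa-f]+)|[1-9][0-9]*")

# Hexadecimal alternative first: "0x..." is consumed as a whole,
# and octal literals still match through the second alternative.
FIXED_ORDER = re.compile(r"-?0([Xx][0-9A-Fa-f]+|[0-7]*)|[1-9][0-9]*")


def leading_integer(pattern, text):
    """Return the integer literal matched at the start of text, or None."""
    m = pattern.match(text)
    return m.group(0) if m else None


for literal in ["0x1F", "017", "0", "42"]:
    print(literal, leading_integer(SPEC_ORDER, literal),
          leading_integer(FIXED_ORDER, literal))
```

Only the hexadecimal case differs between the two orders; octal, zero, and decimal literals match identically.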
174
175 2008-08-02 Wakaba <wakaba@suika.fam.cx>
176
177 * WebIDL.pm ($get_scoped_name): Now scoped names are stored
178 in its stringified format ("scoped name" as defined in the
179 spec). Note that future version of this module should not use
180 array references for type values and the |type_text| attribute
181 should be made obsolete.
182 (parse_char_string): Unescape attribute names.
183 (check): Support for checking of whether inherited interfaces
184 are actually defined or not. Support for checking of whether
185 interface member identifiers are duplicated or not.
186 ($serialize_type): Scoped names are returned as is. A future
187 version of this code should escape identifiers other than "DOMString",
188 otherwise the idl_text would be non-conforming.
189
190 2008-08-02 Wakaba <wakaba@suika.fam.cx>
191
192 * WebIDL.pm (parse_char_string): Set line/column numbers
193 to generated nodes. Unescape identifiers. Extended attributes
194 for Definition's were ignored.
195 (append_child): Set |parent_node| attribute.
196 (parent_node): New attribute.
197 (check): Support interface/exception members. Support
198 extended attributes. Support definition identifier uniqueness
199 constraint.
200 (qualified_name): New attribute.
201 (Interface/Exception idl_text): Extended attributes were
202 not prepended to the returned text.
203
204 2008-08-02 Wakaba <wakaba@suika.fam.cx>
205
206 * WebIDL.pm (parse_char_string): Set line/column numbers
207 to interface object experimentally. s/shift/pop/g, shift
208 would make things wrong. Support for interface forward
209 declarations was missing. Broken interface declarations
210 with no block were not ignored entirely.
211 (Whatpm::WebIDL::Node): New abstract class. This class
212 makes things easier.
213 (child_nodes): New attribute. Unlike DOM's attribute with
214 same name, this attribute returns a dead list of nodes for
215 simplicity.
216 (get_user_data, set_user_data): New methods.
217 (Module idl_text): A SPACE character should be inserted
218 before the |{| character.
219 (Interface idl_text): Support for interface forward declarations.
220 (is_forward_declaration): New attribute.
221
222 2008-07-19 Wakaba <wakaba@suika.fam.cx>
223
224 * WebIDL.pm (type_text): Better serializer.
225
226 2008-07-19 Wakaba <wakaba@suika.fam.cx>
227
228 * WebIDL.pm: Revise forward-compatible parsing so that
229 it can now handle broken extended attributes and the like.
230
231 2008-07-19 Wakaba <wakaba@suika.fam.cx>
232
233 * WebIDL.pm: Real support for extended attributes.
234 Support for extended attributes with arguments.
235
236 2008-07-19 Wakaba <wakaba@suika.fam.cx>
237
238 * WebIDL.pm: Support for |exception| syntax.
239 (Interface->idl_text): Tentative support for inheritances.
240
241 2008-07-19 Wakaba <wakaba@suika.fam.cx>
242
243 * WebIDL.pm: Hierarchical scoped name support was broken.
244 Support for raises, setraises, and getraises syntaxes.
245
246 2008-07-18 Wakaba <wakaba@suika.fam.cx>
247
248 * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
249 proper support for types, extended attributes, and exceptions yet).
250 WebIDL parser, version 1 (no support for exceptions yet,
251 no proper support for extended attributes yet).
252
253 2008-07-09 Wakaba <wakaba@suika.fam.cx>
254
255 * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
256
257 2008-06-29 Wakaba <wakaba@suika.fam.cx>
258
259 * WebIDL.pm: Support for valuetype and const.
260
261 2008-06-29 Wakaba <wakaba@suika.fam.cx>
262
263 * WebIDL.pm: New module.
264
265 2008-06-15 Wakaba <wakaba@suika.fam.cx>
266
267 * Makefile (Entities.html): URI changed.
268
269 2008-06-08 Wakaba <wakaba@suika.fam.cx>
270
271 * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
272
273 2008-06-01 Wakaba <wakaba@suika.fam.cx>
274
275 * HTML.pm.src (_get_next_token): A parse error was missing.
276
277 2008-06-01 Wakaba <wakaba@suika.fam.cx>
278
279 * mklinktypelist.pl: rel=contact is no longer part of the HTML5
280 spec (commented out). (HTML5 revision 1711).
281
282 2008-05-25 Wakaba <wakaba@suika.fam.cx>
283
284 * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
285
286 * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
287 as UTF-16 (HTML5 revision 1701).
288
289 2008-05-25 Wakaba <wakaba@suika.fam.cx>
290
291 * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
292 1692).
293
294 2008-05-25 Wakaba <wakaba@suika.fam.cx>
295
296 * HTML.pm.src: The secondary insertion mode used when switching
297 to foreign content is the "in body" insertion mode (HTML5 revision
298 1696).
299
300 2008-05-25 Wakaba <wakaba@suika.fam.cx>
301
302 * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
303 revision 1697).
304
305 2008-05-24 Wakaba <wakaba@suika.fam.cx>
306
307 * HTML.pm.src: Support for end-of-file token in foreign content
308 insertion mode (HTML5 revision 1693). Update SVG camelCase
309 attribute list (HTML5 revision 1700). <textarea> closes
310 </select> (HTML5 revision 1699). More start tags close in
311 foreign content insertion mode (HTML5 revision 1698).
312
313 2008-05-24 Wakaba <wakaba@suika.fam.cx>
314
315 * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
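As a rough illustration of the rule that ";" ends an unquoted charset name, here is a simplified Python sketch. It is not the HTML5 extraction algorithm and not the HTML.pm.src code; the regular expression and the name extract_charset are assumptions made for this example, and the parameter name is matched case-insensitively as another entry in this log records.

```python
import re

# Simplified: a quoted value, or an unquoted value that stops at
# white space, a quote, or ";".
_CHARSET = re.compile(
    r"""charset\s*=\s*(?:"([^"]*)"|'([^']*)'|([^\s;"']+))""",
    re.IGNORECASE,
)


def extract_charset(content):
    """Return the charset named in a content attribute value, or None."""
    m = _CHARSET.search(content)
    if m is None:
        return None
    # Exactly one of the three groups took part in the match.
    return next(g for g in m.groups() if g is not None)


print(extract_charset("text/html; charset=utf-8; foo=bar"))
print(extract_charset('text/html; CHARSET="Shift_JIS"'))
```

Without the ";" exclusion, the first call would return "utf-8;" and the lookup of the encoding name would fail.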
316
317 2008-05-24 Wakaba <wakaba@suika.fam.cx>
318
319 * HTML.pm.src: More robust charset parameter detection (HTML5
320 revision 1674).
321
322 2008-05-24 Wakaba <wakaba@suika.fam.cx>
323
324 * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
325 revision 1676).
326
327 2008-05-24 Wakaba <wakaba@suika.fam.cx>
328
329 * HTML.pm.src: Ignore language part of public identifiers for
330 quirks mode detection (HTML5 revision 1679).
331
332 2008-05-24 Wakaba <wakaba@suika.fam.cx>
333
334 * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
335 revision 1685).
336
337 2008-05-24 Wakaba <wakaba@suika.fam.cx>
338
339 * HTML.pm.src: Support for EOF in new states for tags (HTML5
340 revision 1684).
341
342 2008-05-24 Wakaba <wakaba@suika.fam.cx>
343
344 * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
345 work (HTML5 revision 1690).
346
347 2008-05-24 Wakaba <wakaba@suika.fam.cx>
348
349 * HTML.pm.src (_tree_construction_main): Change handling of
350 end tags in head insertion modes (HTML5 revision 1686).
351 (parse_char_string): Bug fix for non-utf8 character string handling.
352 (parse_char_stream): |ungetc| does not work well for this context.
353
354 2008-05-18 Wakaba <wakaba@suika.fam.cx>
355
356 * HTML.pm.src (parse_byte_string): Redefined to invoke
357 |parse_byte_stream|.
358 (parse_byte_stream): New method.
359
360 2008-05-18 Wakaba <wakaba@suika.fam.cx>
361
362 * HTML.pm.src (parse_byte_string): Fix the column number reported
363 by encoding layer error reporter.
364
365 2008-05-17 Wakaba <wakaba@suika.fam.cx>
366
367 * HTML.pm.src (parse_byte_string): Use streaming decoder
368 rather than converting the whole byte string and then parsing.
369 Propagate errors in character encoding layer.
370 (get_next_token): Precise error reporting for |bare stago| error.
371
372 2008-05-17 Wakaba <wakaba@suika.fam.cx>
373
374 * HTML.pm.src (parse_char_stream): New method.
375 (parse_char_string): This method is now defined as an invocation
376 of the |parse_char_stream| method.
377
378 2008-05-17 Wakaba <wakaba@suika.fam.cx>
379
380 * HTML.pm.src (parse_byte_string): Report various status
381 of the sniffing as info-level errors. Support for new
382 decoding framework in parser resetting.
383 (new): Various default error levels were not set.
384
385 2008-05-17 Wakaba <wakaba@suika.fam.cx>
386
387 * HTML.pm.src (parse_byte_string): HTML5 encoding sniffing
388 algorithm, except for the actual sniffing, is implemented
389 with new framework with Message::Charset::Info.
390
391 2008-05-16 Wakaba <wakaba@suika.fam.cx>
392
393 * CacheManifest.pm (_parse): Drop fragment identifiers from
394 URIs in fallback section (HTML5 revision 1596).
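Dropping a fragment identifier from a URI can be sketched with the Python standard library. This only illustrates the operation named in the entry; strip_fragment is a hypothetical helper, and CacheManifest.pm itself is a Perl module whose code is not shown here.

```python
from urllib.parse import urldefrag


def strip_fragment(uri):
    """Return uri without its "#fragment" part, if it has one."""
    return urldefrag(uri).url


print(strip_fragment("http://example.com/fallback.html#top"))
print(strip_fragment("http://example.com/fallback.html"))
```

Both calls yield the same URI, so two fallback entries that differ only in their fragment would refer to the same resource.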
395
396 2008-05-10 Wakaba <wakaba@suika.fam.cx>
397
398 * Makefile (Entities.html): URI has changed.
399
400 2008-05-10 Wakaba <wakaba@suika.fam.cx>
401
402 * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
403 1553).
404
405 2008-05-06 Wakaba <wakaba@suika.fam.cx>
406
407 * ContentChecker.pm: Noted that those returned in |table| are
408 no longer table elements, but table objects returned
409 by Whatpm::HTMLTable.
410
411 * HTMLTable.pm (form_table): Return table element node
412 as |$table->{element}|.
413 (assign_header): Support for the |headers=""| attribute.
414
415 2008-05-06 Wakaba <wakaba@suika.fam.cx>
416
417 * HTMLTable.pm (assign_header): New function; first version
418 with no support for headers="".
419 (form_table): Include table width and height to the returned
420 table object for convenience. Indexing in column assignment was wrong.
421 Set whether a data cell is empty or not for convenience.
422
423 2008-05-05 Wakaba <wakaba@suika.fam.cx>
424
425 * HTMLTable.pm: More robust caption support (HTML5 revision 1393).
426
427 2008-05-05 Wakaba <wakaba@suika.fam.cx>
428
429 * HTMLTable.pm: How table model errors are detected is
430 changed (HTML5 revision 1387).
431
432 2008-05-05 Wakaba <wakaba@suika.fam.cx>
433
434 * HTMLTable.pm: The algorithm now moves |tfoot| elements
435 to the end of the table (HTML5 revision 1380).
436
437 2008-05-05 Wakaba <wakaba@suika.fam.cx>
438
439 * HTMLTable.pm: The algorithm is now 0-based indexing, instead
440 of 1-based (HTML5 revision 1376).
441
442 2008-05-05 Wakaba <wakaba@suika.fam.cx>
443
444 * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
445 and "Content-Encoding" no longer prevent sniffing (HTML5
446 revision 1288).
447
448 2008-05-05 Wakaba <wakaba@suika.fam.cx>
449
450 * ContentType.pm: Skip BOMs in feed or HTML algorithm (HTML5 revision
451 1282).
452
453 2008-05-03 Wakaba <wakaba@suika.fam.cx>
454
455 * ContentChecker.pm: Support for global attributes.
456 Status of XML specs are added.
457
458 2008-05-03 Wakaba <wakaba@suika.fam.cx>
459
460 * ContentChecker.pm (check_element): Support for |template=""|
461 and |ref=""| attribute (referent element type checking).
462
463 2008-04-29 Wakaba <wakaba@suika.fam.cx>
464
465 * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
466 1500) is implemented (except for IDNA part and URI-scheme-specific
467 knowledge). Line number counting was wrong for LF-only
468 documents.
469
470 2008-04-13 Wakaba <wakaba@suika.fam.cx>
471
472 * HTML.pm.src: Raise a parse error for any disallowed
473 character (HTML5 revision 1263).
474
475 2008-04-13 Wakaba <wakaba@suika.fam.cx>
476
477 * mkentitylist.pl: Support for new HTML5 entity table format (the
478 definition for |AElig;| was missing).
479
480 2008-04-13 Wakaba <wakaba@suika.fam.cx>
481
482 * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
483 name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
484
485 2008-04-13 Wakaba <wakaba@suika.fam.cx>
486
487 * HTML.pm.src: List of element names that close foreign content
488 insertion mode is added (HTML5 revisions 1412 and 1418).
489
490 2008-04-13 Wakaba <wakaba@suika.fam.cx>
491
492 * HTML.pm.src: Support for |mglyph| and |malignmark|
493 elements (HTML5 revision 1410).
494
495 2008-04-13 Wakaba <wakaba@suika.fam.cx>
496
497 * HTML.pm.src: Support for new long MathML entities (HTML5
498 revision 1406).
499
500 2008-04-13 Wakaba <wakaba@suika.fam.cx>
501
502 * HTML.pm.src: CDATA section support for MathML and SVG
503 elements (HTML5 revision 1404 and 1420).
504
505 2008-04-12 Wakaba <wakaba@suika.fam.cx>
506
507 * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
508 elements (HTML5 revision 1404). Unused !!!macro definitions
509 are removed.
510
511 2008-04-12 Wakaba <wakaba@suika.fam.cx>
512
513 * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
514 are raised is changed (HTML5 revision 1404).
515
516 2008-04-06 Wakaba <wakaba@suika.fam.cx>
517
518 * HTML.pm.src: Category sets that are no longer used are removed.
519
520 2008-04-06 Wakaba <wakaba@suika.fam.cx>
521
522 * HTML.pm.src: The ->[1] property of stack entries is now
523 replaced by constants representing element category.
524
525 2008-04-06 Wakaba <wakaba@suika.fam.cx>
526
527 * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
528 for error reporting. (This is a preparation for using constant
529 value for ->[1].)
530
531 2008-03-22 Wakaba <wakaba@suika.fam.cx>
532
533 * RDFXML.pm: Typo fixed.
534
535 2008-03-22 Wakaba <wakaba@suika.fam.cx>
536
537 * ContentChecker.pm: |fact_level| is now treated
538 as same as |must_level|, i.e. level = |m|.
539 (check_element): Make list of URIs in the DOM.
540
541 2008-03-21 Wakaba <wakaba@suika.fam.cx>
542
543 * RDFXML.pm: Language accessor implemented.
544 Local (null-namespace) attribute support.
545
546 2008-03-21 Wakaba <wakaba@suika.fam.cx>
547
548 * RDFXML.pm: Factored out ID checking code.
549
550 2008-03-21 Wakaba <wakaba@suika.fam.cx>
551
552 * RDFXML.pm: TODO items noted. Validation of ID
553 and URI attributes is implemented. Warn if unknown
554 value is used in rdf:parseType="" attribute.
555
556 * URIChecker.pm (check_rdf_uri_reference): New function.
557
558 2008-03-21 Wakaba <wakaba@suika.fam.cx>
559
560 * RDFXML.pm: bnodeid implemented. Relative references
561 are now resolved.
562
563 2008-03-21 Wakaba <wakaba@suika.fam.cx>
564
565 * ContentChecker.pm: RDF reification implemented.
566
567 * RDFXML.pm: undef vs false bug fixed.
568 Reification implemented.
569
570 2008-03-21 Wakaba <wakaba@suika.fam.cx>
571
572 * RDFXML.pm: s/id/ID/ for attribute name.
573 The |node| arguments are added for |ontriple| calls.
574 Too many "attribute not allowed" errors were raised.
575
576 * ContentChecker.pm: Initial experimental support for rdf:RDF
577 element.
578
579 2008-03-21 Wakaba <wakaba@suika.fam.cx>
580
581 * RDFXML.pm: New module.
582
583 2008-03-20 Wakaba <wakaba@suika.fam.cx>
584
585 * HTML.pm.src (set_inner_html): Line/column number
586 code was still the old one.
587
588 2008-03-20 Wakaba <wakaba@suika.fam.cx>
589
590 * HTML.pm.src: Better line/column reporting for "duplicate attribute"
591 errors. Line/column markings for DOCTYPE, comment, and
592 character tokens are reintroduced; otherwise,
593 error location for "not HTML5" error and errors
594 for implied elements are not attached.
595
596 2008-03-20 Wakaba <wakaba@suika.fam.cx>
597
598 * HTML.pm.src: Set line/column numbers to attributes.
599
600 * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
601 (value): Setter implemented.
602
603 * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
604
605 2008-03-20 Wakaba <wakaba@suika.fam.cx>
606
607 * HTML.pm.src: Unused line/column markings are removed.
608
609 2008-03-20 Wakaba <wakaba@suika.fam.cx>
610
611 * HTML.pm.src (_get_next_token): Remove |first_start_tag|
612 flag, which is no longer used.
613
614 2008-03-17 Wakaba <wakaba@suika.fam.cx>
615
616 * HTML.pm.src: Set line/column information to element nodes.
617
618 * mkhtmlparser.pl (!!!create-element, !!!insert-element,
619 and !!!insert-element-t): Set line/column information to
620 element nodes.
621
622 2008-03-17 Wakaba <wakaba@suika.fam.cx>
623
624 * HTML.pm.src (_get_next_token): The first "<" character
625 in "<?", "<>", or "</>" should be the error point.
626
627 2008-03-16 Wakaba <wakaba@suika.fam.cx>
628
629 * HTML.pm.src: Some more fixes on error position reporting.
630
631 2008-03-16 Wakaba <wakaba@suika.fam.cx>
632
633 * HTML.pm.src: Token-level precise error reporting.
634
635 2008-03-16 Wakaba <wakaba@suika.fam.cx>
636
637 * HTML.pm.src: Preparation for more precise error point
638 reporting.
639
640 2008-03-11 Wakaba <wakaba@suika.fam.cx>
641
642 * HTML.pm.src: Error type revised.
643
644 2008-03-11 Wakaba <wakaba@suika.fam.cx>
645
646 * HTML.pm.src: Similar codes are merged together, again.
647
648 2008-03-11 Wakaba <wakaba@suika.fam.cx>
649
650 * HTML.pm.src: Similar codes are merged together.
651
652 2008-03-10 Wakaba <wakaba@suika.fam.cx>
653
654 * mkhtmlparser.pl: Set "level" parameter to parse errors.
655
656 * HTML.pm.src: Code refined.
657
658 2008-03-09 Wakaba <wakaba@suika.fam.cx>
659
660 * HTML.pm.src: |</body>| treatment has been changed (HTML5
661 revision 1348). Note that I really don't know whether this makes
662 any difference in the black-box behavior of the parser.
663
664 2008-03-09 Wakaba <wakaba@suika.fam.cx>
665
666 * HTML.pm.src: New end-of-file token implementation (HTML5
667 revision 1348).
668
669 2008-03-09 Wakaba <wakaba@suika.fam.cx>
670
671 * HTML.pm.src: |applet| support (HTML5 revision 1347).
672
673 2008-03-09 Wakaba <wakaba@suika.fam.cx>
674
675 * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
676
677 2008-03-09 Wakaba <wakaba@suika.fam.cx>
678
679 * HTML.pm.src: Support for |<input>| in the "in select" insertion
680 mode and support for the "in select in table" insertion mode (HTML5
681 revision 1342).
682
683 2008-03-09 Wakaba <wakaba@suika.fam.cx>
684
685 * HTML.pm.src: No foster parenting for <script> and <style>
686 in non-tainted <table>s (HTML5 revision 1336).
687
688 2008-03-09 Wakaba <wakaba@suika.fam.cx>
689
690 * HTML.pm.src: Ignore white space characters between <html>
691 and <head> (HTML5 revision 1332).
692
693 2008-03-09 Wakaba <wakaba@suika.fam.cx>
694
695 * HTML.pm.src: Treat <input type=hidden> as if it were a
696 white space (HTML5 revision 1331).
697
698 2008-03-08 Wakaba <wakaba@suika.fam.cx>
699
700 * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
701 element (HTML5 revision 1330).
702
703 2008-03-08 Wakaba <wakaba@suika.fam.cx>
704
705 * HTML.pm.src: <title> is always appended to the current
706 element (HTML5 revision 1328).
707
708 2008-03-08 Wakaba <wakaba@suika.fam.cx>
709
710 * HTML.pm.src: White space in tainted tables is moved
711 into foster parents (HTML5 revision 1326).
712
713 2008-03-08 Wakaba <wakaba@suika.fam.cx>
714
715 * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
716 revision 1321).
717
718 2008-03-08 Wakaba <wakaba@suika.fam.cx>
719
720 * HTML.pm.src: |</h/n/>| case code rearranged to align with
721 the spec (HTML5 revision 1320). Note that we finally complete
722 all of HTML5 revision 1320 changes.
723
724 2008-03-08 Wakaba <wakaba@suika.fam.cx>
725
726 * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
727 tags (HTML5 revision 1320).
728
729 2008-03-08 Wakaba <wakaba@suika.fam.cx>
730
731 * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
732
733 2008-03-08 Wakaba <wakaba@suika.fam.cx>
734
735 * HTML.pm.src: A "generate implied end tags" code (t409.1)
736 could not be reached so that it is now removed (HTML5 revision
737 1320).
738
739 2008-03-08 Wakaba <wakaba@suika.fam.cx>
740
741 * HTML.pm.src: Code for the case of |</div>| and so on is revised
742 to align with new spec text (HTML5 revision 1320).
743
744 2008-03-08 Wakaba <wakaba@suika.fam.cx>
745
746 * HTML.pm.src: Remove strange |if| condition; however, it should
747 have done no harm in theory.
748
749 2008-03-08 Wakaba <wakaba@suika.fam.cx>
750
751 * HTML.pm.src (_tree_construction_main): '</p>' in body
752 case is split from other end tags for the preparation
753 of implementing HTML revision 1320.
754
755 2008-03-07 Wakaba <wakaba@suika.fam.cx>
756
757 * HTML.pm.src: Simplified "generate implied end tag" (HTML5
758 revision 1320).
759
760 2008-03-07 Wakaba <wakaba@suika.fam.cx>
761
762 * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
763 and "div" (HTML5 revision 1318). Add comments to where
764 |form| pointer association codes should be inserted (HTML5 revision
765 1319).
766
767 2008-03-06 Wakaba <wakaba@suika.fam.cx>
768
769 * HTML.pm.src: <html> treatment refined (HTML5 revision 1314).
770
771 2008-03-05 Wakaba <wakaba@suika.fam.cx>
772
773 * HTML.pm.src: Since the case t268 should never be reached (there
774 are no other token types), it is replaced by a |die| statement.
775
776 2008-03-05 Wakaba <wakaba@suika.fam.cx>
777
778 * HTML.pm.src: Typo fixed.
779
780 2008-03-04 Wakaba <wakaba@suika.fam.cx>
781
782 * HTML.pm.src (_tree_construction_initial): Some limited quirks
783 doctypes were not uppercased for comparison.
784
785 2008-03-03 Wakaba <wakaba@suika.fam.cx>
786
787 * HTML.pm.src (tree construction and set_inner_html): Checkpoints
788 are added.
789
790 2008-03-03 Wakaba <wakaba@suika.fam.cx>
791
792 * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
793 are set. Cases that are unlikely to be reached are noted as such.
794
795 2008-03-03 Wakaba <wakaba@suika.fam.cx>
796
797 * HTML.pm.src: Checkpoints for debugging are added.
798
799 * mkhtmlparser.pl: Support for |!!!cp| syntax.
800
801 2008-03-03 Wakaba <wakaba@suika.fam.cx>
802
803 * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
804 for simplicity.
805
806 2008-03-03 Wakaba <wakaba@suika.fam.cx>
807
808 * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
809 revision 1307).
810
811 2008-03-03 Wakaba <wakaba@suika.fam.cx>
812
813 * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
814 is set is changed (HTML5 revision 1305).
815
816 2008-03-02 Wakaba <wakaba@suika.fam.cx>
817
818 * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
819 1292). Entities are not parsed in comment-like part in RCDATA
820 elements (HTML5 revision 1294). Allow bare & at the end
821 of attribute value literals (HTML5 revision 1296). More
822 quirks mode doctypes (HTML5 revision 1302). Require spaces
823 between attributes and ban attribute names or unquoted
824 attribute values containing single or double quotes (HTML5
825 revision 1303).
826
827 2008-03-02 Wakaba <wakaba@suika.fam.cx>
828
829 * ContentChecker.pm: Typo fixed. Don't raise "character encoding"
830 and related errors unless it is an HTML document (though the spec
831 is unclear on whether it is applied to XHTML documents).
832
833 * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
834 are added.
835
836 2008-03-02 Wakaba <wakaba@suika.fam.cx>
837
838 * HTML.pm.src: s/local_name/manakai_local_name/g.
839
840 2008-03-01 Wakaba <wakaba@suika.fam.cx>
841
842 * _NamedEntityList.pm: Updated (HTML5 revision 1286).
843
844 * HTML.pm.src: |charset| in |content| attribute is
845 case-insensitive (HTML5 revision 1270).
846
847 2008-02-26 Wakaba <wakaba@suika.fam.cx>
848
849 * ContentChecker.pm: New status constants are added.
850 ($ElementDefault): |status| added.
851 (check_element): Err for non-standard or deprecated elements.
852 (_attr_status_info): For non-standard or deprecated attributes.
853
854 2008-02-24 Wakaba <wakaba@suika.fam.cx>
855
856 * ContentChecker.pm (_attr_status_info): New internal method.
857
858 2008-02-24 Wakaba <wakaba@suika.fam.cx>
859
860 * ContentChecker.pm (check_element): Element standardized
861 status information is now dispatched.
862
863 2008-02-24 Wakaba <wakaba@suika.fam.cx>
864
865 * ContentChecker.pm (check_element): Fix |del|-and-significant
866 problem by adding some more arguments.
867
868 2008-02-24 Wakaba <wakaba@suika.fam.cx>
869
870 * ContentChecker.pm (check_element): Use context of
871 container-for-the-purpose-of-content-model element (not transparent
872 element) for |check_child_element| calling and significant
873 text flag marking. This reintroduces |<del>|-and-significant
874 problem again.
875
876 2008-02-24 Wakaba <wakaba@suika.fam.cx>
877
878 * ContentChecker.pm (check_element): Make semi-transparent
879 elements ignored for the purpose of phase changes in
880 content model checking.
881
882 2008-02-23 Wakaba <wakaba@suika.fam.cx>
883
884 * ContentChecker.pm (check_element): In-element state
885 was not properly managed for transparent cases.
886
887 2008-02-23 Wakaba <wakaba@suika.fam.cx>
888
889 * ContentChecker.pm (check_element): Support for |video|
890 and |audio| as semi-transparent elements.
891
892 2008-02-23 Wakaba <wakaba@suika.fam.cx>
893
894 * ContentChecker.pm ($HTMLSemiTransparentElements): New.
895 (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
896 Support for |html:object| as a semi-transparent element.
897
898 2008-02-23 Wakaba <wakaba@suika.fam.cx>
899
900 * ContentChecker.pm (check_element): The way to traverse
901 the tree is entirely revised to make it easier to track
902 the state of ancestors/descendants. As a result of this
903 revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
904 support for content model checking for HTML elements |figure|,
905 |object|, |video|, and |audio| and checking for XML elements (and
906 some XMLNS checkings) are dropped for now. They will be
907 reimplemented in due course.
908
909 2008-02-17 Wakaba <wakaba@suika.fam.cx>
910
911 * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
912 token (HTML5 revision 1225).
913
914 2008-02-17 Wakaba <wakaba@suika.fam.cx>
915
916 * ContentChecker.pm ({unsupported_level}): New value.
917
918 * HTML.pm.src: Save whether |meta| |content| attribute
919 contains character references or not.
920
921 2008-02-17 Wakaba <wakaba@suika.fam.cx>
922
923 * ContentChecker.pm (_get_children): (Incomplete) attempt to
924 implement significant content checking for contents
925 with |del| elements.
926
927 2008-02-17 Wakaba <wakaba@suika.fam.cx>
928
929 * ContentChecker.pm ($HTMLTransparentElements): More
930 elements are added.
931 (_get_children): HTML |object| elements are now semi-transparent.
932
933 * NanoDOM.pm (manakai_html, manakai_head): New methods.
934
935 2008-02-16 Wakaba <wakaba@suika.fam.cx>
936
937 * CacheManifest.pm: HTML5 revision 1211 implemented.
938
939 * CacheManifest.pod: Updated.
940
941 2008-02-10 Wakaba <wakaba@suika.fam.cx>
942
943 * ContentChecker.pm (check_document, check_element): Support
944 for second argument ($onsubdoc).
945 (_get_css_parser): Removed (now it is part of WDCC).
946
947 2008-02-09 Wakaba <wakaba@suika.fam.cx>
948
949 * ContentChecker.pm (_get_css_parser): New.
950
951 2007-11-25 Wakaba <wakaba@suika.fam.cx>
952
953 * ContentChecker.pm ($AnyChecker): Old way to add child elements
954 for checking had been used.
955
956 2007-11-25 Wakaba <wakaba@suika.fam.cx>
957
958 * ContentChecker.pm (check_element): New todo item type |descendant|.
959
960 2007-11-23 Wakaba <wakaba@suika.fam.cx>
961
962 * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
963 specced in RFC 2046 and RFC 4288.
964 (application/atom+xml): Definition added.
965
966 2007-11-23 Wakaba <wakaba@suika.fam.cx>
967
968 * URIChecker.pm: Make RFC 3986 should-level errors
969 warnings (rather than SHOULD-level errors).
970
971 2007-11-23 Wakaba <wakaba@suika.fam.cx>
972
973 * NanoDOM.pm (get_user_data, set_user_data): New methods.
974
975 * HTML.pm.src: A flag for character references in attribute
976 values is added. Set |manakai_has_reference| user data
977 to |charset| attribute.
978
979 2007-11-23 Wakaba <wakaba@suika.fam.cx>
980
981 * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
982 attributes.
983
984 * ContentChecker.pm (check_document): Warn if charset requirements
985 cannot be tested.
986
987 2007-11-19 Wakaba <wakaba@suika.fam.cx>
988
989 * HTML.pm.src (parse_byte_string): Detect charset
990 by universalchardet if charset parameter is not specified.
991
992 * Makefile (Charset-all, Charset-clean): New rules.
993
994 2007-11-18 Wakaba <wakaba@suika.fam.cx>
995
996 * ContentChecker.pm (check_document): Check the existence
997 of character encoding declaration and selection of encoding
998 for HTML document.
999
1000 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1001
1002 * ContentType.pm (get_sniffed_type): Return also the official
1003 type in list context.
1004
1005 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1006
1007 * ContentType.pm: Sniffing with leading white space
1008 ignoring (HTML5 revisions 1013 and 1016).
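The leading white space handling can be sketched as follows. This is a minimal illustration in Python, not the code in ContentType.pm: the function name |sniff_html|, the white space set, and the short signature list are assumptions made for the example, and the real sniffing table in the spec is longer.

```python
# Hypothetical sketch: skip leading white space before matching signatures.
WHITESPACE = b"\t\n\x0c\r "          # assumed set of ignorable leading bytes
HTML_SIGNATURES = (b"<!doctype html", b"<html", b"<script")  # illustrative subset


def sniff_html(data: bytes):
    """Return "text/html" if the data looks like HTML, else None."""
    # Drop any leading white space bytes, then compare case-insensitively.
    head = data.lstrip(WHITESPACE)[:32].lower()
    for signature in HTML_SIGNATURES:
        if head.startswith(signature):
            return "text/html"
    return None
```

With this sketch, |sniff_html(b"  \n<HTML>")| is treated the same way as |sniff_html(b"<html>")|.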
1009
1010 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1011
1012 * ContentType.pm: HTML5 revision 1013 changes, except for leading
1013 white spaces, are implemented.
1014
1015 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1016
1017 * HTML.pm.src (parse_char_string): Set |inner_encoding|
1018 attribute if possible.
1019
1020 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1021
1022 * HTML.pm.src (parse_byte_string): New method.
1023 (parse_char_string): New alias for |parse_string|.
1024 (main phase): Invoking "change the encoding" algorithm if desired.
1025
1026 * HTML.pod: Updated.
1027
1028 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1029
1030 * HTML.pod (get_inner_html): Removed.
1031
1032 * Makefile (HTML-all, HTML-clean): New.
1033
1034 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1035
1036 * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1037
1038 2007-11-08 Wakaba <wakaba@suika.fam.cx>
1039
1040 * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1041
1042 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1043
1044 * Makefile: |CacheManifest.html| is added.
1045
1046 * CacheManifest.pod: New file.
1047
1048 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1049
1050 * CacheManifest.pm: New module.
1051
1052 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1053
1054 * HTML.pm.src: Support for application cache selection algorithm
1055 callback.
1056
1057 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1058
1059 * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1060
1061 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1062
1063 * Makefile (clean): New rule.
1064
1065 * NanoDOM.pm (public_id, system_id): New attributes.
1066
1067 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1068
1069 * Makefile (CSS-all, CSS-clean, clean): New rules.
1070
1071 2007-10-14 Wakaba <wakaba@suika.fam.cx>
1072
1073 * ContentChecker.pm (check_document): Support for
1074 new |is_xml_root| flag.
1075 (check_element): Support for new |pluses| state.
1076 (_add_pluses): New method.
1077 (_remove_minuses): Support for new |minus| item.
1078
1079 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1080
1081 * ContentChecker.pm: Raise specific error for invalid
1082 root element.
1083
1084 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1085
1086 * ContentChecker.pm: Set level values for later uses.
1087
1088 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1089
1090 * ContentChecker.pm: Support for language tag validation.
1091
1092 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1093
1094 * LangTag.pm (check_rfc3066_language_tag): New method.
1095
1096 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1097
1098 * LangTag.pm: New module.
1099
1100 2007-09-04 Wakaba <wakaba@suika.fam.cx>
1101
1102 * HTML.pm.src: Some error types were wrong.
1103
1104 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1105
1106 * CSS/: New directory.
1107
1108 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1109
1110 * ContentChecker.pm (_check_get_children): Support
1111 for |noscript| in |head|.
1112
1113 2007-08-12 Wakaba <wakaba@suika.fam.cx>
1114
1115 * URI/: New directory.
1116
1117 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1118
1119 * HTML.pm.src: Tokenizer's states are now represented in
1120 number.
1121
1122 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1123
1124 * HTML.pm.src: |or|s for insertion modes are replaced
1125 by |&|s.
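Once insertion modes are numbers, a group of related modes can share a flag bit, so a chain of |or| comparisons becomes a single |&| test. The sketch below shows the idea in Python; the constant names and bit values are hypothetical and are not taken from HTML.pm.src.

```python
# Hypothetical bit layout: high bits mark a family, low bits pick a member.
TABLE_FAMILY = 0b0001_0000
BODY_FAMILY = 0b0010_0000

IN_TABLE = TABLE_FAMILY | 0b0001
IN_ROW = TABLE_FAMILY | 0b0010
IN_BODY = BODY_FAMILY | 0b0001
IN_CELL = BODY_FAMILY | 0b0010


def is_table_family(mode: int) -> bool:
    """One bitwise test replaces `mode == IN_TABLE or mode == IN_ROW or ...`."""
    return bool(mode & TABLE_FAMILY)
```

Adding a new mode to a family then only requires setting the family bit in its constant; the tests that use the mask do not change.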
1126
1127 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1128
1129 * HTML.pm.src: Token types are now represented in number.
1130
1131 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1132
1133 * ContentType.pm (SEE ALSO): Updated.
1134
1135 * HTML.pm.src: Insertion modes are now represented in number.
1136
1137 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1138
1139 * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1140 999) is implemented.
1141
1142 2007-08-08 Wakaba <wakaba@suika.fam.cx>
1143
1144 * ContentType.pm: Sniffing for |<script| (HTML5 revision
1145 983) is implemented.
1146
1147 2007-08-06 Wakaba <wakaba@suika.fam.cx>
1148
1149 * ContentChecker.pod: New documentation.
1150
1151 * Makefile: A rule for |ContentChecker.html| is added.
1152
1153 * ContentChecker.pm: A pod "LICENSE" section is added.
1154
1155 * NanoDOM.pm ($VERSION): New variable.
1156
1157 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1158
1159 * H2H.pm: |b|, |i|, and |sub| are added to the
1160 list of allowed HTML elements.
1161
1162 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1163
1164 * H2H.pm: |samp| is added to the list of allowed
1165 HTML elements.
1166
1167 * URIChecker.pm (check_iri): New.
1168 (check_iri_reference): Error type for IRI reference
1169 syntax error is changed.
1170
1171 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1172
1173 * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1174 is added.
1175 (check_document): Load appropriate module before validation.
1176
1177 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1178
1179 * ContentChecker/: New directory.
1180
1181 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1182
1183 * ContentChecker.pm: HTML |time| element is implemented.
1184
1185 * HTMLTable.pm: Comments are updated as HTML5 is revised.
1186
1187 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1188
1189 * ContentChecker.pm (check_document): Return value
1190 even if no document element is found.
1191
1192 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1193
1194 * HTML.pm.src: |$in_body| is no longer a function.
1195
1196 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1197
1198 * HTML.pm.src: The |$in_body| code has been moved down.
1199
1200 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1201
1202 * HTML.pm.src: The "trailing end" insertion mode
1203 is split into "after html body" and "after html frameset"
1204 insertion modes. Their codes are merged with "after body"
1205 and "after frameset" codes. |$previous_insertion_mode|
1206 has been removed. "after frameset" code is
1207 merged with "in frameset" code.
1208
1209 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1210
1211 * HTML.pm.src: The "before head" insertion mode is
1212 merged with the "in head" insertion mode.
1213
1214 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1215
1216 * HTML.pm.src: Cases in "in head" insertion mode are
1217 reorganized.
1218
1219 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1220
1221 * HTML.pm.src: Some cases in "in table" insertion mode
1222 are merged.
1223
1224 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1225
1226 * HTML.pm.src: The "in row" insertion mode is merged
1227 with "in table" insertion mode.
1228
1229 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1230
1231 * HTML.pm.src: The "in table" and "in table body" insertion
1232 modes are merged.
1233
1234 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1235
1236 * HTML.pm.src: There is no "in table head" or "in table foot"
1237 insertion mode!
1238
1239 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1240
1241 * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
1242 now directly invoke the handler.
1243
1244 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1245
1246 * HTML.pm.src: Code for "in cell" insertion mode
1247 is merged into the "in body" insertion mode code.
1248
1249 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1250
1251 * HTML.pm.src: Codes for "in body" and "in caption"
1252 insertion modes are merged.
1253
1254 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1255
1256 * HTML.pm.src: Two |!!!next-token|s were missing.
1257
1258 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1259
1260 * HTML.pm.src: Use numeric constant for |{content_mode}|
1261 instead of string constant for |{content_model_flag}|.
1262
1263 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1264
1265 * HTML.pm.src: Add the name of the attribute
1266 to the "duplicate attribute" error.
1267
1268 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1269
1270 * ContentChecker.pm: Return the |class| node list.
1271
1272 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1273
1274 * ContentChecker.pm: Return the |id| node list.
1275
1276 * HTML.pm.src: A typo is fixed.
1277
1278 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1279
1280 * ContentChecker.pm: Drop wrong |level => 'error'| specification
1281 from "in HTML:xml:lang" error. Character position
1282 is now the last part of the error type in the URI error
1283 description. Report "unsupported" status for language
1284 tags, media queries, script codes, and style sheets.
1285
1286 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1287
1288 * ContentChecker.pm: Report error if |xml:lang|
1289 in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
1290 in XML.
1291
1292 * NanoDOM.pm (Attr.owner_document): New attribute.
1293
1294 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1295
1296 * HTML.pm.src: The character immediately following
1297 a bare |hcro| was discarded. Fix handling of
1298 entity references in attribute values.
1299
1300 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1301
1302 * HTML.pm.src (main and trailing end phases): Token
1303 types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
1304 factored out. Error types |in html:#DOCTYPE|
1305 and |after html:#DOCTYPE| are merged
1306 into |DOCTYPE in the middle|. |</frameset>|
1307 in fragment parsing mode changed the
1308 insertion mode.
1309
1310 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1311
1312 * HTML.pm.src: |$phase| has been removed; The |trailing end|
1313 phase is now an insertion mode. Treatments for white
1314 space character tokens were incorrect for some
1315 insertion modes. An old |meta| case was not removed.
1316
1317 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1318
1319 * HTML.pm.src: |meta| charset declaration extraction
1320 implemented (but changing the encoding is not yet:-).
1321
1322 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1323
1324 * Charset/: New directory.
1325
1326 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1327
1328 * H2H.pm: New Perl module (created from
1329 manakai's H2H.dis).
1330
1331 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1332
1333 * XMLSerializer.pm: New Perl module (created from
1334 manakai's SimpleLS.dis).
1335
1336 2007-07-07 Wakaba <wakaba@suika.fam.cx>
1337
1338 * HTML.pm.src: |<!---x-->| was not processed correctly.
1339
1340 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1341
1342 * HTML.pm.src: Report correct error message
1343 for |<body></div></body>|.
1344
1345 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1346
1347 * HTMLTable.pm: An error description was incorrect.
1348
1349 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1350
1351 * ContentChecker.pm: Return |{term}| list.
1352
1353 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1354
1355 * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
1356 nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
1357 and <title> outside of head).
1358
1359 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1360
1361 * IMTChecker.pm: Report warning for unregistered
1362 and private types/subtypes.
1363
1364 * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
1365 URIChecker.pm, HTMLTable.pm: Error messages are now
1366 consistent; they are all listed in
1367 <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
1368
1369 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1370
1371 * ContentChecker.pm: |<img ismap>| not in |<a></a>|
1372 is now reported as an error. |<datalist>| is implemented.
1373 Attribute checkers for |<command>| and |<menu>| are
1374 added. Support for |contextmenu| global attribute
1375 is added.
1376
1377 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1378
1379 * HTML.pm.src (_reset_insertion_mode): Interpretation
1380 of Step 3 has been changed.
1381
1382 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1383
1384 * HTML.pm.src: Late |<html>| parse error is implemented.
1385
1386 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1387
1388 * URIChecker.pm (check_iri_reference): A |decode| method name was
1389 incorrect.
1390
1391 * ContentChecker.pm: Support for the |footer| element.
1392 Check URI syntax for space-separated URI attributes.
1393 Support for the |tabindex| attribute. Support
1394 for |datetime| attribute.
1395
1396 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1397
1398 * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
1399 character references). HTML5 revision 1.146 (white space
1400 characters before root start tag). HTML5 revision
1401 1.148 (named character references in attribute values).
1402 HTML5 revision 1.152 (<plaintext>.innerHTML get).
1403
1404 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1405
1406 * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
1407 in <head>).
1408
1409 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1410
1411 * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>
1412 in body).
1413
1414 * ContentChecker.pm: HTML5 revision 938 (scoped="").
1415
1416 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1417
1418 * HTML.pm.src: HTML5 revision 923 (matching end tag in CDATA
1419 or RCDATA in fragment parsing mode). HTML5 revision
1420 924 (<!--> and <!--->). HTML5 revision 926 (hn in hn).
1421
1422 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1423
1424 * HTML.pm.src (get_inner_html): HTML5 revision 922 (inner_html
1425 for <pre> and <textarea>).
1426
1427 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1428
1429 * HTML.pm.src: HTML5 revision 920 (<isindex>).
1430
1431 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1432
1433 * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).
1434
1435 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1436
1437 * HTML.pm.src: HTML5 revision 916 (</body>).
1438 HTML5 revision 917 (conforming bare &).
1439
1440 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1441
1442 * NanoDOM.pm (manakai_is_html): Setting to false did not work.
1443
1444 * HTML.pm.src: HTML5 revision 914 (</ in CDATA, RCDATA).
1445 HTML5 revision 915 (<nobr>).
1446
1447 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1448
1449 * HTML.pm.src: HTML5 revisions 908, 909, 912, and 913 (quirks mode).
1450
1451 * NanoDOM.pm (manakai_is_html, manakai_compat_mode, compat_mode):
1452 New attributes.
1453
1454 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1455
1456 * HTML.pm.src: HTML5 revisions 900, 901, 902, and 911 (<
1457 in tags).
1458
1459 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1460
1461 * .cvsignore: |Entities.html| is added.
1462
1463 * HTML.pm.src: |$entity_char| is removed and
1464 requires |Whatpm::_NamedEntityList| instead.
1465 HTML5 revision 898 (refc), except that lack of refc
1466 is parse error.
1467
1468 * mkentitylist.pl: New script.
1469
1470 * Makefile (all): |_NamedEntityList.pm| is added.
1471 (_NamedEntityList.pm, Entities.html): New rules.
1472
1473 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1474
1475 * HTML.pm.src: Parse errors immediately after U+000D
1476 were ignored and U+000D immediately following another
1477 U+000D was not converted to U+000A.
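The intended newline behavior can be sketched as below: every U+000D becomes U+000A, and a U+000A directly after a U+000D is absorbed into the same line break. This is an illustrative Python function (the name |normalize_newlines| is an assumption), not the tokenizer code in HTML.pm.src.

```python
def normalize_newlines(text: str) -> str:
    """Convert CR and CRLF to LF; consecutive CRs each yield one LF."""
    out = []
    i = 0
    while i < len(text):
        ch = text[i]
        if ch == "\r":
            out.append("\n")
            # A LF right after this CR belongs to the same line break.
            if i + 1 < len(text) and text[i + 1] == "\n":
                i += 1
        else:
            out.append(ch)
        i += 1
    return "".join(out)
```

The case fixed in this entry corresponds to input such as "a\r\rb", which has to produce two line breaks rather than one.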
1478
1479 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1480
1481 * HTML.pm.src (set_inner_html): HTML5 revision 892 (adopt
1482 nodes before appended). Parser was not ready for NULL
1483 parse error and escape flag.
1484
1485 * NanoDOM.pm (adopt_node): New.
1486
1487 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1488
1489 * HTML.pm.src: HTML5 revision 886 (insane comment in
1490 CDATA and RCDATA). Note that the current implementation
1491 simply repeats what the spec says, which may not be
1492 the best way to do it.
1493
1494 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1495
1496 * HTML.pm.src: HTML5 revision 884 (</form> does not close
1497 the form element if a descendant element without an implied end tag is
1498 still open).
1499
1500 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1501
1502 * ContentChecker.pm: HTML5 revision 881 (Make |id| attribute with
1503 space characters non-conforming).
1504
1505 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1506
1507 * HTML.pm.src: An error message was incorrect.
1508 HTML5 revision 869 (C1 character references).
1509
1510 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1511
1512 * HTML.pm.src: HTML5 revision 867 (a LF at the beginning of
1513 a |textarea| is removed).
1514
1515 2007-06-05 Wakaba <wakaba@suika.fam.cx>
1516
1517 * NanoDOM.pm (get_attribute_node_ns): New method.
1518
1519 * ContentChecker.pm: |script| |async| and |defer|
1520 no longer require |src|. |async| MUST NOT be
1521 specified if |defer| is specified. (HTML5 revision 858).
1522
1523 2007-05-30 Wakaba <wakaba@suika.fam.cx>
1524
1525 * HTML.pm.src: |<form><form>| went into an infinite loop.
1526
1527 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1528
1529 * ContentChecker.pm (html): Set |is_root| (allowed
1530 as a document element) flag on.
1531 (new): Removed.
1532 (check_document): New method.
1533
1534 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1535
1536 * ContentChecker.pm (thead, tfoot): Checker specifications
1537 were incorrect.
1538
1539 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1540
1541 * ContentChecker.pm ($HTMLURIAttrChecker): Include
1542 error position in the |type| option of the error.
1543
1544 * HTMLTable.pm (form_table): The |$onerror| parameter
1545 is now optional. Some bugs are fixed.
1546
1547 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1548
1549 * HTMLTable.pm: New module.
1550
1551 * ContentChecker.pm (table): Invoke table model error checker.
1552
1553 * NanoDOM.pm (first_child, get_attribute_ns): New.
1554
1555 2007-05-26 Wakaba <wakaba@suika.fam.cx>
1556
1557 * ContentChecker.pm ($HTMLLinkTypesAttrChecker): New checker.
1558 (link/@rel, a/@rel, area/@rel): Use new checker.
1559
1560 * Makefile (_LinkTypeList.pm, RelExtensions.html): New rules.
1561
1562 * _LinkTypeList.pm: New file.
1563
1564 * mklinktypelist.pl: New file.
1565
1566 * .cvsignore: |RelExtensions.html| added.
1567
1568 * NanoDOM.pm (child_nodes): Returns an empty array
1569 for non-child-containing node types.
1570 (text_content): New attribute.
1571
1572 2007-05-26 Wakaba <wakaba@suika.fam.cx>
1573
1574 * IMTChecker.pm: New module.
1575
1576 * ContentChecker.pm ($HTMLIMTAttrChecker): Call IMTChecker
1577 to test parameter value validity.
1578
1579 * HTML.pm.src ($style_start_tag): Attributes were
1580 discarded.
1581
1582 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1583
1584 * ContentChecker.pm ($HTMLURIAttrChecker): Implemented.
1585
1586 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1587
1588 * URIChecker.pm: All recommendations from RFC 3986
1589 and RFC 3987 are listed (not all testable items are checked yet).
1590
1591 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1592
1593 * URIChecker.pm: New module.
1594
1595 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1596
1597 * ContentChecker.pm: Now most attributes are implemented
1598 or associated to some placeholder.
1599 ($ElementDefault): Warn unknown attributes
1600 for unknown elements as "attribute not supported".
1601 ($HTMLLanguageTagAttrChecker, $HTMLMQAttrChecker): New placeholders.
1602 ($HTMLUsemapAttrChecker, $HTMLTargetAttrChecker): New checkers.
1603 (|a| attribute checker): Reimplemented.
1604
1605 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1606
1607 * ContentChecker.pm ($HTMLEventHandlerAttrChecker): New placeholder.
1608 ($HTMLAttrChecker): Event handler content attributes
1609 are added.
1610 (link, embed): Required attribute is now checked.
1611 (embed): Unknown local attributes are no longer warned.
1612
1613 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1614
1615 * ContentChecker.pm ($HTMLSpaceURIsAttrChecker): New placeholder.
1616 ($HTMLIMTAttrChecker): New checker.
1617 (link@rel, link@href, link@type, style@type,
1618 a@href, a@ping, a@ping, a@type, embed@src, embed@type,
1619 object@data, object@type, source@src, source@type, area@alt,
1620 area@shape, area@coords, area@href,
1621 area@ping, area@rel, area@type, script@src,
1622 script@defer, script@async, script@type): Checkers added.
1623
1624 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1625
1626 * ContentChecker.pm: Descendant checking was incorrect.
1627
1628 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1629
1630 * ContentChecker.pm: Support |xml:*| and |xmlns:*|
1631 attributes. Report an error if |Element.prefix|
1632 is |xmlns|.
1633
1634 * NanoDOM.pm (prefix): New attribute.
1635
1636 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1637
1638 * HTML.pm.src: In |main| phase, |in body| insertion
1639 mode, action for |<iframe>| was missing.
1640
1641 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1642
1643 * ContentChecker.pm: Support for many of HTML5 elements.
1644 ($GetHTMLNonNegativeIntegerAttrChecker): New.
1645
1646 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1647
1648 * ContentChecker.pm: Support for most elements up to |progress|.
1649 ($HTMLURIAttrChecker): Placeholder.
1650 ($HTMLIntegerAttrChecker, $GetHTMLFloatingPointNumberAttrChecker):
1651 New.
1652
1653 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1654
1655 * ContentChecker.pm: Attribute checkers
1656 for global attributes, |html|, |base|, |style|, and |meta|.
1657
1658 * NanoDOM.pm (insert_before): Weaken reference
1659 to the parent node.
1660 (Attr::new): Set |owner_element| attribute.
1661 (namespace_uri, manakai_local_name): New attribute implementations.
1662 (owner_element): New attribute.
1663
1664 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1665
1666 * ContentChecker.pm ($AttrChecker, $HTMLAttrChecker,
1667 $AnyChecker->{attr_checker}, $HTMLAttrsChecker,
1668 $Element->{$HTML_NS}->{''}): New.
1669 (check_element): Invoke attrs_checker for each element.
1670
1671 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1672
1673 * ContentChecker.pm: Don't use |manakai_element_type_match|.
1674
1675 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1676
1677 * ContentChecker.pm: Use hashes rather than lists for
1678 element type tests.
1679
1680 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1681
1682 * ContentChecker.pm: Don't generate duplicate
1683 error when an element type is put in the "minus" list
1684 and the element type is not allowed explicitly in the particular
1685 element content model.
1686 (html:a checker): New checker.
1687 (html:details, html:datagrid): New checkers.
1688 (html:legend): New checker.
1689
1690 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1691
1692 * ContentChecker.pm (html:li checker): Implemented.
1693
1694 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1695
1696 * ContentChecker.pm ($HTMLInlineOrStriclyInlineChecker): New
1697 checker.
1698 (html:dd checker): New checker.
1699 (html:q, html:em, html:strong, html:small,
1700 html:m, html:dfn, html:code, html:samp, html:span): New checkers.
1701
1702 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1703
1704 * ContentChecker.pm ($AnyChecker): Renamed
1705 from |$ElementDefault->{checker}|.
1706 ($ElementDefault->{checker}): Throw an error that
1707 the element type is not supported by the checker.
1708 ($HTMLMetadataElement): |html:base| was missing.
1709 ($HTMLEmptyChecker): Don't throw an error
1710 for inter-element whitespace nodes.
1711 (html:html checker): Errors were not
1712 thrown even if |html:head| and/or |html:body|
1713 children were missing.
1714 (html:head checker): An error was not
1715 thrown if <meta charset> appeared after other
1716 elements.
1717
1718 2007-05-05 Wakaba <wakaba@suika.fam.cx>
1719
1720 * ContentChecker.pm: |footer|, |video|, |audio|, |script|,
1721 and |noscript| elements are implemented.
1722 (new): New method.
1723
1724 2007-05-04 Wakaba <wakaba@suika.fam.cx>
1725
1726 * ContentChecker.pm: New module.
1727
1728 2007-05-04 Wakaba <wakaba@suika.fam.cx>
1729
1730 * NanoDOM.pm (manakai_parent_element,
1731 document_element, manakai_local_name,
1732 manakai_element_type_match): New method.
1733
1734 2007-05-03 Wakaba <wakaba@suika.fam.cx>
1735
1736 * HTML.pm.src: Replace decimal and hexadecimal numeric
1737 entities in C1 range using Windows-1252 mapping. Bare LF
1738 did not count as new line for error reporting.
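The C1 replacement can be sketched as below: a numeric reference whose code point falls in U+0080..U+009F is reinterpreted as a Windows-1252 byte. This is a Python illustration only; the function name |map_c1_reference| is hypothetical, and leaving the five unassigned Windows-1252 bytes unchanged is an assumption rather than something this entry states.

```python
def map_c1_reference(code_point: int) -> str:
    """Map a numeric character reference value to a character."""
    if 0x80 <= code_point <= 0x9F:
        try:
            # Reinterpret the C1 code point as a Windows-1252 byte.
            return bytes([code_point]).decode("cp1252")
        except UnicodeDecodeError:
            # Byte unassigned in Windows-1252: keep the code point (assumption).
            return chr(code_point)
    return chr(code_point)
```

For example, a reference to code point 0x80 yields U+20AC (the euro sign) instead of a C1 control character.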
1739
1740 2007-05-02 Wakaba <wakaba@suika.fam.cx>
1741
1742 * NanoDOM.pm (DOMImplementation): New class.
1743 (append_child): Weaken the |parent_node| reference.
1744 (create_element_ns, Element new): Set the |owner_document|
1745 reference.
1746 (implementation): New attribute.
1747 (owner_document, local_name, namespace_uri): New attributes.
1748
1749 * HTML.pm.src (parse_string): Line and column numbers
1750 are now provided to error handler.
1751 (!!!parse-error): Short descriptions are added.
1752 (_construct_tree): Split into three methods; support
1753 for innerHTML mode.
1754 (set_inner_html): New method.
1755
1756 2007-05-01 Wakaba <wakaba@suika.fam.cx>
1757
1758 * NanoDOM.html: Documentation is added.
1759
1760 * HTML.pod, ContentType.html: Documentation is revised.
1761
1762 * .cvsignore: Pod2html temporary files are added.
1763
1764 * Makefile: Make |NanoDOM.html|.
1765
1766 2007-05-01 Wakaba <wakaba@suika.fam.cx>
1767
1768 * HTML.pm.src (parse_string): New method.
1769 (get_inner_html): Renamed from |inner_html|.
1770
1771 * Makefile: A rule for |HTML.html| is added.
1772
1773 * HTML.pod: New documentation.
1774
1775 2007-05-01 Wakaba <wakaba@suika.fam.cx>
1776
1777 * NanoDOM.pm (last_child, previous_sibling): New attributes.
1778 (clone_node): Attribute nodes were not completely copied.
1779
1780 * HTML.pm.src: Many bugs are fixed.
1781
1782 2007-04-30 Wakaba <wakaba@suika.fam.cx>
1783
1784 * HTML.pm.src: Some typos are fixed.
1785
1786 2007-04-30 Wakaba <wakaba@suika.fam.cx>
1787
1788 * mkhtmlparser.pl, Makefile: References to the |HTML-consume-entity.src|
1789 are removed.
1790
1791 * HTML.pm.src: Tokenizer's handling of named entities is rewritten.
1792
1793 * HTML-consume-entity.src: Removed.
1794
1795 2007-04-30 Wakaba <wakaba@suika.fam.cx>
1796
1797 * HTML.pm.src: Tokenizer's handling of hexadecimal
1798 numeric entities is rewritten.
1799
1800 2007-04-30 Wakaba <wakaba@suika.fam.cx>
1801
1802 * HTML.pm.src: Some tokenizer bugs are fixed.
1803
1804 2007-04-30 Wakaba <wakaba@suika.fam.cx>
1805
1806 * HTML.pm.src: The tree construction stage is implemented.
1807
1808 * mkhtmlparser.pl: New macros are added.
1809
1810 2007-04-28 Wakaba <wakaba@suika.fam.cx>
1811
1812 * ContentType.pm: A note on a bug in the specification
1813 is removed since it has now been fixed.
1814
1815 * .cvsignore: New file.
1816
1817 2007-04-28 Wakaba <wakaba@suika.fam.cx>
1818
1819 * HTML.pm.src, HTML-consume-entity.src: New files.
1820
1821 * Makefile (HTML.pm): New rule.
1822
1823 * mkhtmlparser.pl: New script.
1824
1825 2007-04-25 Wakaba <wakaba@suika.fam.cx>
1826
1827 * Makefile: New file.
1828
1829 2007-04-24 Wakaba <wakaba@suika.fam.cx>
1830
1831 * ContentType.pm: An error in pod is fixed.
1832
1833 2007-04-24 Wakaba <wakaba@suika.fam.cx>
1834
1835 * ContentType.pm: Documentation is added.
1836
1837 2007-04-24 Wakaba <wakaba@suika.fam.cx>
1838
1839 * ContentType.pm: New Perl module.
1840
1841 * ChangeLog: New file.
1842
1843
