/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Contents of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.281 - (show annotations) (download)
Sat Aug 30 12:33:36 2008 UTC (16 years, 10 months ago) by wakaba
Branch: MAIN
Changes since 1.280: +6 -0 lines
++ whatpm/t/ChangeLog	30 Aug 2008 12:33:27 -0000
	* content-model-1.dat, content-model-2.dat: Test data updated
	and added for HTML5 revision 2062 and so on (@lang and @xml:lang).

2008-08-30  Wakaba  <wakaba@suika.fam.cx>

++ whatpm/Whatpm/ChangeLog	30 Aug 2008 12:30:55 -0000
	* ContentChecker.pm: |xml:lang| attribute value must be same
	as |lang| attribute value for HTML elements (HTML5 revision 2062
	and so on).

2008-08-30  Wakaba  <wakaba@suika.fam.cx>

++ whatpm/Whatpm/ContentChecker/ChangeLog	30 Aug 2008 12:32:39 -0000
	* HTML.pm: |xml:lang| attribute value must be same
	as |lang| attribute value for HTML elements (HTML5 revision 2062
	and so on).  |lang| attribute in XML document is no longer
	disallowed (HTML5 revision 2062).  |xml:lang| is allowed
	in HTML document (HTML5 revision 2062 and so on).

2008-08-30  Wakaba  <wakaba@suika.fam.cx>

1 2008-08-30 Wakaba <wakaba@suika.fam.cx>
2
3 * ContentChecker.pm: |xml:lang| attribute value must be same
4 as |lang| attribute value for HTML elements (HTML5 revision 2062
5 and so on).
6
7 2008-08-30 Wakaba <wakaba@suika.fam.cx>
8
9 * ContentChecker.pm: Error level definition for |xml_id_error|
10 was missing.
11
12 * URIChecker.pm: The end of the URL should be marked as the
13 error location for an empty path error. The position
14 between the userinfo and the port components should be
15 marked as the error location for an empty host error.
16
17 2008-08-30 Wakaba <wakaba@suika.fam.cx>
18
19 * URIChecker.pm: Set parameters representing where in the
20 value the error occurs for errors. Report unknown
21 address format error in warning level, since address
22 formats are rarely added. Path segments starting with "/.."
23 were misinterpreted as a dot-segment.
24
25 2008-08-30 Wakaba <wakaba@suika.fam.cx>
26
27 * URIChecker.pm (check_iri_reference): Requires
28 |Message::DOM::DOMImplementation|.
29
30 2008-08-29 Wakaba <wakaba@suika.fam.cx>
31
32 * IMTChecker.pm: Updated for the new error reporting architecture.
33
34 * ContentChecker.pm: Error levels for IMTs are added.
35
36 2008-08-17 Wakaba <wakaba@suika.fam.cx>
37
38 * H2H.pm (_shift_token): Support for unquoted HTML attribute
39 values.
40
41 2008-08-16 Wakaba <wakaba@suika.fam.cx>
42
43 * CacheManifest.pm: Support for new style of error
44 reports.
45
46 * HTML.pm.src: Set line=1, column=1 to the document node.
47
48 2008-08-16 Wakaba <wakaba@suika.fam.cx>
49
50 * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
51 and URL checkers. Support for more error levels for bogus
52 langauge tag and URL "standards".
53
54 * LangTag.pm, URIChecker.pm: Support for new style error
55 level reporting.
56
57 2008-08-15 Wakaba <wakaba@suika.fam.cx>
58
59 * ContentChecker.pm: Support for RDF/XML error levels.
60
61 * HTMLTable.pm, RDFXML.pm: Support for new style of error level
62 specifying. Error types are revised.
63
64 2008-08-15 Wakaba <wakaba@suika.fam.cx>
65
66 * ContentChecker.pm: All error reporting method calls are
67 renewed.
68
69 2008-08-15 Wakaba <wakaba@suika.fam.cx>
70
71 * HTML.pm.src: All error type names and "text" parameters
72 are revised. Use new style for "level" specification.
73
74 * mkhtmlparser.pl: Use new style for "level" specification.
75
76 2008-08-03 Wakaba <wakaba@suika.fam.cx>
77
78 * WebIDL.pm (parse_char_string): Simplified error
79 reporting process for broken ignored valuetype definition.
80 (Valuetype idl_text): Support for special "DOMString" name.
81
82 2008-08-03 Wakaba <wakaba@suika.fam.cx>
83
84 * WebIDL.pm ($get_scoped_name): Append "::::" if the last
85 terminal of the ScopedName is "DOMString", such that whether
86 the last part of the scoped name is "DOMString" or "_DOMString"
87 later. It is necessary to determine whether a |typedef|
88 definition should be ignored or not.
89 (parse_char_string): Unescape the identifier of
90 exception members.
91 ($resolve): Return undef for builtin types and sequence<T>
92 types (we might not have to do this, however...).
93 (check): Support checking for Exceptions, Valuetypes,
94 and Typedefs.
95 ($serialize_type): Support for "DOMString::::" syntax.
96 (Typedef idl_text): Output Type as "DOMString" if it
97 is really "DOMString" (i.e. its internal representation
98 is "::DOMString::").
99
100 2008-08-03 Wakaba <wakaba@suika.fam.cx>
101
102 * WebIDL.pm ($resolve): New code, based on resolve code
103 for constant types in the |check| method.
104 (check): Support for checking of attributes, operations, and
105 arguments.
106 (Attribute/Operation idl_text): Exception names in getraises,
107 setraises, and raises clauses is serizlied by |$serialize_type|
108 code.
109
110 2008-08-02 Wakaba <wakaba@suika.fam.cx>
111
112 * WebIDL.pm ($integer): Order of selections are changed to match
113 hexadecimal numbers (the original pattern, taken from the spec,
114 was not work for hexadecimal numbers, because the "0" prefix
115 matches to the [0-7]* part (as an empty string) and therefore
116 it does not match with remaining "x..." part of a "0x..." integer
117 literal.
118 ($get_type): It now returns a string, not an array reference,
119 for regular types and |sequence| types (i.e. it in any case
120 returns a string).
121 ($get_next_token): The second item in the array that represents
122 a integer or float token is now a Perl number value, not the
123 original string representation of the number.
124 (check): Support for const value consistency checking.
125 No extended attribute is defined for constants.
126 (Node subclasses): Use simple strings rather than array references
127 for default data type values.
128 ($serialize_type): Type values are now simple strings.
129 (value): If the new attribute value is a false value, then
130 a FALSE value is set to the attribute.
131
132 2008-08-02 Wakaba <wakaba@suika.fam.cx>
133
134 * WebIDL.pm ($get_scoped_name): Now scoped names are stored
135 in its stringified format ("scoped name" as defined in the
136 spec). Note that future version of this module should not use
137 array references for type values and the |type_text| attribute
138 should be made obsolete.
139 (parse_char_string): Unescape attribute names.
140 (check): Support for checking of whether inherited interfaces
141 are actually defined or not. Support for checking of whether
142 interface member identifiers are duplicated or not.
143 ($serialize_type): Scoped names are returned as is. A future
144 version of this code should escape identifiers other than "DOMString",
145 otherwise the idl_text would be non-conforming.
146
147 2008-08-02 Wakaba <wakaba@suika.fam.cx>
148
149 * WebIDL.pm (parse_char_string): Set line/column numbers
150 to generated nodes. Unescape identifiers. Extended attributes
151 for Definition's were ignored.
152 (append_child): Set |parent_node| attribute.
153 (parent_node): New attribute.
154 (check): Support interface/exception members. Support
155 extended attributes. Support definition identifier uniqueness
156 constraint.
157 (qualified_name): New attribute.
158 (Interface/Exception idl_text): Extended attributes were
159 not prepended to the returned text.
160
161 2008-08-02 Wakaba <wakaba@suika.fam.cx>
162
163 * WebIDL.pm (parse_char_string): Set line/column numbers
164 to interface object experimentally. s/shift/pop/g, shift
165 would make things wrong. Support for interface forward
166 declarations was missing. Broken interface declarations
167 with no block were not ignored entirely.
168 (Whatpm::WebIDL::Node): New abstract class. This class
169 makes things easier.
170 (child_nodes): New attribute. Unlike DOM's attribute with
171 same name, this attribute returns a dead list of nodes for
172 simplicity.
173 (get_user_data, set_user_data): New methods.
174 (Module idl_text): A SPACE character should be inserted
175 before the |{| character.
176 (Interface idl_text): Support for interface forward declarations.
177 (is_forward_declaration): New attribute.
178
179 2008-07-19 Wakaba <wakaba@suika.fam.cx>
180
181 * WebIDL.pm (type_text): Better serializer.
182
183 2008-07-19 Wakaba <wakaba@suika.fam.cx>
184
185 * WebIDL.pm: Revise forward-compatible parsing so that
186 it now can handle broken extended attributes and as such.
187
188 2008-07-19 Wakaba <wakaba@suika.fam.cx>
189
190 * WebIDL.pm: Real support for extended attributes.
191 Support for extended attributes with arguments.
192
193 2008-07-19 Wakaba <wakaba@suika.fam.cx>
194
195 * WebIDL.pm: Support for |exception| syntax.
196 (Interface->idl_text): Tentative support for inheritances.
197
198 2008-07-19 Wakaba <wakaba@suika.fam.cx>
199
200 * WebIDL.pm: Hierarchical scoped name support was broken.
201 Support for raises, setraises, and getraises syntaxes.
202
203 2008-07-18 Wakaba <wakaba@suika.fam.cx>
204
205 * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
206 proper support for types, extended attributes, and exceptions yet).
207 WebIDL parser, version 1 (no support for exceptions yet,
208 no proper support for extended attributes yet).
209
210 2008-07-09 Wakaba <wakaba@suika.fam.cx>
211
212 * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
213
214 2008-06-29 Wakaba <wakaba@suika.fam.cx>
215
216 * WebIDL.pm: Support for valuetype and const.
217
218 2008-06-29 Wakaba <wakaba@suika.fam.cx>
219
220 * WebIDL.pm: New module.
221
222 2008-06-15 Wakaba <wakaba@suika.fam.cx>
223
224 * Makefile (Entities.html): URI changed.
225
226 2008-06-08 Wakaba <wakaba@suika.fam.cx>
227
228 * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
229
230 2008-06-01 Wakaba <wakaba@suika.fam.cx>
231
232 * HTML.pm.src (_get_next_token): A parse error was missing.
233
234 2008-06-01 Wakaba <wakaba@suika.fam.cx>
235
236 * mklinktypelist.pl: rel=contact is no longer part of the HTML5
237 spec (commented out). (HTML5 revision 1711).
238
239 2008-05-25 Wakaba <wakaba@suika.fam.cx>
240
241 * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
242
243 * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
244 as UTF-16 (HTML5 revision 1701).
245
246 2008-05-25 Wakaba <wakaba@suika.fam.cx>
247
248 * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
249 1692).
250
251 2008-05-25 Wakaba <wakaba@suika.fam.cx>
252
253 * HTML.pm.src: The secondary insertion mode used when switching
254 to foreign content is the "in body" insertion mode (HTML5 revision
255 1696).
256
257 2008-05-25 Wakaba <wakaba@suika.fam.cx>
258
259 * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
260 revision 1697).
261
262 2008-05-24 Wakaba <wakaba@suika.fam.cx>
263
264 * HTML.pm.src: Support for end-of-file token in foreign content
265 insertion mode (HTML5 revision 1693). Update SVG camelCase
266 attribute list (HTML5 revision 1700). <textarea> closes
267 </select> (HTML5 revision 1699). More start tags close in
268 foreign content insertion mode (HTML5 revision 1698).
269
270 2008-05-24 Wakaba <wakaba@suika.fam.cx>
271
272 * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
273
274 2008-05-24 Wakaba <wakaba@suika.fam.cx>
275
276 * HTML.pm.src: More robust charset parameter detection (HTML5
277 revision 1674).
278
279 2008-05-24 Wakaba <wakaba@suika.fam.cx>
280
281 * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
282 revision 1676).
283
284 2008-05-24 Wakaba <wakaba@suika.fam.cx>
285
286 * HTML.pm.src: Ignore language part of public identifiers for
287 quriks mode detection (HTML5 revision 1679).
288
289 2008-05-24 Wakaba <wakaba@suika.fam.cx>
290
291 * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
292 revision 1685).
293
294 2008-05-24 Wakaba <wakaba@suika.fam.cx>
295
296 * HTML.pm.src: Support for EOF in new states for tags (HTML5
297 revision 1684).
298
299 2008-05-24 Wakaba <wakaba@suika.fam.cx>
300
301 * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
302 work (HTML5 revision 1690).
303
304 2008-05-24 Wakaba <wakaba@suika.fam.cx>
305
306 * HTML.pm.src (_tree_construction_main): Change handling of
307 end tags in head insertion modes (HTML5 revision 1686).
308 (parse_char_string): Bug fix for non-utf8 character string handlings.
309 (parse_char_stream): |ungetc| does not work well for this context.
310
311 2008-05-18 Wakaba <wakaba@suika.fam.cx>
312
313 * HTML.pm.src (parse_byte_string): Redefined to invoke
314 |parse_byte_stream|.
315 (parse_byte_stream): New method.
316
317 2008-05-18 Wakaba <wakaba@suika.fam.cx>
318
319 * HTML.pm.src (parse_byte_string): Fix the column number reported
320 by encoding layer error reporter.
321
322 2008-05-17 Wakaba <wakaba@suika.fam.cx>
323
324 * HTML.pm.src (parse_byte_string): Use streaming decoder
325 rather than converting the whole byte string and then parsing.
326 Propagate errors in character encoding layer.
327 (get_next_token): Precise error reporting for |bare stago| error.
328
329 2008-05-17 Wakaba <wakaba@suika.fam.cx>
330
331 * HTML.pm.src (parse_char_stream): New method.
332 (parse_char_string): This method is now defined as an invocation
333 of the |parse_char_stream| method.
334
335 2008-05-17 Wakaba <wakaba@suika.fam.cx>
336
337 * HTML.pm.src (parse_byte_string): Report various status
338 of the sniffing as info-level errors. Support for new
339 decoding framework in parser resestting.
340 (new): Various default error levels were not set.
341
342 2008-05-17 Wakaba <wakaba@suika.fam.cx>
343
344 * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
345 algorithm, except for the actual sniffing, is implemented
346 with new framework with Message::Charset::Info.
347
348 2008-05-16 Wakaba <wakaba@suika.fam.cx>
349
350 * CacheManifest.pm (_parse): Drop fragment identifiers from
351 URIs in fallback section (HTML5 revision 1596).
352
353 2008-05-10 Wakaba <wakaba@suika.fam.cx>
354
355 * Makefile (Entities.html): URI has changed.
356
357 2008-05-10 Wakaba <wakaba@suika.fam.cx>
358
359 * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
360 1553).
361
362 2008-05-06 Wakaba <wakaba@suika.fam.cx>
363
364 * ContentChecker.pm: Noted that those returned in |table| are
365 no longer table elements, but table objects returned
366 by Whatpm::HTMLTable.
367
368 * HTMLTable.pm (form_table): Return table element node
369 as |$table->{element}|.
370 (assign_header): Support for the |headers=""| attribute.
371
372 2008-05-06 Wakaba <wakaba@suika.fam.cx>
373
374 * HTMLTable.pm (assign_header): New function; first version
375 with no support for headers="".
376 (form_table): Include table width and height to the returned
377 table object for covenience. Indexing in column assignement was wrong.
378 Set whether a data cell is empty or not for convenience.
379
380 2008-05-05 Wakaba <wakaba@suika.fam.cx>
381
382 * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
383
384 2008-05-05 Wakaba <wakaba@suika.fam.cx>
385
386 * HTMLTable.pm: How table model errors are detected is
387 changed (HTML5 revision 1387).
388
389 2008-05-05 Wakaba <wakaba@suika.fam.cx>
390
391 * HTMLTable.pm: The algorithm now moves |tfoot| elements
392 to the end of the table (HTML5 revision 1380).
393
394 2008-05-05 Wakaba <wakaba@suika.fam.cx>
395
396 * HTMLTable.pm: The algorithm is now 0-based indexing, instead
397 of 1-based (HTML5 revision 1376).
398
399 2008-05-05 Wakaba <wakaba@suika.fam.cx>
400
401 * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
402 and "Content-Encoding" no longer prevent sniffing (HTML5
403 revision 1288).
404
405 2008-05-05 Wakaba <wakaba@suika.fam.cx>
406
407 * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
408 1282).
409
410 2008-05-03 Wakaba <wakaba@suika.fam.cx>
411
412 * ContentChecker.pm: Support for global attributes.
413 Status of XML specs are added.
414
415 2008-05-03 Wakaba <wakaba@suika.fam.cx>
416
417 * ContentChecker.pm (check_element): Support for |template=""|
418 and |ref=""| attribute (referent element type checking).
419
420 2008-04-29 Wakaba <wakaba@suika.fam.cx>
421
422 * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
423 1500) is implemented (except for IDNA part and URI-scheme-specific
424 knowledge). Line number counting was wrong for LF-only
425 documents.
426
427 2008-04-13 Wakaba <wakaba@suika.fam.cx>
428
429 * HTML.pm.src: Raise an parse error for any disallowed
430 character (HTML5 revision 1263).
431
432 2008-04-13 Wakaba <wakaba@suika.fam.cx>
433
434 * mkentitylist.pl: Support for new HTML5 entity table format (the
435 definition for |AElig;| was missing).
436
437 2008-04-13 Wakaba <wakaba@suika.fam.cx>
438
439 * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
440 name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
441
442 2008-04-13 Wakaba <wakaba@suika.fam.cx>
443
444 * HTML.pm.src: List of element names that close foreign content
445 insertion mode is added (HTML5 revisions 1412 and 1418).
446
447 2008-04-13 Wakaba <wakaba@suika.fam.cx>
448
449 * HTML.pm.src: Support for |mglyph| and |malignmark|
450 elements (HTML5 revision 1410).
451
452 2008-04-13 Wakaba <wakaba@suika.fam.cx>
453
454 * HTML.pm.src: Support for new long MathML entities (HTML5
455 revision 1406).
456
457 2008-04-13 Wakaba <wakaba@suika.fam.cx>
458
459 * HTML.pm.src: CDATA section support for MathML and SVG
460 elements (HTML5 revision 1404 and 1420).
461
462 2008-04-12 Wakaba <wakaba@suika.fam.cx>
463
464 * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
465 elements (HTML5 revision 1404). Unused !!!macro definitions
466 are removed.
467
468 2008-04-12 Wakaba <wakaba@suika.fam.cx>
469
470 * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
471 are raised is changed (HTML5 revision 1404).
472
473 2008-04-06 Wakaba <wakaba@suika.fam.cx>
474
475 * HTML.pm.src: Category sets that are no longer used are removed.
476
477 2008-04-06 Wakaba <wakaba@suika.fam.cx>
478
479 * HTML.pm.src: The ->[1] property of stack entries are now
480 replaced by constants representing element category.
481
482 2008-04-06 Wakaba <wakaba@suika.fam.cx>
483
484 * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
485 for error reporting. (This is a preparation for using constant
486 value for ->[1].)
487
488 2008-03-22 Wakaba <wakaba@suika.fam.cx>
489
490 * RDFXML.pm: Typo fixed.
491
492 2008-03-22 Wakaba <wakaba@suika.fam.cx>
493
494 * ContentChecker.pm: |fact_level| is now treated
495 as same as |must_level|, i.e. level = |m|.
496 (check_element): Make list of URIs in the DOM.
497
498 2008-03-21 Wakaba <wakaba@suika.fam.cx>
499
500 * RDFXML.pm: Language accessor implemented.
501 Local (null-namespace) attribute support.
502
503 2008-03-21 Wakaba <wakaba@suika.fam.cx>
504
505 * RDFXML.pm: Factored out ID checking code.
506
507 2008-03-21 Wakaba <wakaba@suika.fam.cx>
508
509 * RDFXML.pm: TODO items noted. Validation of ID
510 and URI attributes is implemented. Warn if unknown
511 value is used in rdf:parseType="" attribute.
512
513 * URIChecker.pm (check_rdf_uri_reference): New function.
514
515 2008-03-21 Wakaba <wakaba@suika.fam.cx>
516
517 * RDFXML.pm: bnodeid implemented. Relative references
518 are now resolved.
519
520 2008-03-21 Wakaba <wakaba@suika.fam.cx>
521
522 * ContentChecker.pm: RDF reification implemented.
523
524 * RDFXML.pm: undef vs false bug fixed.
525 Reification implemented.
526
527 2008-03-21 Wakaba <wakaba@suika.fam.cx>
528
529 * RDFXML.pm: s/id/ID/ for attribute name.
530 The |node| arguments are added for |ontriple| calls.
531 Too many "attribute not allowed" errors were raised.
532
533 * ContentChecker.pm: Initial experimental support for rdf:RDF
534 element.
535
536 2008-03-21 Wakaba <wakaba@suika.fam.cx>
537
538 * RDFXML.pm: New module.
539
540 2008-03-20 Wakaba <wakaba@suika.fam.cx>
541
542 * HTML.pm.src (set_inner_html): Line/column number
543 code was old one yet.
544
545 2008-03-20 Wakaba <wakaba@suika.fam.cx>
546
547 * HTML.pm.src: Better line/column reporting for "duplicate attribute"
548 errors. Line/column markings for DOCTYPE, comment, and
549 character tokens are reintroduced; otherwise,
550 error location for "not HTML5" error and errors
551 for implied elements are not attached.
552
553 2008-03-20 Wakaba <wakaba@suika.fam.cx>
554
555 * HTML.pm.src: Set line/column numbers to attributes.
556
557 * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
558 (value): Setter implemented.
559
560 * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
561
562 2008-03-20 Wakaba <wakaba@suika.fam.cx>
563
564 * HTML.pm.src: Unused line/column markings are removed.
565
566 2008-03-20 Wakaba <wakaba@suika.fam.cx>
567
568 * HTML.pm.src (_get_next_token): Remove |first_start_tag|
569 flag, which is no longer used.
570
571 2008-03-17 Wakaba <wakaba@suika.fam.cx>
572
573 * HTML.pm.src: Set line/column information to element nodes.
574
575 * mkhtmlparser.pl (!!!create-element, !!!insert-element,
576 and !!!insert-element-t): Set line/column information to
577 element nodes.
578
579 2008-03-17 Wakaba <wakaba@suika.fam.cx>
580
581 * HTML.pm.src (_get_next_token): The first "<" character
582 in "<?", "<>", or "</>" should be the error point.
583
584 2008-03-16 Wakaba <wakaba@suika.fam.cx>
585
586 * HTML.pm.src: Some more fixes on error position reporting.
587
588 2008-03-16 Wakaba <wakaba@suika.fam.cx>
589
590 * HTML.pm.src: Token-level precious error reporting.
591
592 2008-03-16 Wakaba <wakaba@suika.fam.cx>
593
594 * HTML.pm.src: Preparation for more precious error point
595 reporting.
596
597 2008-03-11 Wakaba <wakaba@suika.fam.cx>
598
599 * HTML.pm.src: Error type revised.
600
601 2008-03-11 Wakaba <wakaba@suika.fam.cx>
602
603 * HTML.pm.src: Similar codes are merged together, again.
604
605 2008-03-11 Wakaba <wakaba@suika.fam.cx>
606
607 * HTML.pm.src: Similar codes are merged together.
608
609 2008-03-10 Wakaba <wakaba@suika.fam.cx>
610
611 * mkhtmlparser.pl: Set "level" parameter to parse errors.
612
613 * HTML.pm.src: Code refined.
614
615 2008-03-09 Wakaba <wakaba@suika.fam.cx>
616
617 * HTML.pm.src: |</body>| treatement has been changed (HTML5
618 revision 1348). Note that I really don't know this makes
619 any difference in the black-box behavior of the parser.
620
621 2008-03-09 Wakaba <wakaba@suika.fam.cx>
622
623 * HTML.pm.src: New end-of-file token implementation (HTML5
624 revision 1348).
625
626 2008-03-09 Wakaba <wakaba@suika.fam.cx>
627
628 * HTML.pm.src: |applet| support (HTML5 revision 1347).
629
630 2008-03-09 Wakaba <wakaba@suika.fam.cx>
631
632 * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
633
634 2008-03-09 Wakaba <wakaba@suika.fam.cx>
635
636 * HTML.pm.src: Support for |<input>| in the "in select" insertion
637 mode and support for the "in select in table" insertion mode (HTML5
638 revision 1342).
639
640 2008-03-09 Wakaba <wakaba@suika.fam.cx>
641
642 * HTML.pm.src: No foster parenting for <script> and <script>
643 in non-tainted <table>s (HTML5 revision 1336).
644
645 2008-03-09 Wakaba <wakaba@suika.fam.cx>
646
647 * HTML.pm.src: Ignore white space characters between <html>
648 and <head> (HTML5 revision 1332).
649
650 2008-03-09 Wakaba <wakaba@suika.fam.cx>
651
652 * HTML.pm.src: Treat <input type=hidden> as if it were a
653 white space (HTML5 revision 1331).
654
655 2008-03-08 Wakaba <wakaba@suika.fam.cx>
656
657 * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
658 element (HTML5 revision 1330).
659
660 2008-03-08 Wakaba <wakaba@suika.fam.cx>
661
662 * HTML.pm.src: <title> is always appended to the current
663 element (HTML5 revision 1328).
664
665 2008-03-08 Wakaba <wakaba@suika.fam.cx>
666
667 * HTML.pm.src: White space in tainted tables are moved
668 into foster parents (HTML5 revision 1326).
669
670 2008-03-08 Wakaba <wakaba@suika.fam.cx>
671
672 * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
673 revision 1321).
674
675 2008-03-08 Wakaba <wakaba@suika.fam.cx>
676
677 * HTML.pm.src: |</h/n/>| case code rearranged to align with
678 the spec (HTML5 revision 1320). Note that we finally complete
679 all of HTML5 revision 1320 changes.
680
681 2008-03-08 Wakaba <wakaba@suika.fam.cx>
682
683 * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
684 tags (HTML5 revision 1320).
685
686 2008-03-08 Wakaba <wakaba@suika.fam.cx>
687
688 * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
689
690 2008-03-08 Wakaba <wakaba@suika.fam.cx>
691
692 * HTML.pm.src: A "generate implied end tags" code (t409.1)
693 could not be reached so that it is now removed (HTML5 revision
694 1320).
695
696 2008-03-08 Wakaba <wakaba@suika.fam.cx>
697
698 * HTML.pm.src: Code for the case of |</div>| and so on are revised
699 to align with new spec text (HTML5 revision 1320).
700
701 2008-03-08 Wakaba <wakaba@suika.fam.cx>
702
703 * HTML.pm.src: Remove strange |if| condition; however, it should
704 have had no harm in theory.
705
706 2008-03-08 Wakaba <wakaba@suika.fam.cx>
707
708 * HTML.pm.src (_tree_construction_main): '</p>' in body
709 case is split from other end tags for the preparation
710 of implementing HTML revision 1320.
711
712 2008-03-07 Wakaba <wakaba@suika.fam.cx>
713
714 * HTML.pm.src: Simplified "generate implied end tag" (HTML5
715 revision 1320).
716
717 2008-03-07 Wakaba <wakaba@suika.fam.cx>
718
719 * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
720 and "div" (HTML5 revision 1318). Add comments to where
721 |form| pointer association codes should be inserted (HTML5 revision
722 1319).
723
724 2008-03-06 Wakaba <wakaba@suika.fam.cx>
725
726 * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
727
728 2008-03-05 Wakaba <wakaba@suika.fam.cx>
729
730 * HTML.pm.src: Since the case t268 should never be reached (no
731 other token type, there are), it is replaced by a |die| statement.
732
733 2008-03-05 Wakaba <wakaba@suika.fam.cx>
734
735 * HTML.pm.src: Typo fixed.
736
737 2008-03-04 Wakaba <wakaba@suika.fam.cx>
738
739 * HTML.pm.src (_tree_construction_initial): Some limited quirks
740 doctypes were not uppercased for comparison.
741
742 2008-03-03 Wakaba <wakaba@suika.fam.cx>
743
744 * HTML.pm.src (tree construction and set_inner_html): Checkpoints
745 are added.
746
747 2008-03-03 Wakaba <wakaba@suika.fam.cx>
748
749 * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
750 are set. Cases that are unlikely reached are noted as so.
751
752 2008-03-03 Wakaba <wakaba@suika.fam.cx>
753
754 * HTML.pm.src: Checkpoints for debugging are added.
755
756 * mkhtmlparser.pl: Support for |!!!cp| syntax.
757
758 2008-03-03 Wakaba <wakaba@suika.fam.cx>
759
760 * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
761 for simplicity.
762
763 2008-03-03 Wakaba <wakaba@suika.fam.cx>
764
765 * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
766 revision 1307).
767
768 2008-03-03 Wakaba <wakaba@suika.fam.cx>
769
770 * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
771 is set are changed (HTML5 revision 1305).
772
773 2008-03-02 Wakaba <wakaba@suika.fam.cx>
774
775 * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
776 1292). Entities are not parsed in comment-like part in RCDATA
777 elements (HTML5 revision 1294). Allow bare & at the end
778 of attribute value literals (HTML5 revision 1296). More
779 quirks mode doctypes (HTML5 revision 1302). Requires spaces
780 between attributes and ban attribute names or unquoted
781 attribute values containing single or double quotes (HTML5
782 revision 1303).
783
784 2008-03-02 Wakaba <wakaba@suika.fam.cx>
785
786 * ContentChecker.pm: Typo fixed. Don't raise "character encoding"
787 and related errors unless it is an HTML document (though the spec
788 is unclear on whether it is applied to XHTML document).
789
790 * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
791 are added.
792
793 2008-03-02 Wakaba <wakaba@suika.fam.cx>
794
795 * HTML.pm.src: s/local_name/manakai_local_name/g.
796
797 2008-03-01 Wakaba <wakaba@suika.fam.cx>
798
799 * _NamedEntityList.pm: Updated (HTML5 revision 1286).
800
801 * HTML.pm.src: |charset| in |content| attribute is
802 case-insensitive (HTML5 revision 1270).
803
804 2008-02-26 Wakaba <wakaba@suika.fam.cx>
805
806 * ContentChecker.pm: New status constants are added.
807 ($ElementDefault): |status| added.
808 (check_element): Err for non-standard or deprecated elements.
809 (_attr_status_info): For non-standard or deprecated attributes.
810
811 2008-02-24 Wakaba <wakaba@suika.fam.cx>
812
813 * ContentChecker.pm (_attr_status_info): New internal method.
814
815 2008-02-24 Wakaba <wakaba@suika.fam.cx>
816
817 * ContentChecker.pm (check_element): Element standardized
818 status information is now dispatched.
819
820 2008-02-24 Wakaba <wakaba@suika.fam.cx>
821
822 * ContentChecker.pm (check_element): Fix |del|-and-significant
823 problem by adding some more arguments.
824
825 2008-02-24 Wakaba <wakaba@suika.fam.cx>
826
827 * ContentChecker.pm (check_element): Use context of
828 container-for-the-purpose-of-content-model element (not transparent
829 element) for |check_child_element| calling and significant
830 text flag marking. This reintroduces |<del>|-and-significant
831 problem again.
832
833 2008-02-24 Wakaba <wakaba@suika.fam.cx>
834
835 * ContentChecker.pm (check_element): Make semi-transparent
836 elements ignored for the purpose of phase changes in
837 content model checking.
838
839 2008-02-23 Wakaba <wakaba@suika.fam.cx>
840
841 * ContentChecker.pm (check_element): In-element state
842 was not properly managed for transparent cases.
843
844 2008-02-23 Wakaba <wakaba@suika.fam.cx>
845
846 * ContentChecker.pm (check_element): Support for |video|
847 and |audio| as semi-transparent elements.
848
849 2008-02-23 Wakaba <wakaba@suika.fam.cx>
850
851 * ContentChecker.pm ($HTMLSemiTransparentElements): New.
852 (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
853 Support for |html:object| as a semi-transparent element.
854
855 2008-02-23 Wakaba <wakaba@suika.fam.cx>
856
857 * ContentChecker.pm (check_element): The way to traverse
858 the tree is entirely revised to make it easier to track
859 the state of ancestors/descendants. As a result of this
860 revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
861 support for content model checking for HTML elements |figure|,
862 |object|, |video|, and |audio| and checking for XML elements (and
863 some XMLNS checkings) are dropped for now. They will be
864 reimplemented in due cource.
865
866 2008-02-17 Wakaba <wakaba@suika.fam.cx>
867
868 * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
869 token (HTML5 revision 1225).
870
871 2008-02-17 Wakaba <wakaba@suika.fam.cx>
872
873 * ContentChecker.pm ({unsupported_level}): New value.
874
875 * HTML.pm.src: Save whether |meta| |content| attribute
876 contains character references or not.
877
878 2008-02-17 Wakaba <wakaba@suika.fam.cx>
879
880 * ContentChecker.pm (_get_children): (Incompleted) attempt to
881 imlement significant content checking for contents
882 with |del| elements.
883
884 2008-02-17 Wakaba <wakaba@suika.fam.cx>
885
886 * ContenteChecker.pm ($HTMLTransparentElements): More
887 elements are added.
888 (_get_children): HTML |object| elements are now semi-transparent.
889
890 * NanoDOM.pm (manakai_html, manakai_head): New methods.
891
892 2008-02-16 Wakaba <wakaba@suika.fam.cx>
893
894 * CacheManifest.pm: HTML5 revision 1211 implemented.
895
896 * CacheManifest.pod: Updated.
897
898 2008-02-10 Wakaba <wakaba@suika.fam.cx>
899
900 * ContentChecker.pm (check_document, check_element): Support
901 for second argument ($onsubdoc).
902 (_get_css_parser): Removed (now it is part of WDCC).
903
904 2008-02-09 Wakaba <wakaba@suika.fam.cx>
905
906 * ContentChecker.pm (_get_css_parser): New.
907
908 2007-11-25 Wakaba <wakaba@suika.fam.cx>
909
910 * ContentChecker.pm ($AnyChecker): Old way to add child elements
911 for checking had been used.
912
913 2007-11-25 Wakaba <wakaba@suika.fam.cx>
914
915 * ContentChecker.pm (check_element): New todo item type |descendant|.
916
917 2007-11-23 Wakaba <wakaba@suika.fam.cx>
918
919 * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
920 specced in RFC 2046 and RFC 4288.
921 (application/atom+xml): Definition added.
922
923 2007-11-23 Wakaba <wakaba@suika.fam.cx>
924
925 * URIChecker.pm: Make RFC 3986 should-level errors
926 warnings (rather than SHOULD-level errors).
927
928 2007-11-23 Wakaba <wakaba@suika.fam.cx>
929
930 * NanoDOM.pm (get_user_data, set_user_data): New methods.
931
932 * HTML.pm.src: A flag for character references in attribute
933 values are added. Set |manakai_has_reference| user data
934 to |charset| attribute.
935
936 2007-11-23 Wakaba <wakaba@suika.fam.cx>
937
938 * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
939 attributes.
940
941 * ContentChecker.pm (check_document): Warn if charset requirements
942 cannot be tested.
943
944 2007-11-19 Wakaba <wakaba@suika.fam.cx>
945
946 * HTML.pm.src (parse_byte_string): Detect charset
947 by universalchardet if charset parameter is not specified.
948
949 * Makefile (Charset-all, Charset-clean): New rules.
950
951 2007-11-18 Wakaba <wakaba@suika.fam.cx>
952
953 * ContentChecker.pm (check_document): Check the existence
954 of character encoding declaration and selection of encoding
955 for HTML document.
956
957 2007-11-18 Wakaba <wakaba@suika.fam.cx>
958
959 * ContentType.pm (get_sniffed_type): Return also the official
960 type in list context.
961
962 2007-11-18 Wakaba <wakaba@suika.fam.cx>
963
964 * ContentType.pm: Sniffing with leading white space
965 ignoring (HTML5 revisions 1013 and 1016).
966
967 2007-11-18 Wakaba <wakaba@suika.fam.cx>
968
969 * ContentType.pm: HTML5 revision 1013 changes, except for leading
970 white spaces, are implemented.
971
972 2007-11-11 Wakaba <wakaba@suika.fam.cx>
973
974 * HTML.pm.src (parse_char_string): Set |inner_encoding|
975 attribute if possible.
976
977 2007-11-11 Wakaba <wakaba@suika.fam.cx>
978
979 * HTML.pm.src (parse_byte_string): New method.
980 (parse_char_string): New alias for |parse_string|.
981 (main phase): Invoking "change the encoding" algorithm if desired.
982
983 * HTML.pod: Updated.
984
985 2007-11-11 Wakaba <wakaba@suika.fam.cx>
986
987 * HTML.pod (get_inner_html): Removed.
988
989 * Makefile (HTML-all, HTML-clean): New.
990
991 2007-11-11 Wakaba <wakaba@suika.fam.cx>
992
993 * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
994
995 2007-11-08 Wakaba <wakaba@suika.fam.cx>
996
997 * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
998
999 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1000
1001 * Makefile: |CacheManifest.html| is added.
1002
1003 * CacheManifest.pod: New file.
1004
1005 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1006
1007 * CacheManifest.pm: New module.
1008
1009 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1010
1011 * HTML.pm.src: Support for application cache selection algorithm
1012 callback.
1013
1014 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1015
1016 * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1017
1018 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1019
1020 * Makefile (clean): New rule.
1021
1022 * NanoDOM.pm (public_id, system_id): New attributes.
1023
1024 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1025
1026 * Makefile (CSS-all, CSS-clean, clean): New rules.
1027
1028 2007-10-14 Wakaba <wakaba@suika.fam.cx>
1029
1030 * ContentChecker.pm (check_document): Support for
1031 new |is_xml_root| flag.
1032 (check_element): Support for new |pluses| state.
1033 (_add_pluses): New method.
1034 (_remove_minuses): Support for new |minus| item.
1035
1036 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1037
1038 * ContentChecker.pm: Raise specific error for invalid
1039 root element.
1040
1041 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1042
1043 * ContentChecker.pm: Set level values for later uses.
1044
1045 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1046
1047 * ContentChecker.pm: Support for language tag validation.
1048
1049 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1050
1051 * LangTag.pm (check_rfc3066_language_tag): New method.
1052
1053 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1054
1055 * LangTag.pm: New module.
1056
1057 2007-09-04 Wakaba <wakaba@suika.fam.cx>
1058
1059 * HTML.pm.src: Some error types were wrong.
1060
1061 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1062
1063 * CSS/: New directory.
1064
1065 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1066
1067 * ContentChecker.pm (_check_get_children): Support
1068 for |noscript| in |head|.
1069
1070 2007-08-12 Wakaba <wakaba@suika.fam.cx>
1071
1072 * URI/: New directory.
1073
1074 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1075
1076 * HTML.pm.src: Tokenizer's states are now represented in
1077 number.
1078
1079 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1080
1081 * HTML.pm.src: |or|s for insertion modes are replaced
1082 by |&|s.
1083
1084 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1085
1086 * HTML.pm.src: Token types are now represented in number.
1087
1088 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1089
1090 * ContentType.pm (SEE ALSO): Updated.
1091
1092 * HTML.pm.src: Insertion modes are now represented in number.
1093
1094 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1095
1096 * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1097 999) is implemented.
1098
1099 2007-08-08 Wakaba <wakaba@suika.fam.cx>
1100
1101 * ContentType.pm: Sniffing for |<script| (HTML5 revision
1102 983) is implemented.
1103
1104 2007-08-06 Wakaba <wakaba@suika.fam.cx>
1105
1106 * ContentChecker.pod: New documentation.
1107
1108 * Makefile: A rule for |ContentChecker.html| is added.
1109
1110 * ContentChecker.pm: A pod "LICENSE" section is added.
1111
1112 * NanoDOM.pm ($VERSION): New variable.
1113
1114 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1115
1116 * H2H.pm: |b|, |i|, and |sub| are added to the
1117 list of allowed HTML elements.
1118
1119 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1120
1121 * H2H.pm: |samp| is added to the list of allowed
1122 HTML elements.
1123
1124 * URIChecker.pm (check_iri): New.
1125 (check_iri_reference): Error type for IRI reference
1126 syntax error is changed.
1127
1128 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1129
1130 * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1131 is added.
1132 (check_document): Load appropriate module before validation.
1133
1134 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1135
1136 * ContentChecker/: New directory.
1137
1138 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1139
1140 * ContentChecker.pm: HTML |time| element is implemented.
1141
1142 * HTMLTable.pm: Comments are updated as HTML5 is revised.
1143
1144 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1145
1146 * ContentChecker.pm (check_document): Return value
1147 even if no document element is found.
1148
1149 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1150
1151 * HTML.pm.src: |$in_body| is no longer a function.
1152
1153 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1154
1155 * HTML.pm.src: The |$in_body| code has been moved down.
1156
1157 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1158
1159 * HTML.pm.src: The "trailing end" insertion mode
1160 is split into "after html body" and "after html frameset"
1161 insertion modes. Their codes are merged with "after body"
1162 and "after frameset" codes. |$previous_insertion_mode|
1163 has been removed. "after frameset" code is
1164 merged with "in frameset" code.
1165
1166 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1167
1168 * HTML.pm.src: The "before head" insertion mode is
1169 merged with the "in head" insertion mode.
1170
1171 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1172
1173 * HTML.pm.src: Cases in "in head" insertion mode are
1174 reorganized.
1175
1176 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1177
1178 * HTML.pm.src: Some cases in "in table" insertion mode
1179 are merged.
1180
1181 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1182
1183 * HTML.pm.src: The "in row" insertion mode is merged
1184 with "in table" insertion mode.
1185
1186 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1187
1188 * HTML.pm.src: The "in table" and "in table body" insertion
1189 modes are merged.
1190
1191 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1192
1193 * HTML.pm.src: There is no "in table head" or "in table foot"
1194 insertion mode!
1195
1196 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1197
1198 * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
1199 now directly invoke the handler.
1200
1201 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1202
1203 * HTML.pm.src: Codes for "in cell" insertion mode
1204 is merged to the "in body" insertion mode code.
1205
1206 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1207
1208 * HTML.pm.src: Codes for "in body" and "in caption"
1209 insertion modes are merged.
1210
1211 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1212
1213 * HTML.pm.src: Two |!!!next-token|s were missing.
1214
1215 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1216
1217 * HTML.pm.src: Use numeric constant for |{content_mode}|
1218 instead of string constant for |{content_model_flag}|.
1219
1220 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1221
1222 * HTML.pm.src: Add the name of the attribute
1223 to the "duplicate attribute" error.
1224
1225 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1226
1227 * ContentChecker.pm: Return the |class| node list.
1228
1229 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1230
1231 * ContentChecker.pm: Return the |id| node list.
1232
1233 * HTML.pm.src: A typo is fixed.
1234
1235 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1236
1237 * ContentChecker.pm: Drop wrong |level => 'error'| specification
1238 from "in HTML:xml:lang" error. Character position
1239 is now the last part of the error type in the URI error
1240 description. Report "unsupported" status for language
1241 tags, media queries, script codes, and style sheets.
1242
1243 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1244
1245 * ContentChecker.pm: Report error if |xml:lang|
1246 in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
1247 in XML.
1248
1249 * NanoDOM.pm (Attr.owner_document): New attribute.
1250
1251 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1252
1253 * HTML.pm.src: The character immediately following
1254 a bare |hcro| was discarded. Fix handling of
1255 entity references in attribute values.
1256
1257 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1258
1259 * HTML.pm.src (main and trailin end phases): Token
1260 types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
1261 factored out. Error types |in html:#DOCTYPE|
1262 and |after html:#DOCTYPE| are merged
1263 into |DOCTYPE in the middle|. |</frameset>|
1264 in fragment parsing mode changed the
1265 insertion mode.
1266
1267 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1268
1269 * HTML.pm.src: |$phase| has been removed; The |trailing end|
1270 phase is now an insertion mode. Treatments for white
1271 space character tokens were incorrect for some
1272 insertion modes. An old |meta| case was not removed.
1273
1274 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1275
1276 * HTML.pm.src: |meta| charset declaration extraction
1277 implemented (but changing the encoding is not yet:-).
1278
1279 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1280
1281 * Charset/: New directory.
1282
1283 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1284
1285 * H2H.pm: New Perl module (created from
1286 manakai's H2H.dis).
1287
1288 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1289
1290 * XMLSerializer.pm: New Perl module (created from
1291 manakai's SimpleLS.dis).
1292
1293 2007-07-07 Wakaba <wakaba@suika.fam.cx>
1294
1295 * HTML.pm.src: |<!---x-->| was not processed correctly.
1296
1297 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1298
1299 * HTML.pm.src: Report correct error message
1300 for |<body></div></body>|.
1301
1302 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1303
1304 * HTMLTable.pm: An error description was incorrect.
1305
1306 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1307
1308 * ContentChecker.pm: Return |{term}| list.
1309
1310 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1311
1312 * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
1313 nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
1314 and <title> outside of head).
1315
1316 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1317
1318 * IMTChecker.pm: Report warning for unregistered
1319 and private types/subtypes.
1320
1321 * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
1322 URIChecker.pm, HTMLTable.pm: Error messages are now
1323 consistent; they are all listed in
1324 <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
1325
1326 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1327
1328 * ContentChecker.pm: |<img ismap>| not in |<a></a>|
1329 is now erred. |<datalist>| is implemented.
1330 Attribute checker for |<command>| and |<menu>| are
1331 added. Support for |contextmenu| global attribute
1332 is added.
1333
1334 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1335
1336 * HTML.pm.src (_reset_insertion_mode): Interpretation
1337 of Step 3 has been changed.
1338
1339 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1340
1341 * HTML.pm.src: Late |<html>| parse error is implemented.
1342
1343 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1344
1345 * URIChecker.pm (check_iri_reference): A |decode| method name was
1346 incorrect.
1347
1348 * ContentChecker.pm: Support for the |footer| element.
1349 Check URI syntax for space-separated URI attributes.
1350 Support for the |tabindex| attribute. Support
1351 for |datetime| attribute.
1352
1353 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1354
1355 * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
1356 character references). HTML5 revision 1.146 (white space
1357 characters before root start tag). HTML5 revision
1358 1.148 (named character references in attribute values).
1359 HTML5 revision 1.152 (<plaintext>.innerHTML get).
1360
1361 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1362
1363 * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
1364 in <head>).
1365
1366 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1367
1368 * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>
1369 in body).
1370
1371 * ContentChecker.pm: HTML5 revision 938 (scoped="").
1372
1373 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1374
1375 * HTML.pm.src: HTML5 revision 923 (matching end tag in CDATA
1376 or RCDATA in fragment parsing mode). HTML5 revision
1377 924 (<!--> and <!--->). HTML5 revision 926 (hn in hn).
1378
1379 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1380
1381 * HTML.pm.src (get_inner_html): HTML5 revision 922 (inner_html
1382 for <pre> and <textarea>).
1383
1384 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1385
1386 * HTML.pm.src: HTML5 revision 920 (<isindex>).
1387
1388 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1389
1390 * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).
1391
1392 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1393
1394 * HTML.pm.src: HTML5 revision 916 (</body>).
1395 HTML5 revision 917 (conforming bare &).
1396
1397 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1398
1399 * NanoDOM.pm (manakai_is_html): Setting to false did not work.
1400
1401 * HTML.pm.src: HTML5 revision 914 (</ in CDATA, RCDATA).
1402 HTML5 revision 915 (<nobr>).
1403
1404 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1405
1406 * HTML.pm.src: HTML5 revisions 908, 909, 912, and 913 (quirks mode).
1407
1408 * NanoDOM.pm (manakai_is_html, manakai_compat_mode, compat_mode):
1409 New attributes.
1410
1411 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1412
1413 * HTML.pm.src: HTML5 revisions 900, 901, 902, and 911 (<
1414 in tags).
1415
1416 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1417
1418 * .cvsignore: |Entities.html| is added.
1419
1420 * HTML.pm.src: |$entity_char| is removed and
1421 requires |Whatpm::_NamedEntityList| instead.
1422 HTML5 revision 898 (refc), except that lack of refc
1423 is parse error.
1424
1425 * mkentitylist.pl: New script.
1426
1427 * Makefile (all): |_NamedEntityList.pm| is added.
1428 (_NamedEntityList.pm, Entities.html): New rules.
1429
1430 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1431
1432 * HTML.pm.src: Parse errors immediately after U+000D
1433 were ignored and U+000D immediately following another
1434 U+000D was not converted to U+000A.
1435
1436 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1437
1438 * HTML.pm.src (set_inner_html): HTML5 revision 892 (adopt
1439 nodes before appended). Parser was not ready for NULL
1440 parse error and escape flag.
1441
1442 * NanoDOM.pm (adopt_node): New.
1443
1444 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1445
1446 * HTML.pm.src: HTML5 revision 886 (insane comment in
1447 CDATA and RCDATA). Note that current implementation
1448 is simply repeating what the spec says and it is maybe not
1449 a best way to do it.
1450
1451 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1452
1453 * HTML.pm.src: HTML5 revision 884 (</form> don't close
1454 the form element if a descendant element without implied end tag has
1455 still been open).
1456
1457 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1458
1459 * ContentChecker.pm: HTML5 revision 881 (Make |id| attribute with
1460 space characters non-conforming).
1461
1462 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1463
1464 * HTML.pm.src: An error message was incorrect.
1465 HTML5 revision 869 (C1 character references).
1466
1467 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1468
1469 * HTML.pm.src: HTML5 revision 867 (a LF at the beginning of
1470 a |textarea| is removed).
1471
1472 2007-06-05 Wakaba <wakaba@suika.fam.cx>
1473
1474 * NanoDOM.pm (get_attribute_node_ns): New method.
1475
1476 * ContentChecker.pm: |script| |async| and |defer|
1477 no longer require |src|. |async| MUST NOT be
1478 specified if |defer|. (HTML5 revision 858).
1479
1480 2007-05-30 Wakaba <wakaba@suika.fam.cx>
1481
1482 * HTML.pm.src: |<form><form>| went to inifinite loop.
1483
1484 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1485
1486 * ContentChecker.pm (html): Set |is_root| (allowed
1487 as a document element) flag on.
1488 (new): Removed.
1489 (check_document): New method.
1490
1491 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1492
1493 * ContentChecker.pm (thead, tfoot): Checker specifications
1494 were incorrect.
1495
1496 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1497
1498 * ContentChecker.pm ($HTMLURIAttrChecker): Include
1499 error position in the |type| option of the error.
1500
1501 * HTMLTable.pm (form_table): The |$onerror| parameter
1502 is now optional. Some bugs are fixed.
1503
1504 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1505
1506 * HTMLTable.pm: New module.
1507
1508 * ContentChecker.pm (table): Invoke table model error checker.
1509
1510 * NanoDOM.pm (first_child, get_attribute_ns): New.
1511
1512 2007-05-26 Wakaba <wakaba@suika.fam.cx>
1513
1514 * ContentChecker.pm ($HTMLLinkTypesAttrChecker): New checker.
1515 (link/@rel, a/@rel, area/@rel): Use new checker.
1516
1517 * Makefile (_LinkTypeList.pm, RelExtensions.html): New rules.
1518
1519 * _LinkTypeList.pm: New file.
1520
1521 * mklinktypelist.pl: New file.
1522
1523 * .cvsignore: |RelExtensions.html| added.
1524
1525 * NanoDOM.pm (child_nodes): Returns an empty array
1526 for non-child-containing node types.
1527 (text_content): New attribute.
1528
1529 2007-05-26 Wakaba <wakaba@suika.fam.cx>
1530
1531 * IMTChecker.pm: New module.
1532
1533 * ContentChecker.pm ($HTMLIMTAttrChecker): Call IMTChecker
1534 to test parameter value validity.
1535
1536 * HTML.pm.src ($style_start_tag): Attributes were
1537 discarded.
1538
1539 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1540
1541 * ContentChecker.pm ($HTMLURIAttrChecker): Implemented.
1542
1543 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1544
1545 * URIChecker.pm: All recommendations from RFC 3986
1546 and RFC 3987 are listed (not all testable items are checked yet).
1547
1548 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1549
1550 * URIChecker.pm: New module.
1551
1552 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1553
1554 * ContentChecker.pm: Now most attributes are implemented
1555 or associated to some placeholder.
1556 ($ElementDefault): Warn unknown attributes
1557 for unknown elements as "attribute not supported".
1558 ($HTMLLanguageTagAttrChecker, $HTMLMQAttrChecker): New placeholders.
1559 ($HTMLUsemapAttrChecker, $HTMLTargetAttrChecker): New checkers.
1560 (|a| attribute checker): Reimplemented.
1561
1562 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1563
1564 * ContentChecker.pm ($HTMLEventHandlerAttrChecker): New placeholder.
1565 ($HTMLAttrChecker): Event handler content attributes
1566 are added.
1567 (link, embed): Required attribute is now checked.
1568 (embed): Unknown local attributes are no longer warned.
1569
1570 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1571
1572 * ContentChecker.pm ($HTMLSpaceURIsAttrChecker): New placeholder.
1573 ($HTMLIMTAttrChecker): New checker.
1574 (link@rel, link@href, link@type, style@type,
1575 a@href, a@ping, a@ping, a@type, embed@src, embed@type,
1576 object@data, object@type, source@src, source@type, area@alt,
1577 area@shape, area@coords, area@href,
1578 area@ping, area@rel, area@type, script@src,
1579 script@defer, script@async, script@type): Checkers added.
1580
1581 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1582
1583 * ContentChecker.pm: Descendant checking was incorrect.
1584
1585 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1586
1587 * ContentChecker.pm: Support |xml:*| and |xmlns:*|
1588 attributes. Report an error if |Element.prefix|
1589 is |xmlns|.
1590
1591 * NanoDOM.pm (prefix): New attribute.
1592
1593 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1594
1595 * HTML.pm.src: In |main| phase, |in body| insertion
1596 mode, action for |<iframe>| was missing.
1597
1598 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1599
1600 * ContentChecker.pm: Support for many of HTML5 elements.
1601 ($GetHTMLNonNegativeIntegerAttrChecker): New.
1602
1603 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1604
1605 * ContentChecker.pm: Support for most elements up to |progress|.
1606 ($HTMLURIAttrChecker): Placeholder.
1607 ($HTMLIntegerAttrChecker, $GetHTMLFloatingPointNumberAttrChecker):
1608 New.
1609
1610 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1611
1612 * ContentChecker.pm: Attribute checkers
1613 for global attributes, |html|, |base|, |style|, and |meta|.
1614
1615 * NanoDOM.pm (insert_before): Weaken reference
1616 to the parent node.
1617 (Attr::new): Set |owner_element| attribute.
1618 (namespace_uri, manakai_local_name): New attribute implementations.
1619 (owner_element): New attribute.
1620
1621 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1622
1623 * ContentChecker.pm ($AttrChecker, $HTMLAttrChecker,
1624 $AnyChecker->{attr_checker}, $HTMLAttrsChecker,
1625 $Element->{$HTML_NS}->{''}): New.
1626 (check_element): Invoke attrs_checker for each element.
1627
1628 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1629
1630 * ContentChecker.pm: Don't use |manakai_element_type_match|.
1631
1632 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1633
1634 * ContentChecker.pm: Use hashs rather than lists for
1635 element type testings.
1636
1637 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1638
1639 * ContentChecker.pm: Don't generate duplicate
1640 error when an element type is put in the "minus" list
1641 and the element type is not allowed explicitly in the particular
1642 element content model.
1643 (html:a checker): New checker.
1644 (html:details, html:datagrid): New checkers.
1645 (html:legend): New checker.
1646
1647 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1648
1649 * ContentChecker.pm (html:li checker): Implemented.
1650
1651 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1652
1653 * ContentChecker.pm ($HTMLInlineOrStriclyInlineChecker): New
1654 checker.
1655 (html:dd checker): New checker.
1656 (html:q, html:em, html:strong, html:small,
1657 html:m, html:dfn, html:code, html:samp, html:span): New checkers.
1658
1659 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1660
1661 * ContentChecker.pm ($AnyChecker): Renamed
1662 from |$ElementDefault->{checker}|.
1663 ($ElementDefault->{checker}): Throw an error that
1664 the element type is not supported by the checker.
1665 ($HTMLMetadataElement): |html:base| was missing.
1666 ($HTMLEmptyChecker): Don't throw an error
1667 for inter-element whitespace nodes.
1668 (html:html checker): Errors were not
1669 thrown even if |html:head| and/or |html:body|
1670 children were missing.
1671 (html:head checker): An error was not
1672 thrown if <meta charset> appered after other
1673 elements.
1674
1675 2007-05-05 Wakaba <wakaba@suika.fam.cx>
1676
1677 * ContentChecker.pm: |footer|, |video|, |audio|, |script|,
1678 and |noscript| elements are implemented.
1679 (new): New method.
1680
1681 2007-05-04 Wakaba <wakaba@suika.fam.cx>
1682
1683 * ContentChecker.pm: New module.
1684
1685 2007-05-04 Wakaba <wakaba@suika.fam.cx>
1686
1687 * NanoDOM.pm (manakai_parent_element,
1688 document_element, manakai_local_name,
1689 manakai_element_type_match): New method.
1690
1691 2007-05-03 Wakaba <wakaba@suika.fam.cx>
1692
1693 * HTML.pm.src: Replace decimal and hexadecimal numeric
1694 entities in C1 range using Windows-1252 mapping. Bare LF
1695 did not count as new line for error reporting.
1696
1697 2007-05-02 Wakaba <wakaba@suika.fam.cx>
1698
1699 * NanoDOM.pm (DOMImplementation): New class.
1700 (append_child): Weaken the |parent_node| reference.
1701 (create_element_ns, Element new): Set the |owner_document|
1702 reference.
1703 (implementation): New attribute.
1704 (owner_document, local_name, namespace_uri): New attributes.
1705
1706 * HTML.pm.src (parse_string): Line and column numbers
1707 are now provided to error handler.
1708 (!!!parse-error): Short descriptions are added.
1709 (_construct_tree): Split into three methods; support
1710 for innerHTML mode.
1711 (set_inner_html): New method.
1712
1713 2007-05-01 Wakaba <wakaba@suika.fam.cx>
1714
1715 * NanoDOM.html: Documentation is added.
1716
1717 * HTML.pod, ContentType.html: Documentation is revised.
1718
1719 * .cvsignore: Pod2html temporary files are added.
1720
1721 * Makefile: Make |NanoDOM.html|.
1722
1723 2007-05-01 Wakaba <wakaba@suika.fam.cx>
1724
1725 * HTML.pm.src (parse_string): New method.
1726 (get_inner_html): Renamed from |inner_html|.
1727
1728 * Makefile: A rule for |HTML.html| is added.
1729
1730 * HTML.pod: New documentation.
1731
1732 2007-05-01 Wakaba <wakaba@suika.fam.cx>
1733
1734 * NanoDOM.pm (last_child, previous_sibling): New attributes.
1735 (clone_node): Attribute nodes were not completely copied.
1736
1737 * HTML.pm.src: Many bugs are fixed.
1738
1739 2007-04-30 Wakaba <wakaba@suika.fam.cx>
1740
1741 * HTML.pm.src: Some typos are fixed.
1742
1743 2007-04-30 Wakaba <wakaba@suika.fam.cx>
1744
1745 * mkhtmlparser.pl, Makefile: References to the |HTML-consume-entity.src|
1746 are removed.
1747
1748 * HTML.pm.src: Tokenizer's handling on named entities are rewritten.
1749
1750 * HTML-consume-entity.src: Removed.
1751
1752 2007-04-30 Wakaba <wakaba@suika.fam.cx>
1753
1754 * HTML.pm.src: Tokenizer's handling on hexadecimal
1755 numeric entities are rewritten.
1756
1757 2007-04-30 Wakaba <wakaba@suika.fam.cx>
1758
1759 * HTML.pm.src: Some tokenizer bugs are fixed.
1760
1761 2007-04-30 Wakaba <wakaba@suika.fam.cx>
1762
1763 * HTML.pm.src: The tree construction stage is implemented.
1764
1765 * mkhtmlparser.pl: New macros are added.
1766
1767 2007-04-28 Wakaba <wakaba@suika.fam.cx>
1768
1769 * ContentType.pm: A note on bug in the specification
1770 is removed since it's been now fixed.
1771
1772 * .cvsignore: New file.
1773
1774 2007-04-28 Wakaba <wakaba@suika.fam.cx>
1775
1776 * HTML.pm.src, HTML-consume-entity.src: New files.
1777
1778 * Makefile (HTML.pm): New rule.
1779
1780 * mkhtmlparser.pl: New script.
1781
1782 2007-04-25 Wakaba <wakaba@suika.fam.cx>
1783
1784 * Makefile: New file.
1785
1786 2007-04-24 Wakaba <wakaba@suika.fam.cx>
1787
1788 * ContentType.pm: An error in pod is fixed.
1789
1790 2007-04-24 Wakaba <wakaba@suika.fam.cx>
1791
1792 * ContentType.pm: Documentation is added.
1793
1794 2007-04-24 Wakaba <wakaba@suika.fam.cx>
1795
1796 * ContentType.pm: New Perl module.
1797
1798 * ChangeLog: New file.
1799
1800

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24