/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Contents of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.290 - (show annotations) (download)
Wed Sep 10 10:46:50 2008 UTC (16 years, 10 months ago) by wakaba
Branch: MAIN
Changes since 1.289: +4 -0 lines
++ whatpm/Whatpm/ChangeLog	10 Sep 2008 10:46:48 -0000
	* HTML.pm.src: Ignore punctuations in charset names.

2008-09-10  Wakaba  <wakaba@suika.fam.cx>

1 2008-09-10 Wakaba <wakaba@suika.fam.cx>
2
3 * HTML.pm.src: Ignore punctuations in charset names.
4
5 2008-09-10 Wakaba <wakaba@suika.fam.cx>
6
7 * ContentChecker.pm: Support for charset-layer error levels.
8
9 * HTML.pm.src: Don't specify |text| argument for the
10 |chardecode:fallback| error, since it is not the encoding
11 being used alternatively.
12
13 2008-09-06 Wakaba <wakaba@suika.fam.cx>
14
15 * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
16
17 2008-08-31 Wakaba <wakaba@suika.fam.cx>
18
19 * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
20
21 2008-08-31 Wakaba <wakaba@suika.fam.cx>
22
23 * HTML.pm.src: Bug fix and sync with the spec with regard
24 to after after frameset insertion mode processing (HTML5
25 revision 1909). Note that the implementation was wrong
26 per the old spec before the r1909 changes.
27
28 2008-08-30 Wakaba <wakaba@suika.fam.cx>
29
30 * HTMLTable.pm: scope=auto algorithm fix synced with the
31 spec (HTML5 revision 2093).
32 ($process_row): Algorithm step numbers synced with the
33 spec (HTML5 revision 2092).
34
35 2008-08-30 Wakaba <wakaba@suika.fam.cx>
36
37 * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
38 revision 2094).
39
40 2008-08-30 Wakaba <wakaba@suika.fam.cx>
41
42 * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
43
44 2008-08-30 Wakaba <wakaba@suika.fam.cx>
45
46 * HTML.pm.src: '"' and "'" at the end of attribute
47 name (after another attribute) now raise parse error (HTML5
48 revision 2123). Empty unquoted attribute values are no
49 longer allowed (HTML5 revision 2122).
50
51 2008-08-30 Wakaba <wakaba@suika.fam.cx>
52
53 * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
54 revision 2130).
55
56 2008-08-30 Wakaba <wakaba@suika.fam.cx>
57
58 * ContentChecker.pm: |xml:lang| attribute value must be same
59 as |lang| attribute value for HTML elements (HTML5 revision 2062
60 and so on).
61
62 2008-08-30 Wakaba <wakaba@suika.fam.cx>
63
64 * ContentChecker.pm: Error level definition for |xml_id_error|
65 was missing.
66
67 * URIChecker.pm: The end of the URL should be marked as the
68 error location for an empty path error. The position
69 between the userinfo and the port components should be
70 marked as the error location for an empty host error.
71
72 2008-08-30 Wakaba <wakaba@suika.fam.cx>
73
74 * URIChecker.pm: Set parameters representing where in the
75 value the error occurs for errors. Report unknown
76 address format error in warning level, since address
77 formats are rarely added. Path segments starting with "/.."
78 were misinterpreted as a dot-segment.
79
80 2008-08-30 Wakaba <wakaba@suika.fam.cx>
81
82 * URIChecker.pm (check_iri_reference): Requires
83 |Message::DOM::DOMImplementation|.
84
85 2008-08-29 Wakaba <wakaba@suika.fam.cx>
86
87 * IMTChecker.pm: Updated for the new error reporting architecture.
88
89 * ContentChecker.pm: Error levels for IMTs are added.
90
91 2008-08-17 Wakaba <wakaba@suika.fam.cx>
92
93 * H2H.pm (_shift_token): Support for unquoted HTML attribute
94 values.
95
96 2008-08-16 Wakaba <wakaba@suika.fam.cx>
97
98 * CacheManifest.pm: Support for new style of error
99 reports.
100
101 * HTML.pm.src: Set line=1, column=1 to the document node.
102
103 2008-08-16 Wakaba <wakaba@suika.fam.cx>
104
105 * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
106 and URL checkers. Support for more error levels for bogus
107 langauge tag and URL "standards".
108
109 * LangTag.pm, URIChecker.pm: Support for new style error
110 level reporting.
111
112 2008-08-15 Wakaba <wakaba@suika.fam.cx>
113
114 * ContentChecker.pm: Support for RDF/XML error levels.
115
116 * HTMLTable.pm, RDFXML.pm: Support for new style of error level
117 specifying. Error types are revised.
118
119 2008-08-15 Wakaba <wakaba@suika.fam.cx>
120
121 * ContentChecker.pm: All error reporting method calls are
122 renewed.
123
124 2008-08-15 Wakaba <wakaba@suika.fam.cx>
125
126 * HTML.pm.src: All error type names and "text" parameters
127 are revised. Use new style for "level" specification.
128
129 * mkhtmlparser.pl: Use new style for "level" specification.
130
131 2008-08-03 Wakaba <wakaba@suika.fam.cx>
132
133 * WebIDL.pm (parse_char_string): Simplified error
134 reporting process for broken ignored valuetype definition.
135 (Valuetype idl_text): Support for special "DOMString" name.
136
137 2008-08-03 Wakaba <wakaba@suika.fam.cx>
138
139 * WebIDL.pm ($get_scoped_name): Append "::::" if the last
140 terminal of the ScopedName is "DOMString", such that whether
141 the last part of the scoped name is "DOMString" or "_DOMString"
142 later. It is necessary to determine whether a |typedef|
143 definition should be ignored or not.
144 (parse_char_string): Unescape the identifier of
145 exception members.
146 ($resolve): Return undef for builtin types and sequence<T>
147 types (we might not have to do this, however...).
148 (check): Support checking for Exceptions, Valuetypes,
149 and Typedefs.
150 ($serialize_type): Support for "DOMString::::" syntax.
151 (Typedef idl_text): Output Type as "DOMString" if it
152 is really "DOMString" (i.e. its internal representation
153 is "::DOMString::").
154
155 2008-08-03 Wakaba <wakaba@suika.fam.cx>
156
157 * WebIDL.pm ($resolve): New code, based on resolve code
158 for constant types in the |check| method.
159 (check): Support for checking of attributes, operations, and
160 arguments.
161 (Attribute/Operation idl_text): Exception names in getraises,
162 setraises, and raises clauses is serizlied by |$serialize_type|
163 code.
164
165 2008-08-02 Wakaba <wakaba@suika.fam.cx>
166
167 * WebIDL.pm ($integer): Order of selections are changed to match
168 hexadecimal numbers (the original pattern, taken from the spec,
169 was not work for hexadecimal numbers, because the "0" prefix
170 matches to the [0-7]* part (as an empty string) and therefore
171 it does not match with remaining "x..." part of a "0x..." integer
172 literal.
173 ($get_type): It now returns a string, not an array reference,
174 for regular types and |sequence| types (i.e. it in any case
175 returns a string).
176 ($get_next_token): The second item in the array that represents
177 a integer or float token is now a Perl number value, not the
178 original string representation of the number.
179 (check): Support for const value consistency checking.
180 No extended attribute is defined for constants.
181 (Node subclasses): Use simple strings rather than array references
182 for default data type values.
183 ($serialize_type): Type values are now simple strings.
184 (value): If the new attribute value is a false value, then
185 a FALSE value is set to the attribute.
186
187 2008-08-02 Wakaba <wakaba@suika.fam.cx>
188
189 * WebIDL.pm ($get_scoped_name): Now scoped names are stored
190 in its stringified format ("scoped name" as defined in the
191 spec). Note that future version of this module should not use
192 array references for type values and the |type_text| attribute
193 should be made obsolete.
194 (parse_char_string): Unescape attribute names.
195 (check): Support for checking of whether inherited interfaces
196 are actually defined or not. Support for checking of whether
197 interface member identifiers are duplicated or not.
198 ($serialize_type): Scoped names are returned as is. A future
199 version of this code should escape identifiers other than "DOMString",
200 otherwise the idl_text would be non-conforming.
201
202 2008-08-02 Wakaba <wakaba@suika.fam.cx>
203
204 * WebIDL.pm (parse_char_string): Set line/column numbers
205 to generated nodes. Unescape identifiers. Extended attributes
206 for Definition's were ignored.
207 (append_child): Set |parent_node| attribute.
208 (parent_node): New attribute.
209 (check): Support interface/exception members. Support
210 extended attributes. Support definition identifier uniqueness
211 constraint.
212 (qualified_name): New attribute.
213 (Interface/Exception idl_text): Extended attributes were
214 not prepended to the returned text.
215
216 2008-08-02 Wakaba <wakaba@suika.fam.cx>
217
218 * WebIDL.pm (parse_char_string): Set line/column numbers
219 to interface object experimentally. s/shift/pop/g, shift
220 would make things wrong. Support for interface forward
221 declarations was missing. Broken interface declarations
222 with no block were not ignored entirely.
223 (Whatpm::WebIDL::Node): New abstract class. This class
224 makes things easier.
225 (child_nodes): New attribute. Unlike DOM's attribute with
226 same name, this attribute returns a dead list of nodes for
227 simplicity.
228 (get_user_data, set_user_data): New methods.
229 (Module idl_text): A SPACE character should be inserted
230 before the |{| character.
231 (Interface idl_text): Support for interface forward declarations.
232 (is_forward_declaration): New attribute.
233
234 2008-07-19 Wakaba <wakaba@suika.fam.cx>
235
236 * WebIDL.pm (type_text): Better serializer.
237
238 2008-07-19 Wakaba <wakaba@suika.fam.cx>
239
240 * WebIDL.pm: Revise forward-compatible parsing so that
241 it now can handle broken extended attributes and as such.
242
243 2008-07-19 Wakaba <wakaba@suika.fam.cx>
244
245 * WebIDL.pm: Real support for extended attributes.
246 Support for extended attributes with arguments.
247
248 2008-07-19 Wakaba <wakaba@suika.fam.cx>
249
250 * WebIDL.pm: Support for |exception| syntax.
251 (Interface->idl_text): Tentative support for inheritances.
252
253 2008-07-19 Wakaba <wakaba@suika.fam.cx>
254
255 * WebIDL.pm: Hierarchical scoped name support was broken.
256 Support for raises, setraises, and getraises syntaxes.
257
258 2008-07-18 Wakaba <wakaba@suika.fam.cx>
259
260 * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
261 proper support for types, extended attributes, and exceptions yet).
262 WebIDL parser, version 1 (no support for exceptions yet,
263 no proper support for extended attributes yet).
264
265 2008-07-09 Wakaba <wakaba@suika.fam.cx>
266
267 * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
268
269 2008-06-29 Wakaba <wakaba@suika.fam.cx>
270
271 * WebIDL.pm: Support for valuetype and const.
272
273 2008-06-29 Wakaba <wakaba@suika.fam.cx>
274
275 * WebIDL.pm: New module.
276
277 2008-06-15 Wakaba <wakaba@suika.fam.cx>
278
279 * Makefile (Entities.html): URI changed.
280
281 2008-06-08 Wakaba <wakaba@suika.fam.cx>
282
283 * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
284
285 2008-06-01 Wakaba <wakaba@suika.fam.cx>
286
287 * HTML.pm.src (_get_next_token): A parse error was missing.
288
289 2008-06-01 Wakaba <wakaba@suika.fam.cx>
290
291 * mklinktypelist.pl: rel=contact is no longer part of the HTML5
292 spec (commented out). (HTML5 revision 1711).
293
294 2008-05-25 Wakaba <wakaba@suika.fam.cx>
295
296 * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
297
298 * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
299 as UTF-16 (HTML5 revision 1701).
300
301 2008-05-25 Wakaba <wakaba@suika.fam.cx>
302
303 * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
304 1692).
305
306 2008-05-25 Wakaba <wakaba@suika.fam.cx>
307
308 * HTML.pm.src: The secondary insertion mode used when switching
309 to foreign content is the "in body" insertion mode (HTML5 revision
310 1696).
311
312 2008-05-25 Wakaba <wakaba@suika.fam.cx>
313
314 * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
315 revision 1697).
316
317 2008-05-24 Wakaba <wakaba@suika.fam.cx>
318
319 * HTML.pm.src: Support for end-of-file token in foreign content
320 insertion mode (HTML5 revision 1693). Update SVG camelCase
321 attribute list (HTML5 revision 1700). <textarea> closes
322 </select> (HTML5 revision 1699). More start tags close in
323 foreign content insertion mode (HTML5 revision 1698).
324
325 2008-05-24 Wakaba <wakaba@suika.fam.cx>
326
327 * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
328
329 2008-05-24 Wakaba <wakaba@suika.fam.cx>
330
331 * HTML.pm.src: More robust charset parameter detection (HTML5
332 revision 1674).
333
334 2008-05-24 Wakaba <wakaba@suika.fam.cx>
335
336 * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
337 revision 1676).
338
339 2008-05-24 Wakaba <wakaba@suika.fam.cx>
340
341 * HTML.pm.src: Ignore language part of public identifiers for
342 quriks mode detection (HTML5 revision 1679).
343
344 2008-05-24 Wakaba <wakaba@suika.fam.cx>
345
346 * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
347 revision 1685).
348
349 2008-05-24 Wakaba <wakaba@suika.fam.cx>
350
351 * HTML.pm.src: Support for EOF in new states for tags (HTML5
352 revision 1684).
353
354 2008-05-24 Wakaba <wakaba@suika.fam.cx>
355
356 * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
357 work (HTML5 revision 1690).
358
359 2008-05-24 Wakaba <wakaba@suika.fam.cx>
360
361 * HTML.pm.src (_tree_construction_main): Change handling of
362 end tags in head insertion modes (HTML5 revision 1686).
363 (parse_char_string): Bug fix for non-utf8 character string handlings.
364 (parse_char_stream): |ungetc| does not work well for this context.
365
366 2008-05-18 Wakaba <wakaba@suika.fam.cx>
367
368 * HTML.pm.src (parse_byte_string): Redefined to invoke
369 |parse_byte_stream|.
370 (parse_byte_stream): New method.
371
372 2008-05-18 Wakaba <wakaba@suika.fam.cx>
373
374 * HTML.pm.src (parse_byte_string): Fix the column number reported
375 by encoding layer error reporter.
376
377 2008-05-17 Wakaba <wakaba@suika.fam.cx>
378
379 * HTML.pm.src (parse_byte_string): Use streaming decoder
380 rather than converting the whole byte string and then parsing.
381 Propagate errors in character encoding layer.
382 (get_next_token): Precise error reporting for |bare stago| error.
383
384 2008-05-17 Wakaba <wakaba@suika.fam.cx>
385
386 * HTML.pm.src (parse_char_stream): New method.
387 (parse_char_string): This method is now defined as an invocation
388 of the |parse_char_stream| method.
389
390 2008-05-17 Wakaba <wakaba@suika.fam.cx>
391
392 * HTML.pm.src (parse_byte_string): Report various status
393 of the sniffing as info-level errors. Support for new
394 decoding framework in parser resestting.
395 (new): Various default error levels were not set.
396
397 2008-05-17 Wakaba <wakaba@suika.fam.cx>
398
399 * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
400 algorithm, except for the actual sniffing, is implemented
401 with new framework with Message::Charset::Info.
402
403 2008-05-16 Wakaba <wakaba@suika.fam.cx>
404
405 * CacheManifest.pm (_parse): Drop fragment identifiers from
406 URIs in fallback section (HTML5 revision 1596).
407
408 2008-05-10 Wakaba <wakaba@suika.fam.cx>
409
410 * Makefile (Entities.html): URI has changed.
411
412 2008-05-10 Wakaba <wakaba@suika.fam.cx>
413
414 * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
415 1553).
416
417 2008-05-06 Wakaba <wakaba@suika.fam.cx>
418
419 * ContentChecker.pm: Noted that those returned in |table| are
420 no longer table elements, but table objects returned
421 by Whatpm::HTMLTable.
422
423 * HTMLTable.pm (form_table): Return table element node
424 as |$table->{element}|.
425 (assign_header): Support for the |headers=""| attribute.
426
427 2008-05-06 Wakaba <wakaba@suika.fam.cx>
428
429 * HTMLTable.pm (assign_header): New function; first version
430 with no support for headers="".
431 (form_table): Include table width and height to the returned
432 table object for covenience. Indexing in column assignement was wrong.
433 Set whether a data cell is empty or not for convenience.
434
435 2008-05-05 Wakaba <wakaba@suika.fam.cx>
436
437 * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
438
439 2008-05-05 Wakaba <wakaba@suika.fam.cx>
440
441 * HTMLTable.pm: How table model errors are detected is
442 changed (HTML5 revision 1387).
443
444 2008-05-05 Wakaba <wakaba@suika.fam.cx>
445
446 * HTMLTable.pm: The algorithm now moves |tfoot| elements
447 to the end of the table (HTML5 revision 1380).
448
449 2008-05-05 Wakaba <wakaba@suika.fam.cx>
450
451 * HTMLTable.pm: The algorithm is now 0-based indexing, instead
452 of 1-based (HTML5 revision 1376).
453
454 2008-05-05 Wakaba <wakaba@suika.fam.cx>
455
456 * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
457 and "Content-Encoding" no longer prevent sniffing (HTML5
458 revision 1288).
459
460 2008-05-05 Wakaba <wakaba@suika.fam.cx>
461
462 * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
463 1282).
464
465 2008-05-03 Wakaba <wakaba@suika.fam.cx>
466
467 * ContentChecker.pm: Support for global attributes.
468 Status of XML specs are added.
469
470 2008-05-03 Wakaba <wakaba@suika.fam.cx>
471
472 * ContentChecker.pm (check_element): Support for |template=""|
473 and |ref=""| attribute (referent element type checking).
474
475 2008-04-29 Wakaba <wakaba@suika.fam.cx>
476
477 * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
478 1500) is implemented (except for IDNA part and URI-scheme-specific
479 knowledge). Line number counting was wrong for LF-only
480 documents.
481
482 2008-04-13 Wakaba <wakaba@suika.fam.cx>
483
484 * HTML.pm.src: Raise an parse error for any disallowed
485 character (HTML5 revision 1263).
486
487 2008-04-13 Wakaba <wakaba@suika.fam.cx>
488
489 * mkentitylist.pl: Support for new HTML5 entity table format (the
490 definition for |AElig;| was missing).
491
492 2008-04-13 Wakaba <wakaba@suika.fam.cx>
493
494 * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
495 name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
496
497 2008-04-13 Wakaba <wakaba@suika.fam.cx>
498
499 * HTML.pm.src: List of element names that close foreign content
500 insertion mode is added (HTML5 revisions 1412 and 1418).
501
502 2008-04-13 Wakaba <wakaba@suika.fam.cx>
503
504 * HTML.pm.src: Support for |mglyph| and |malignmark|
505 elements (HTML5 revision 1410).
506
507 2008-04-13 Wakaba <wakaba@suika.fam.cx>
508
509 * HTML.pm.src: Support for new long MathML entities (HTML5
510 revision 1406).
511
512 2008-04-13 Wakaba <wakaba@suika.fam.cx>
513
514 * HTML.pm.src: CDATA section support for MathML and SVG
515 elements (HTML5 revision 1404 and 1420).
516
517 2008-04-12 Wakaba <wakaba@suika.fam.cx>
518
519 * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
520 elements (HTML5 revision 1404). Unused !!!macro definitions
521 are removed.
522
523 2008-04-12 Wakaba <wakaba@suika.fam.cx>
524
525 * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
526 are raised is changed (HTML5 revision 1404).
527
528 2008-04-06 Wakaba <wakaba@suika.fam.cx>
529
530 * HTML.pm.src: Category sets that are no longer used are removed.
531
532 2008-04-06 Wakaba <wakaba@suika.fam.cx>
533
534 * HTML.pm.src: The ->[1] property of stack entries are now
535 replaced by constants representing element category.
536
537 2008-04-06 Wakaba <wakaba@suika.fam.cx>
538
539 * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
540 for error reporting. (This is a preparation for using constant
541 value for ->[1].)
542
543 2008-03-22 Wakaba <wakaba@suika.fam.cx>
544
545 * RDFXML.pm: Typo fixed.
546
547 2008-03-22 Wakaba <wakaba@suika.fam.cx>
548
549 * ContentChecker.pm: |fact_level| is now treated
550 as same as |must_level|, i.e. level = |m|.
551 (check_element): Make list of URIs in the DOM.
552
553 2008-03-21 Wakaba <wakaba@suika.fam.cx>
554
555 * RDFXML.pm: Language accessor implemented.
556 Local (null-namespace) attribute support.
557
558 2008-03-21 Wakaba <wakaba@suika.fam.cx>
559
560 * RDFXML.pm: Factored out ID checking code.
561
562 2008-03-21 Wakaba <wakaba@suika.fam.cx>
563
564 * RDFXML.pm: TODO items noted. Validation of ID
565 and URI attributes is implemented. Warn if unknown
566 value is used in rdf:parseType="" attribute.
567
568 * URIChecker.pm (check_rdf_uri_reference): New function.
569
570 2008-03-21 Wakaba <wakaba@suika.fam.cx>
571
572 * RDFXML.pm: bnodeid implemented. Relative references
573 are now resolved.
574
575 2008-03-21 Wakaba <wakaba@suika.fam.cx>
576
577 * ContentChecker.pm: RDF reification implemented.
578
579 * RDFXML.pm: undef vs false bug fixed.
580 Reification implemented.
581
582 2008-03-21 Wakaba <wakaba@suika.fam.cx>
583
584 * RDFXML.pm: s/id/ID/ for attribute name.
585 The |node| arguments are added for |ontriple| calls.
586 Too many "attribute not allowed" errors were raised.
587
588 * ContentChecker.pm: Initial experimental support for rdf:RDF
589 element.
590
591 2008-03-21 Wakaba <wakaba@suika.fam.cx>
592
593 * RDFXML.pm: New module.
594
595 2008-03-20 Wakaba <wakaba@suika.fam.cx>
596
597 * HTML.pm.src (set_inner_html): Line/column number
598 code was old one yet.
599
600 2008-03-20 Wakaba <wakaba@suika.fam.cx>
601
602 * HTML.pm.src: Better line/column reporting for "duplicate attribute"
603 errors. Line/column markings for DOCTYPE, comment, and
604 character tokens are reintroduced; otherwise,
605 error location for "not HTML5" error and errors
606 for implied elements are not attached.
607
608 2008-03-20 Wakaba <wakaba@suika.fam.cx>
609
610 * HTML.pm.src: Set line/column numbers to attributes.
611
612 * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
613 (value): Setter implemented.
614
615 * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
616
617 2008-03-20 Wakaba <wakaba@suika.fam.cx>
618
619 * HTML.pm.src: Unused line/column markings are removed.
620
621 2008-03-20 Wakaba <wakaba@suika.fam.cx>
622
623 * HTML.pm.src (_get_next_token): Remove |first_start_tag|
624 flag, which is no longer used.
625
626 2008-03-17 Wakaba <wakaba@suika.fam.cx>
627
628 * HTML.pm.src: Set line/column information to element nodes.
629
630 * mkhtmlparser.pl (!!!create-element, !!!insert-element,
631 and !!!insert-element-t): Set line/column information to
632 element nodes.
633
634 2008-03-17 Wakaba <wakaba@suika.fam.cx>
635
636 * HTML.pm.src (_get_next_token): The first "<" character
637 in "<?", "<>", or "</>" should be the error point.
638
639 2008-03-16 Wakaba <wakaba@suika.fam.cx>
640
641 * HTML.pm.src: Some more fixes on error position reporting.
642
643 2008-03-16 Wakaba <wakaba@suika.fam.cx>
644
645 * HTML.pm.src: Token-level precious error reporting.
646
647 2008-03-16 Wakaba <wakaba@suika.fam.cx>
648
649 * HTML.pm.src: Preparation for more precious error point
650 reporting.
651
652 2008-03-11 Wakaba <wakaba@suika.fam.cx>
653
654 * HTML.pm.src: Error type revised.
655
656 2008-03-11 Wakaba <wakaba@suika.fam.cx>
657
658 * HTML.pm.src: Similar codes are merged together, again.
659
660 2008-03-11 Wakaba <wakaba@suika.fam.cx>
661
662 * HTML.pm.src: Similar codes are merged together.
663
664 2008-03-10 Wakaba <wakaba@suika.fam.cx>
665
666 * mkhtmlparser.pl: Set "level" parameter to parse errors.
667
668 * HTML.pm.src: Code refined.
669
670 2008-03-09 Wakaba <wakaba@suika.fam.cx>
671
672 * HTML.pm.src: |</body>| treatement has been changed (HTML5
673 revision 1348). Note that I really don't know this makes
674 any difference in the black-box behavior of the parser.
675
676 2008-03-09 Wakaba <wakaba@suika.fam.cx>
677
678 * HTML.pm.src: New end-of-file token implementation (HTML5
679 revision 1348).
680
681 2008-03-09 Wakaba <wakaba@suika.fam.cx>
682
683 * HTML.pm.src: |applet| support (HTML5 revision 1347).
684
685 2008-03-09 Wakaba <wakaba@suika.fam.cx>
686
687 * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
688
689 2008-03-09 Wakaba <wakaba@suika.fam.cx>
690
691 * HTML.pm.src: Support for |<input>| in the "in select" insertion
692 mode and support for the "in select in table" insertion mode (HTML5
693 revision 1342).
694
695 2008-03-09 Wakaba <wakaba@suika.fam.cx>
696
697 * HTML.pm.src: No foster parenting for <script> and <script>
698 in non-tainted <table>s (HTML5 revision 1336).
699
700 2008-03-09 Wakaba <wakaba@suika.fam.cx>
701
702 * HTML.pm.src: Ignore white space characters between <html>
703 and <head> (HTML5 revision 1332).
704
705 2008-03-09 Wakaba <wakaba@suika.fam.cx>
706
707 * HTML.pm.src: Treat <input type=hidden> as if it were a
708 white space (HTML5 revision 1331).
709
710 2008-03-08 Wakaba <wakaba@suika.fam.cx>
711
712 * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
713 element (HTML5 revision 1330).
714
715 2008-03-08 Wakaba <wakaba@suika.fam.cx>
716
717 * HTML.pm.src: <title> is always appended to the current
718 element (HTML5 revision 1328).
719
720 2008-03-08 Wakaba <wakaba@suika.fam.cx>
721
722 * HTML.pm.src: White space in tainted tables are moved
723 into foster parents (HTML5 revision 1326).
724
725 2008-03-08 Wakaba <wakaba@suika.fam.cx>
726
727 * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
728 revision 1321).
729
730 2008-03-08 Wakaba <wakaba@suika.fam.cx>
731
732 * HTML.pm.src: |</h/n/>| case code rearranged to align with
733 the spec (HTML5 revision 1320). Note that we finally complete
734 all of HTML5 revision 1320 changes.
735
736 2008-03-08 Wakaba <wakaba@suika.fam.cx>
737
738 * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
739 tags (HTML5 revision 1320).
740
741 2008-03-08 Wakaba <wakaba@suika.fam.cx>
742
743 * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
744
745 2008-03-08 Wakaba <wakaba@suika.fam.cx>
746
747 * HTML.pm.src: A "generate implied end tags" code (t409.1)
748 could not be reached so that it is now removed (HTML5 revision
749 1320).
750
751 2008-03-08 Wakaba <wakaba@suika.fam.cx>
752
753 * HTML.pm.src: Code for the case of |</div>| and so on are revised
754 to align with new spec text (HTML5 revision 1320).
755
756 2008-03-08 Wakaba <wakaba@suika.fam.cx>
757
758 * HTML.pm.src: Remove strange |if| condition; however, it should
759 have had no harm in theory.
760
761 2008-03-08 Wakaba <wakaba@suika.fam.cx>
762
763 * HTML.pm.src (_tree_construction_main): '</p>' in body
764 case is split from other end tags for the preparation
765 of implementing HTML revision 1320.
766
767 2008-03-07 Wakaba <wakaba@suika.fam.cx>
768
769 * HTML.pm.src: Simplified "generate implied end tag" (HTML5
770 revision 1320).
771
772 2008-03-07 Wakaba <wakaba@suika.fam.cx>
773
774 * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
775 and "div" (HTML5 revision 1318). Add comments to where
776 |form| pointer association codes should be inserted (HTML5 revision
777 1319).
778
779 2008-03-06 Wakaba <wakaba@suika.fam.cx>
780
781 * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
782
783 2008-03-05 Wakaba <wakaba@suika.fam.cx>
784
785 * HTML.pm.src: Since the case t268 should never be reached (no
786 other token type, there are), it is replaced by a |die| statement.
787
788 2008-03-05 Wakaba <wakaba@suika.fam.cx>
789
790 * HTML.pm.src: Typo fixed.
791
792 2008-03-04 Wakaba <wakaba@suika.fam.cx>
793
794 * HTML.pm.src (_tree_construction_initial): Some limited quirks
795 doctypes were not uppercased for comparison.
796
797 2008-03-03 Wakaba <wakaba@suika.fam.cx>
798
799 * HTML.pm.src (tree construction and set_inner_html): Checkpoints
800 are added.
801
802 2008-03-03 Wakaba <wakaba@suika.fam.cx>
803
804 * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
805 are set. Cases that are unlikely reached are noted as so.
806
807 2008-03-03 Wakaba <wakaba@suika.fam.cx>
808
809 * HTML.pm.src: Checkpoints for debugging are added.
810
811 * mkhtmlparser.pl: Support for |!!!cp| syntax.
812
813 2008-03-03 Wakaba <wakaba@suika.fam.cx>
814
815 * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
816 for simplicity.
817
818 2008-03-03 Wakaba <wakaba@suika.fam.cx>
819
820 * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
821 revision 1307).
822
823 2008-03-03 Wakaba <wakaba@suika.fam.cx>
824
825 * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
826 is set are changed (HTML5 revision 1305).
827
828 2008-03-02 Wakaba <wakaba@suika.fam.cx>
829
830 * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
831 1292). Entities are not parsed in comment-like part in RCDATA
832 elements (HTML5 revision 1294). Allow bare & at the end
833 of attribute value literals (HTML5 revision 1296). More
834 quirks mode doctypes (HTML5 revision 1302). Requires spaces
835 between attributes and ban attribute names or unquoted
836 attribute values containing single or double quotes (HTML5
837 revision 1303).
838
839 2008-03-02 Wakaba <wakaba@suika.fam.cx>
840
841 * ContentChecker.pm: Typo fixed. Don't raise "character encoding"
842 and related errors unless it is an HTML document (though the spec
843 is unclear on whether it is applied to XHTML document).
844
845 * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
846 are added.
847
848 2008-03-02 Wakaba <wakaba@suika.fam.cx>
849
850 * HTML.pm.src: s/local_name/manakai_local_name/g.
851
852 2008-03-01 Wakaba <wakaba@suika.fam.cx>
853
854 * _NamedEntityList.pm: Updated (HTML5 revision 1286).
855
856 * HTML.pm.src: |charset| in |content| attribute is
857 case-insensitive (HTML5 revision 1270).
858
859 2008-02-26 Wakaba <wakaba@suika.fam.cx>
860
861 * ContentChecker.pm: New status constants are added.
862 ($ElementDefault): |status| added.
863 (check_element): Err for non-standard or deprecated elements.
864 (_attr_status_info): For non-standard or deprecated attributes.
865
866 2008-02-24 Wakaba <wakaba@suika.fam.cx>
867
868 * ContentChecker.pm (_attr_status_info): New internal method.
869
870 2008-02-24 Wakaba <wakaba@suika.fam.cx>
871
872 * ContentChecker.pm (check_element): Element standardized
873 status information is now dispatched.
874
875 2008-02-24 Wakaba <wakaba@suika.fam.cx>
876
877 * ContentChecker.pm (check_element): Fix |del|-and-significant
878 problem by adding some more arguments.
879
880 2008-02-24 Wakaba <wakaba@suika.fam.cx>
881
882 * ContentChecker.pm (check_element): Use context of
883 container-for-the-purpose-of-content-model element (not transparent
884 element) for |check_child_element| calling and significant
885 text flag marking. This reintroduces |<del>|-and-significant
886 problem again.
887
888 2008-02-24 Wakaba <wakaba@suika.fam.cx>
889
890 * ContentChecker.pm (check_element): Make semi-transparent
891 elements ignored for the purpose of phase changes in
892 content model checking.
893
894 2008-02-23 Wakaba <wakaba@suika.fam.cx>
895
896 * ContentChecker.pm (check_element): In-element state
897 was not properly managed for transparent cases.
898
899 2008-02-23 Wakaba <wakaba@suika.fam.cx>
900
901 * ContentChecker.pm (check_element): Support for |video|
902 and |audio| as semi-transparent elements.
903
904 2008-02-23 Wakaba <wakaba@suika.fam.cx>
905
906 * ContentChecker.pm ($HTMLSemiTransparentElements): New.
907 (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
908 Support for |html:object| as a semi-transparent element.
909
910 2008-02-23 Wakaba <wakaba@suika.fam.cx>
911
912 * ContentChecker.pm (check_element): The way to traverse
913 the tree is entirely revised to make it easier to track
914 the state of ancestors/descendants. As a result of this
915 revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
916 support for content model checking for HTML elements |figure|,
917 |object|, |video|, and |audio| and checking for XML elements (and
918 some XMLNS checkings) are dropped for now. They will be
919 reimplemented in due cource.
920
921 2008-02-17 Wakaba <wakaba@suika.fam.cx>
922
923 * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
924 token (HTML5 revision 1225).
925
926 2008-02-17 Wakaba <wakaba@suika.fam.cx>
927
928 * ContentChecker.pm ({unsupported_level}): New value.
929
930 * HTML.pm.src: Save whether |meta| |content| attribute
931 contains character references or not.
932
933 2008-02-17 Wakaba <wakaba@suika.fam.cx>
934
935 * ContentChecker.pm (_get_children): (Incompleted) attempt to
936 imlement significant content checking for contents
937 with |del| elements.
938
939 2008-02-17 Wakaba <wakaba@suika.fam.cx>
940
941 * ContenteChecker.pm ($HTMLTransparentElements): More
942 elements are added.
943 (_get_children): HTML |object| elements are now semi-transparent.
944
945 * NanoDOM.pm (manakai_html, manakai_head): New methods.
946
947 2008-02-16 Wakaba <wakaba@suika.fam.cx>
948
949 * CacheManifest.pm: HTML5 revision 1211 implemented.
950
951 * CacheManifest.pod: Updated.
952
953 2008-02-10 Wakaba <wakaba@suika.fam.cx>
954
955 * ContentChecker.pm (check_document, check_element): Support
956 for second argument ($onsubdoc).
957 (_get_css_parser): Removed (now it is part of WDCC).
958
959 2008-02-09 Wakaba <wakaba@suika.fam.cx>
960
961 * ContentChecker.pm (_get_css_parser): New.
962
963 2007-11-25 Wakaba <wakaba@suika.fam.cx>
964
965 * ContentChecker.pm ($AnyChecker): Old way to add child elements
966 for checking had been used.
967
968 2007-11-25 Wakaba <wakaba@suika.fam.cx>
969
970 * ContentChecker.pm (check_element): New todo item type |descendant|.
971
972 2007-11-23 Wakaba <wakaba@suika.fam.cx>
973
974 * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
975 specced in RFC 2046 and RFC 4288.
976 (application/atom+xml): Definition added.
977
978 2007-11-23 Wakaba <wakaba@suika.fam.cx>
979
980 * URIChecker.pm: Make RFC 3986 should-level errors
981 warnings (rather than SHOULD-level errors).
982
983 2007-11-23 Wakaba <wakaba@suika.fam.cx>
984
985 * NanoDOM.pm (get_user_data, set_user_data): New methods.
986
987 * HTML.pm.src: A flag for character references in attribute
988 values are added. Set |manakai_has_reference| user data
989 to |charset| attribute.
990
991 2007-11-23 Wakaba <wakaba@suika.fam.cx>
992
993 * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
994 attributes.
995
996 * ContentChecker.pm (check_document): Warn if charset requirements
997 cannot be tested.
998
999 2007-11-19 Wakaba <wakaba@suika.fam.cx>
1000
1001 * HTML.pm.src (parse_byte_string): Detect charset
1002 by universalchardet if charset parameter is not specified.
1003
1004 * Makefile (Charset-all, Charset-clean): New rules.
1005
1006 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1007
1008 * ContentChecker.pm (check_document): Check the existence
1009 of character encoding declaration and selection of encoding
1010 for HTML document.
1011
1012 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1013
1014 * ContentType.pm (get_sniffed_type): Return also the official
1015 type in list context.
1016
1017 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1018
1019 * ContentType.pm: Sniffing with leading white space
1020 ignoring (HTML5 revisions 1013 and 1016).
1021
1022 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1023
1024 * ContentType.pm: HTML5 revision 1013 changes, except for leading
1025 white spaces, are implemented.
1026
1027 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1028
1029 * HTML.pm.src (parse_char_string): Set |inner_encoding|
1030 attribute if possible.
1031
1032 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1033
1034 * HTML.pm.src (parse_byte_string): New method.
1035 (parse_char_string): New alias for |parse_string|.
1036 (main phase): Invoking "change the encoding" algorithm if desired.
1037
1038 * HTML.pod: Updated.
1039
1040 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1041
1042 * HTML.pod (get_inner_html): Removed.
1043
1044 * Makefile (HTML-all, HTML-clean): New.
1045
1046 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1047
1048 * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1049
1050 2007-11-08 Wakaba <wakaba@suika.fam.cx>
1051
1052 * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1053
1054 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1055
1056 * Makefile: |CacheManifest.html| is added.
1057
1058 * CacheManifest.pod: New file.
1059
1060 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1061
1062 * CacheManifest.pm: New module.
1063
1064 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1065
1066 * HTML.pm.src: Support for application cache selection algorithm
1067 callback.
1068
1069 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1070
1071 * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1072
1073 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1074
1075 * Makefile (clean): New rule.
1076
1077 * NanoDOM.pm (public_id, system_id): New attributes.
1078
1079 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1080
1081 * Makefile (CSS-all, CSS-clean, clean): New rules.
1082
1083 2007-10-14 Wakaba <wakaba@suika.fam.cx>
1084
1085 * ContentChecker.pm (check_document): Support for
1086 new |is_xml_root| flag.
1087 (check_element): Support for new |pluses| state.
1088 (_add_pluses): New method.
1089 (_remove_minuses): Support for new |minus| item.
1090
1091 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1092
1093 * ContentChecker.pm: Raise specific error for invalid
1094 root element.
1095
1096 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1097
1098 * ContentChecker.pm: Set level values for later uses.
1099
1100 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1101
1102 * ContentChecker.pm: Support for language tag validation.
1103
1104 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1105
1106 * LangTag.pm (check_rfc3066_language_tag): New method.
1107
1108 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1109
1110 * LangTag.pm: New module.
1111
1112 2007-09-04 Wakaba <wakaba@suika.fam.cx>
1113
1114 * HTML.pm.src: Some error types were wrong.
1115
1116 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1117
1118 * CSS/: New directory.
1119
1120 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1121
1122 * ContentChecker.pm (_check_get_children): Support
1123 for |noscript| in |head|.
1124
1125 2007-08-12 Wakaba <wakaba@suika.fam.cx>
1126
1127 * URI/: New directory.
1128
1129 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1130
1131 * HTML.pm.src: Tokenizer's states are now represented in
1132 number.
1133
1134 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1135
1136 * HTML.pm.src: |or|s for insertion modes are replaced
1137 by |&|s.
1138
1139 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1140
1141 * HTML.pm.src: Token types are now represented in number.
1142
1143 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1144
1145 * ContentType.pm (SEE ALSO): Updated.
1146
1147 * HTML.pm.src: Insertion modes are now represented in number.
1148
1149 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1150
1151 * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1152 999) is implemented.
1153
1154 2007-08-08 Wakaba <wakaba@suika.fam.cx>
1155
1156 * ContentType.pm: Sniffing for |<script| (HTML5 revision
1157 983) is implemented.
1158
1159 2007-08-06 Wakaba <wakaba@suika.fam.cx>
1160
1161 * ContentChecker.pod: New documentation.
1162
1163 * Makefile: A rule for |ContentChecker.html| is added.
1164
1165 * ContentChecker.pm: A pod "LICENSE" section is added.
1166
1167 * NanoDOM.pm ($VERSION): New variable.
1168
1169 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1170
1171 * H2H.pm: |b|, |i|, and |sub| are added to the
1172 list of allowed HTML elements.
1173
1174 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1175
1176 * H2H.pm: |samp| is added to the list of allowed
1177 HTML elements.
1178
1179 * URIChecker.pm (check_iri): New.
1180 (check_iri_reference): Error type for IRI reference
1181 syntax error is changed.
1182
1183 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1184
1185 * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1186 is added.
1187 (check_document): Load appropriate module before validation.
1188
1189 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1190
1191 * ContentChecker/: New directory.
1192
1193 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1194
1195 * ContentChecker.pm: HTML |time| element is implemented.
1196
1197 * HTMLTable.pm: Comments are updated as HTML5 is revised.
1198
1199 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1200
1201 * ContentChecker.pm (check_document): Return value
1202 even if no document element is found.
1203
1204 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1205
1206 * HTML.pm.src: |$in_body| is no longer a function.
1207
1208 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1209
1210 * HTML.pm.src: The |$in_body| code has been moved down.
1211
1212 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1213
1214 * HTML.pm.src: The "trailing end" insertion mode
1215 is split into "after html body" and "after html frameset"
1216 insertion modes. Their codes are merged with "after body"
1217 and "after frameset" codes. |$previous_insertion_mode|
1218 has been removed. "after frameset" code is
1219 merged with "in frameset" code.
1220
1221 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1222
1223 * HTML.pm.src: The "before head" insertion mode is
1224 merged with the "in head" insertion mode.
1225
1226 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1227
1228 * HTML.pm.src: Cases in "in head" insertion mode are
1229 reorganized.
1230
1231 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1232
1233 * HTML.pm.src: Some cases in "in table" insertion mode
1234 are merged.
1235
1236 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1237
1238 * HTML.pm.src: The "in row" insertion mode is merged
1239 with "in table" insertion mode.
1240
1241 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1242
1243 * HTML.pm.src: The "in table" and "in table body" insertion
1244 modes are merged.
1245
1246 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1247
1248 * HTML.pm.src: There is no "in table head" or "in table foot"
1249 insertion mode!
1250
1251 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1252
1253 * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
1254 now directly invoke the handler.
1255
1256 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1257
1258 * HTML.pm.src: Codes for "in cell" insertion mode
1259 is merged to the "in body" insertion mode code.
1260
1261 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1262
1263 * HTML.pm.src: Codes for "in body" and "in caption"
1264 insertion modes are merged.
1265
1266 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1267
1268 * HTML.pm.src: Two |!!!next-token|s were missing.
1269
1270 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1271
1272 * HTML.pm.src: Use numeric constant for |{content_mode}|
1273 instead of string constant for |{content_model_flag}|.
1274
1275 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1276
1277 * HTML.pm.src: Add the name of the attribute
1278 to the "duplicate attribute" error.
1279
1280 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1281
1282 * ContentChecker.pm: Return the |class| node list.
1283
1284 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1285
1286 * ContentChecker.pm: Return the |id| node list.
1287
1288 * HTML.pm.src: A typo is fixed.
1289
1290 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1291
1292 * ContentChecker.pm: Drop wrong |level => 'error'| specification
1293 from "in HTML:xml:lang" error. Character position
1294 is now the last part of the error type in the URI error
1295 description. Report "unsupported" status for language
1296 tags, media queries, script codes, and style sheets.
1297
1298 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1299
1300 * ContentChecker.pm: Report error if |xml:lang|
1301 in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
1302 in XML.
1303
1304 * NanoDOM.pm (Attr.owner_document): New attribute.
1305
1306 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1307
1308 * HTML.pm.src: The character immediately following
1309 a bare |hcro| was discarded. Fix handling of
1310 entity references in attribute values.
1311
1312 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1313
1314 * HTML.pm.src (main and trailin end phases): Token
1315 types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
1316 factored out. Error types |in html:#DOCTYPE|
1317 and |after html:#DOCTYPE| are merged
1318 into |DOCTYPE in the middle|. |</frameset>|
1319 in fragment parsing mode changed the
1320 insertion mode.
1321
1322 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1323
1324 * HTML.pm.src: |$phase| has been removed; The |trailing end|
1325 phase is now an insertion mode. Treatments for white
1326 space character tokens were incorrect for some
1327 insertion modes. An old |meta| case was not removed.
1328
1329 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1330
1331 * HTML.pm.src: |meta| charset declaration extraction
1332 implemented (but changing the encoding is not yet:-).
1333
1334 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1335
1336 * Charset/: New directory.
1337
1338 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1339
1340 * H2H.pm: New Perl module (created from
1341 manakai's H2H.dis).
1342
1343 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1344
1345 * XMLSerializer.pm: New Perl module (created from
1346 manakai's SimpleLS.dis).
1347
1348 2007-07-07 Wakaba <wakaba@suika.fam.cx>
1349
1350 * HTML.pm.src: |<!---x-->| was not processed correctly.
1351
1352 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1353
1354 * HTML.pm.src: Report correct error message
1355 for |<body></div></body>|.
1356
1357 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1358
1359 * HTMLTable.pm: An error description was incorrect.
1360
1361 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1362
1363 * ContentChecker.pm: Return |{term}| list.
1364
1365 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1366
1367 * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
1368 nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
1369 and <title> outside of head).
1370
1371 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1372
1373 * IMTChecker.pm: Report warning for unregistered
1374 and private types/subtypes.
1375
1376 * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
1377 URIChecker.pm, HTMLTable.pm: Error messages are now
1378 consistent; they are all listed in
1379 <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
1380
1381 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1382
1383 * ContentChecker.pm: |<img ismap>| not in |<a></a>|
1384 is now erred. |<datalist>| is implemented.
1385 Attribute checker for |<command>| and |<menu>| are
1386 added. Support for |contextmenu| global attribute
1387 is added.
1388
1389 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1390
1391 * HTML.pm.src (_reset_insertion_mode): Interpretation
1392 of Step 3 has been changed.
1393
1394 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1395
1396 * HTML.pm.src: Late |<html>| parse error is implemented.
1397
1398 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1399
1400 * URIChecker.pm (check_iri_reference): A |decode| method name was
1401 incorrect.
1402
1403 * ContentChecker.pm: Support for the |footer| element.
1404 Check URI syntax for space-separated URI attributes.
1405 Support for the |tabindex| attribute. Support
1406 for |datetime| attribute.
1407
1408 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1409
1410 * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
1411 character references). HTML5 revision 1.146 (white space
1412 characters before root start tag). HTML5 revision
1413 1.148 (named character references in attribute values).
1414 HTML5 revision 1.152 (<plaintext>.innerHTML get).
1415
1416 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1417
1418 * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
1419 in <head>).
1420
1421 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1422
1423 * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>
1424 in body).
1425
1426 * ContentChecker.pm: HTML5 revision 938 (scoped="").
1427
1428 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1429
1430 * HTML.pm.src: HTML5 revision 923 (matching end tag in CDATA
1431 or RCDATA in fragment parsing mode). HTML5 revision
1432 924 (<!--> and <!--->). HTML5 revision 926 (hn in hn).
1433
1434 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1435
1436 * HTML.pm.src (get_inner_html): HTML5 revision 922 (inner_html
1437 for <pre> and <textarea>).
1438
1439 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1440
1441 * HTML.pm.src: HTML5 revision 920 (<isindex>).
1442
1443 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1444
1445 * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).
1446
1447 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1448
1449 * HTML.pm.src: HTML5 revision 916 (</body>).
1450 HTML5 revision 917 (conforming bare &).
1451
1452 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1453
1454 * NanoDOM.pm (manakai_is_html): Setting to false did not work.
1455
1456 * HTML.pm.src: HTML5 revision 914 (</ in CDATA, RCDATA).
1457 HTML5 revision 915 (<nobr>).
1458
1459 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1460
1461 * HTML.pm.src: HTML5 revisions 908, 909, 912, and 913 (quirks mode).
1462
1463 * NanoDOM.pm (manakai_is_html, manakai_compat_mode, compat_mode):
1464 New attributes.
1465
1466 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1467
1468 * HTML.pm.src: HTML5 revisions 900, 901, 902, and 911 (<
1469 in tags).
1470
1471 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1472
1473 * .cvsignore: |Entities.html| is added.
1474
1475 * HTML.pm.src: |$entity_char| is removed and
1476 requires |Whatpm::_NamedEntityList| instead.
1477 HTML5 revision 898 (refc), except that lack of refc
1478 is parse error.
1479
1480 * mkentitylist.pl: New script.
1481
1482 * Makefile (all): |_NamedEntityList.pm| is added.
1483 (_NamedEntityList.pm, Entities.html): New rules.
1484
1485 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1486
1487 * HTML.pm.src: Parse errors immediately after U+000D
1488 were ignored and U+000D immediately following another
1489 U+000D was not converted to U+000A.
1490
1491 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1492
1493 * HTML.pm.src (set_inner_html): HTML5 revision 892 (adopt
1494 nodes before appended). Parser was not ready for NULL
1495 parse error and escape flag.
1496
1497 * NanoDOM.pm (adopt_node): New.
1498
1499 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1500
1501 * HTML.pm.src: HTML5 revision 886 (insane comment in
1502 CDATA and RCDATA). Note that current implementation
1503 is simply repeating what the spec says and it is maybe not
1504 a best way to do it.
1505
1506 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1507
1508 * HTML.pm.src: HTML5 revision 884 (</form> don't close
1509 the form element if a descendant element without implied end tag has
1510 still been open).
1511
1512 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1513
1514 * ContentChecker.pm: HTML5 revision 881 (Make |id| attribute with
1515 space characters non-conforming).
1516
1517 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1518
1519 * HTML.pm.src: An error message was incorrect.
1520 HTML5 revision 869 (C1 character references).
1521
1522 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1523
1524 * HTML.pm.src: HTML5 revision 867 (a LF at the beginning of
1525 a |textarea| is removed).
1526
1527 2007-06-05 Wakaba <wakaba@suika.fam.cx>
1528
1529 * NanoDOM.pm (get_attribute_node_ns): New method.
1530
1531 * ContentChecker.pm: |script| |async| and |defer|
1532 no longer require |src|. |async| MUST NOT be
1533 specified if |defer|. (HTML5 revision 858).
1534
1535 2007-05-30 Wakaba <wakaba@suika.fam.cx>
1536
1537 * HTML.pm.src: |<form><form>| went to inifinite loop.
1538
1539 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1540
1541 * ContentChecker.pm (html): Set |is_root| (allowed
1542 as a document element) flag on.
1543 (new): Removed.
1544 (check_document): New method.
1545
1546 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1547
1548 * ContentChecker.pm (thead, tfoot): Checker specifications
1549 were incorrect.
1550
1551 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1552
1553 * ContentChecker.pm ($HTMLURIAttrChecker): Include
1554 error position in the |type| option of the error.
1555
1556 * HTMLTable.pm (form_table): The |$onerror| parameter
1557 is now optional. Some bugs are fixed.
1558
1559 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1560
1561 * HTMLTable.pm: New module.
1562
1563 * ContentChecker.pm (table): Invoke table model error checker.
1564
1565 * NanoDOM.pm (first_child, get_attribute_ns): New.
1566
1567 2007-05-26 Wakaba <wakaba@suika.fam.cx>
1568
1569 * ContentChecker.pm ($HTMLLinkTypesAttrChecker): New checker.
1570 (link/@rel, a/@rel, area/@rel): Use new checker.
1571
1572 * Makefile (_LinkTypeList.pm, RelExtensions.html): New rules.
1573
1574 * _LinkTypeList.pm: New file.
1575
1576 * mklinktypelist.pl: New file.
1577
1578 * .cvsignore: |RelExtensions.html| added.
1579
1580 * NanoDOM.pm (child_nodes): Returns an empty array
1581 for non-child-containing node types.
1582 (text_content): New attribute.
1583
1584 2007-05-26 Wakaba <wakaba@suika.fam.cx>
1585
1586 * IMTChecker.pm: New module.
1587
1588 * ContentChecker.pm ($HTMLIMTAttrChecker): Call IMTChecker
1589 to test parameter value validity.
1590
1591 * HTML.pm.src ($style_start_tag): Attributes were
1592 discarded.
1593
1594 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1595
1596 * ContentChecker.pm ($HTMLURIAttrChecker): Implemented.
1597
1598 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1599
1600 * URIChecker.pm: All recommendations from RFC 3986
1601 and RFC 3987 are listed (not all testable items are checked yet).
1602
1603 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1604
1605 * URIChecker.pm: New module.
1606
1607 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1608
1609 * ContentChecker.pm: Now most attributes are implemented
1610 or associated to some placeholder.
1611 ($ElementDefault): Warn unknown attributes
1612 for unknown elements as "attribute not supported".
1613 ($HTMLLanguageTagAttrChecker, $HTMLMQAttrChecker): New placeholders.
1614 ($HTMLUsemapAttrChecker, $HTMLTargetAttrChecker): New checkers.
1615 (|a| attribute checker): Reimplemented.
1616
1617 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1618
1619 * ContentChecker.pm ($HTMLEventHandlerAttrChecker): New placeholder.
1620 ($HTMLAttrChecker): Event handler content attributes
1621 are added.
1622 (link, embed): Required attribute is now checked.
1623 (embed): Unknown local attributes are no longer warned.
1624
1625 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1626
1627 * ContentChecker.pm ($HTMLSpaceURIsAttrChecker): New placeholder.
1628 ($HTMLIMTAttrChecker): New checker.
1629 (link@rel, link@href, link@type, style@type,
1630 a@href, a@ping, a@ping, a@type, embed@src, embed@type,
1631 object@data, object@type, source@src, source@type, area@alt,
1632 area@shape, area@coords, area@href,
1633 area@ping, area@rel, area@type, script@src,
1634 script@defer, script@async, script@type): Checkers added.
1635
1636 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1637
1638 * ContentChecker.pm: Descendant checking was incorrect.
1639
1640 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1641
1642 * ContentChecker.pm: Support |xml:*| and |xmlns:*|
1643 attributes. Report an error if |Element.prefix|
1644 is |xmlns|.
1645
1646 * NanoDOM.pm (prefix): New attribute.
1647
1648 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1649
1650 * HTML.pm.src: In |main| phase, |in body| insertion
1651 mode, action for |<iframe>| was missing.
1652
1653 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1654
1655 * ContentChecker.pm: Support for many of HTML5 elements.
1656 ($GetHTMLNonNegativeIntegerAttrChecker): New.
1657
1658 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1659
1660 * ContentChecker.pm: Support for most elements up to |progress|.
1661 ($HTMLURIAttrChecker): Placeholder.
1662 ($HTMLIntegerAttrChecker, $GetHTMLFloatingPointNumberAttrChecker):
1663 New.
1664
1665 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1666
1667 * ContentChecker.pm: Attribute checkers
1668 for global attributes, |html|, |base|, |style|, and |meta|.
1669
1670 * NanoDOM.pm (insert_before): Weaken reference
1671 to the parent node.
1672 (Attr::new): Set |owner_element| attribute.
1673 (namespace_uri, manakai_local_name): New attribute implementations.
1674 (owner_element): New attribute.
1675
1676 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1677
1678 * ContentChecker.pm ($AttrChecker, $HTMLAttrChecker,
1679 $AnyChecker->{attr_checker}, $HTMLAttrsChecker,
1680 $Element->{$HTML_NS}->{''}): New.
1681 (check_element): Invoke attrs_checker for each element.
1682
1683 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1684
1685 * ContentChecker.pm: Don't use |manakai_element_type_match|.
1686
1687 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1688
1689 * ContentChecker.pm: Use hashs rather than lists for
1690 element type testings.
1691
1692 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1693
1694 * ContentChecker.pm: Don't generate duplicate
1695 error when an element type is put in the "minus" list
1696 and the element type is not allowed explicitly in the particular
1697 element content model.
1698 (html:a checker): New checker.
1699 (html:details, html:datagrid): New checkers.
1700 (html:legend): New checker.
1701
1702 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1703
1704 * ContentChecker.pm (html:li checker): Implemented.
1705
1706 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1707
1708 * ContentChecker.pm ($HTMLInlineOrStriclyInlineChecker): New
1709 checker.
1710 (html:dd checker): New checker.
1711 (html:q, html:em, html:strong, html:small,
1712 html:m, html:dfn, html:code, html:samp, html:span): New checkers.
1713
1714 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1715
1716 * ContentChecker.pm ($AnyChecker): Renamed
1717 from |$ElementDefault->{checker}|.
1718 ($ElementDefault->{checker}): Throw an error that
1719 the element type is not supported by the checker.
1720 ($HTMLMetadataElement): |html:base| was missing.
1721 ($HTMLEmptyChecker): Don't throw an error
1722 for inter-element whitespace nodes.
1723 (html:html checker): Errors were not
1724 thrown even if |html:head| and/or |html:body|
1725 children were missing.
1726 (html:head checker): An error was not
1727 thrown if <meta charset> appered after other
1728 elements.
1729
1730 2007-05-05 Wakaba <wakaba@suika.fam.cx>
1731
1732 * ContentChecker.pm: |footer|, |video|, |audio|, |script|,
1733 and |noscript| elements are implemented.
1734 (new): New method.
1735
1736 2007-05-04 Wakaba <wakaba@suika.fam.cx>
1737
1738 * ContentChecker.pm: New module.
1739
1740 2007-05-04 Wakaba <wakaba@suika.fam.cx>
1741
1742 * NanoDOM.pm (manakai_parent_element,
1743 document_element, manakai_local_name,
1744 manakai_element_type_match): New method.
1745
1746 2007-05-03 Wakaba <wakaba@suika.fam.cx>
1747
1748 * HTML.pm.src: Replace decimal and hexadecimal numeric
1749 entities in C1 range using Windows-1252 mapping. Bare LF
1750 did not count as new line for error reporting.
1751
1752 2007-05-02 Wakaba <wakaba@suika.fam.cx>
1753
1754 * NanoDOM.pm (DOMImplementation): New class.
1755 (append_child): Weaken the |parent_node| reference.
1756 (create_element_ns, Element new): Set the |owner_document|
1757 reference.
1758 (implementation): New attribute.
1759 (owner_document, local_name, namespace_uri): New attributes.
1760
1761 * HTML.pm.src (parse_string): Line and column numbers
1762 are now provided to error handler.
1763 (!!!parse-error): Short descriptions are added.
1764 (_construct_tree): Split into three methods; support
1765 for innerHTML mode.
1766 (set_inner_html): New method.
1767
1768 2007-05-01 Wakaba <wakaba@suika.fam.cx>
1769
1770 * NanoDOM.html: Documentation is added.
1771
1772 * HTML.pod, ContentType.html: Documentation is revised.
1773
1774 * .cvsignore: Pod2html temporary files are added.
1775
1776 * Makefile: Make |NanoDOM.html|.
1777
1778 2007-05-01 Wakaba <wakaba@suika.fam.cx>
1779
1780 * HTML.pm.src (parse_string): New method.
1781 (get_inner_html): Renamed from |inner_html|.
1782
1783 * Makefile: A rule for |HTML.html| is added.
1784
1785 * HTML.pod: New documentation.
1786
1787 2007-05-01 Wakaba <wakaba@suika.fam.cx>
1788
1789 * NanoDOM.pm (last_child, previous_sibling): New attributes.
1790 (clone_node): Attribute nodes were not completely copied.
1791
1792 * HTML.pm.src: Many bugs are fixed.
1793
1794 2007-04-30 Wakaba <wakaba@suika.fam.cx>
1795
1796 * HTML.pm.src: Some typos are fixed.
1797
1798 2007-04-30 Wakaba <wakaba@suika.fam.cx>
1799
1800 * mkhtmlparser.pl, Makefile: References to the |HTML-consume-entity.src|
1801 are removed.
1802
1803 * HTML.pm.src: Tokenizer's handling on named entities are rewritten.
1804
1805 * HTML-consume-entity.src: Removed.
1806
1807 2007-04-30 Wakaba <wakaba@suika.fam.cx>
1808
1809 * HTML.pm.src: Tokenizer's handling on hexadecimal
1810 numeric entities are rewritten.
1811
1812 2007-04-30 Wakaba <wakaba@suika.fam.cx>
1813
1814 * HTML.pm.src: Some tokenizer bugs are fixed.
1815
1816 2007-04-30 Wakaba <wakaba@suika.fam.cx>
1817
1818 * HTML.pm.src: The tree construction stage is implemented.
1819
1820 * mkhtmlparser.pl: New macros are added.
1821
1822 2007-04-28 Wakaba <wakaba@suika.fam.cx>
1823
1824 * ContentType.pm: A note on bug in the specification
1825 is removed since it's been now fixed.
1826
1827 * .cvsignore: New file.
1828
1829 2007-04-28 Wakaba <wakaba@suika.fam.cx>
1830
1831 * HTML.pm.src, HTML-consume-entity.src: New files.
1832
1833 * Makefile (HTML.pm): New rule.
1834
1835 * mkhtmlparser.pl: New script.
1836
1837 2007-04-25 Wakaba <wakaba@suika.fam.cx>
1838
1839 * Makefile: New file.
1840
1841 2007-04-24 Wakaba <wakaba@suika.fam.cx>
1842
1843 * ContentType.pm: An error in pod is fixed.
1844
1845 2007-04-24 Wakaba <wakaba@suika.fam.cx>
1846
1847 * ContentType.pm: Documentation is added.
1848
1849 2007-04-24 Wakaba <wakaba@suika.fam.cx>
1850
1851 * ContentType.pm: New Perl module.
1852
1853 * ChangeLog: New file.
1854
1855

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24