/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Contents of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.282 - (show annotations) (download)
Sat Aug 30 12:57:05 2008 UTC (17 years, 8 months ago) by wakaba
Branch: MAIN
Changes since 1.281: +5 -0 lines
++ whatpm/t/ChangeLog	30 Aug 2008 12:56:52 -0000
	* HTML-tree.dat: tree-test-3.dat added.

	* tree-test-3.dat: Test data for definitionURL="" are added (cf.
	HTML5 revision 2130).

2008-08-30  Wakaba  <wakaba@suika.fam.cx>

++ whatpm/Whatpm/ChangeLog	30 Aug 2008 12:55:11 -0000
	* mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
	revision 2130).

2008-08-30  Wakaba  <wakaba@suika.fam.cx>

1 2008-08-30 Wakaba <wakaba@suika.fam.cx>
2
3 * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
4 revision 2130).
5
6 2008-08-30 Wakaba <wakaba@suika.fam.cx>
7
8 * ContentChecker.pm: |xml:lang| attribute value must be same
9 as |lang| attribute value for HTML elements (HTML5 revision 2062
10 and so on).
11
12 2008-08-30 Wakaba <wakaba@suika.fam.cx>
13
14 * ContentChecker.pm: Error level definition for |xml_id_error|
15 was missing.
16
17 * URIChecker.pm: The end of the URL should be marked as the
18 error location for an empty path error. The position
19 between the userinfo and the port components should be
20 marked as the error location for an empty host error.
21
22 2008-08-30 Wakaba <wakaba@suika.fam.cx>
23
24 * URIChecker.pm: Set parameters representing where in the
25 value the error occurs for errors. Report unknown
26 address format error in warning level, since address
27 formats are rarely added. Path segments starting with "/.."
28 were misinterpreted as a dot-segment.
29
30 2008-08-30 Wakaba <wakaba@suika.fam.cx>
31
32 * URIChecker.pm (check_iri_reference): Requires
33 |Message::DOM::DOMImplementation|.
34
35 2008-08-29 Wakaba <wakaba@suika.fam.cx>
36
37 * IMTChecker.pm: Updated for the new error reporting architecture.
38
39 * ContentChecker.pm: Error levels for IMTs are added.
40
41 2008-08-17 Wakaba <wakaba@suika.fam.cx>
42
43 * H2H.pm (_shift_token): Support for unquoted HTML attribute
44 values.
45
46 2008-08-16 Wakaba <wakaba@suika.fam.cx>
47
48 * CacheManifest.pm: Support for new style of error
49 reports.
50
51 * HTML.pm.src: Set line=1, column=1 to the document node.
52
53 2008-08-16 Wakaba <wakaba@suika.fam.cx>
54
55 * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
56 and URL checkers. Support for more error levels for bogus
57 langauge tag and URL "standards".
58
59 * LangTag.pm, URIChecker.pm: Support for new style error
60 level reporting.
61
62 2008-08-15 Wakaba <wakaba@suika.fam.cx>
63
64 * ContentChecker.pm: Support for RDF/XML error levels.
65
66 * HTMLTable.pm, RDFXML.pm: Support for new style of error level
67 specifying. Error types are revised.
68
69 2008-08-15 Wakaba <wakaba@suika.fam.cx>
70
71 * ContentChecker.pm: All error reporting method calls are
72 renewed.
73
74 2008-08-15 Wakaba <wakaba@suika.fam.cx>
75
76 * HTML.pm.src: All error type names and "text" parameters
77 are revised. Use new style for "level" specification.
78
79 * mkhtmlparser.pl: Use new style for "level" specification.
80
81 2008-08-03 Wakaba <wakaba@suika.fam.cx>
82
83 * WebIDL.pm (parse_char_string): Simplified error
84 reporting process for broken ignored valuetype definition.
85 (Valuetype idl_text): Support for special "DOMString" name.
86
87 2008-08-03 Wakaba <wakaba@suika.fam.cx>
88
89 * WebIDL.pm ($get_scoped_name): Append "::::" if the last
90 terminal of the ScopedName is "DOMString", such that whether
91 the last part of the scoped name is "DOMString" or "_DOMString"
92 later. It is necessary to determine whether a |typedef|
93 definition should be ignored or not.
94 (parse_char_string): Unescape the identifier of
95 exception members.
96 ($resolve): Return undef for builtin types and sequence<T>
97 types (we might not have to do this, however...).
98 (check): Support checking for Exceptions, Valuetypes,
99 and Typedefs.
100 ($serialize_type): Support for "DOMString::::" syntax.
101 (Typedef idl_text): Output Type as "DOMString" if it
102 is really "DOMString" (i.e. its internal representation
103 is "::DOMString::").
104
105 2008-08-03 Wakaba <wakaba@suika.fam.cx>
106
107 * WebIDL.pm ($resolve): New code, based on resolve code
108 for constant types in the |check| method.
109 (check): Support for checking of attributes, operations, and
110 arguments.
111 (Attribute/Operation idl_text): Exception names in getraises,
112 setraises, and raises clauses is serizlied by |$serialize_type|
113 code.
114
115 2008-08-02 Wakaba <wakaba@suika.fam.cx>
116
117 * WebIDL.pm ($integer): Order of selections are changed to match
118 hexadecimal numbers (the original pattern, taken from the spec,
119 was not work for hexadecimal numbers, because the "0" prefix
120 matches to the [0-7]* part (as an empty string) and therefore
121 it does not match with remaining "x..." part of a "0x..." integer
122 literal.
123 ($get_type): It now returns a string, not an array reference,
124 for regular types and |sequence| types (i.e. it in any case
125 returns a string).
126 ($get_next_token): The second item in the array that represents
127 a integer or float token is now a Perl number value, not the
128 original string representation of the number.
129 (check): Support for const value consistency checking.
130 No extended attribute is defined for constants.
131 (Node subclasses): Use simple strings rather than array references
132 for default data type values.
133 ($serialize_type): Type values are now simple strings.
134 (value): If the new attribute value is a false value, then
135 a FALSE value is set to the attribute.
136
137 2008-08-02 Wakaba <wakaba@suika.fam.cx>
138
139 * WebIDL.pm ($get_scoped_name): Now scoped names are stored
140 in its stringified format ("scoped name" as defined in the
141 spec). Note that future version of this module should not use
142 array references for type values and the |type_text| attribute
143 should be made obsolete.
144 (parse_char_string): Unescape attribute names.
145 (check): Support for checking of whether inherited interfaces
146 are actually defined or not. Support for checking of whether
147 interface member identifiers are duplicated or not.
148 ($serialize_type): Scoped names are returned as is. A future
149 version of this code should escape identifiers other than "DOMString",
150 otherwise the idl_text would be non-conforming.
151
152 2008-08-02 Wakaba <wakaba@suika.fam.cx>
153
154 * WebIDL.pm (parse_char_string): Set line/column numbers
155 to generated nodes. Unescape identifiers. Extended attributes
156 for Definition's were ignored.
157 (append_child): Set |parent_node| attribute.
158 (parent_node): New attribute.
159 (check): Support interface/exception members. Support
160 extended attributes. Support definition identifier uniqueness
161 constraint.
162 (qualified_name): New attribute.
163 (Interface/Exception idl_text): Extended attributes were
164 not prepended to the returned text.
165
166 2008-08-02 Wakaba <wakaba@suika.fam.cx>
167
168 * WebIDL.pm (parse_char_string): Set line/column numbers
169 to interface object experimentally. s/shift/pop/g, shift
170 would make things wrong. Support for interface forward
171 declarations was missing. Broken interface declarations
172 with no block were not ignored entirely.
173 (Whatpm::WebIDL::Node): New abstract class. This class
174 makes things easier.
175 (child_nodes): New attribute. Unlike DOM's attribute with
176 same name, this attribute returns a dead list of nodes for
177 simplicity.
178 (get_user_data, set_user_data): New methods.
179 (Module idl_text): A SPACE character should be inserted
180 before the |{| character.
181 (Interface idl_text): Support for interface forward declarations.
182 (is_forward_declaration): New attribute.
183
184 2008-07-19 Wakaba <wakaba@suika.fam.cx>
185
186 * WebIDL.pm (type_text): Better serializer.
187
188 2008-07-19 Wakaba <wakaba@suika.fam.cx>
189
190 * WebIDL.pm: Revise forward-compatible parsing so that
191 it now can handle broken extended attributes and as such.
192
193 2008-07-19 Wakaba <wakaba@suika.fam.cx>
194
195 * WebIDL.pm: Real support for extended attributes.
196 Support for extended attributes with arguments.
197
198 2008-07-19 Wakaba <wakaba@suika.fam.cx>
199
200 * WebIDL.pm: Support for |exception| syntax.
201 (Interface->idl_text): Tentative support for inheritances.
202
203 2008-07-19 Wakaba <wakaba@suika.fam.cx>
204
205 * WebIDL.pm: Hierarchical scoped name support was broken.
206 Support for raises, setraises, and getraises syntaxes.
207
208 2008-07-18 Wakaba <wakaba@suika.fam.cx>
209
210 * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
211 proper support for types, extended attributes, and exceptions yet).
212 WebIDL parser, version 1 (no support for exceptions yet,
213 no proper support for extended attributes yet).
214
215 2008-07-09 Wakaba <wakaba@suika.fam.cx>
216
217 * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
218
219 2008-06-29 Wakaba <wakaba@suika.fam.cx>
220
221 * WebIDL.pm: Support for valuetype and const.
222
223 2008-06-29 Wakaba <wakaba@suika.fam.cx>
224
225 * WebIDL.pm: New module.
226
227 2008-06-15 Wakaba <wakaba@suika.fam.cx>
228
229 * Makefile (Entities.html): URI changed.
230
231 2008-06-08 Wakaba <wakaba@suika.fam.cx>
232
233 * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
234
235 2008-06-01 Wakaba <wakaba@suika.fam.cx>
236
237 * HTML.pm.src (_get_next_token): A parse error was missing.
238
239 2008-06-01 Wakaba <wakaba@suika.fam.cx>
240
241 * mklinktypelist.pl: rel=contact is no longer part of the HTML5
242 spec (commented out). (HTML5 revision 1711).
243
244 2008-05-25 Wakaba <wakaba@suika.fam.cx>
245
246 * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
247
248 * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
249 as UTF-16 (HTML5 revision 1701).
250
251 2008-05-25 Wakaba <wakaba@suika.fam.cx>
252
253 * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
254 1692).
255
256 2008-05-25 Wakaba <wakaba@suika.fam.cx>
257
258 * HTML.pm.src: The secondary insertion mode used when switching
259 to foreign content is the "in body" insertion mode (HTML5 revision
260 1696).
261
262 2008-05-25 Wakaba <wakaba@suika.fam.cx>
263
264 * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
265 revision 1697).
266
267 2008-05-24 Wakaba <wakaba@suika.fam.cx>
268
269 * HTML.pm.src: Support for end-of-file token in foreign content
270 insertion mode (HTML5 revision 1693). Update SVG camelCase
271 attribute list (HTML5 revision 1700). <textarea> closes
272 </select> (HTML5 revision 1699). More start tags close in
273 foreign content insertion mode (HTML5 revision 1698).
274
275 2008-05-24 Wakaba <wakaba@suika.fam.cx>
276
277 * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
278
279 2008-05-24 Wakaba <wakaba@suika.fam.cx>
280
281 * HTML.pm.src: More robust charset parameter detection (HTML5
282 revision 1674).
283
284 2008-05-24 Wakaba <wakaba@suika.fam.cx>
285
286 * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
287 revision 1676).
288
289 2008-05-24 Wakaba <wakaba@suika.fam.cx>
290
291 * HTML.pm.src: Ignore language part of public identifiers for
292 quriks mode detection (HTML5 revision 1679).
293
294 2008-05-24 Wakaba <wakaba@suika.fam.cx>
295
296 * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
297 revision 1685).
298
299 2008-05-24 Wakaba <wakaba@suika.fam.cx>
300
301 * HTML.pm.src: Support for EOF in new states for tags (HTML5
302 revision 1684).
303
304 2008-05-24 Wakaba <wakaba@suika.fam.cx>
305
306 * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
307 work (HTML5 revision 1690).
308
309 2008-05-24 Wakaba <wakaba@suika.fam.cx>
310
311 * HTML.pm.src (_tree_construction_main): Change handling of
312 end tags in head insertion modes (HTML5 revision 1686).
313 (parse_char_string): Bug fix for non-utf8 character string handlings.
314 (parse_char_stream): |ungetc| does not work well for this context.
315
316 2008-05-18 Wakaba <wakaba@suika.fam.cx>
317
318 * HTML.pm.src (parse_byte_string): Redefined to invoke
319 |parse_byte_stream|.
320 (parse_byte_stream): New method.
321
322 2008-05-18 Wakaba <wakaba@suika.fam.cx>
323
324 * HTML.pm.src (parse_byte_string): Fix the column number reported
325 by encoding layer error reporter.
326
327 2008-05-17 Wakaba <wakaba@suika.fam.cx>
328
329 * HTML.pm.src (parse_byte_string): Use streaming decoder
330 rather than converting the whole byte string and then parsing.
331 Propagate errors in character encoding layer.
332 (get_next_token): Precise error reporting for |bare stago| error.
333
334 2008-05-17 Wakaba <wakaba@suika.fam.cx>
335
336 * HTML.pm.src (parse_char_stream): New method.
337 (parse_char_string): This method is now defined as an invocation
338 of the |parse_char_stream| method.
339
340 2008-05-17 Wakaba <wakaba@suika.fam.cx>
341
342 * HTML.pm.src (parse_byte_string): Report various status
343 of the sniffing as info-level errors. Support for new
344 decoding framework in parser resestting.
345 (new): Various default error levels were not set.
346
347 2008-05-17 Wakaba <wakaba@suika.fam.cx>
348
349 * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
350 algorithm, except for the actual sniffing, is implemented
351 with new framework with Message::Charset::Info.
352
353 2008-05-16 Wakaba <wakaba@suika.fam.cx>
354
355 * CacheManifest.pm (_parse): Drop fragment identifiers from
356 URIs in fallback section (HTML5 revision 1596).
357
358 2008-05-10 Wakaba <wakaba@suika.fam.cx>
359
360 * Makefile (Entities.html): URI has changed.
361
362 2008-05-10 Wakaba <wakaba@suika.fam.cx>
363
364 * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
365 1553).
366
367 2008-05-06 Wakaba <wakaba@suika.fam.cx>
368
369 * ContentChecker.pm: Noted that those returned in |table| are
370 no longer table elements, but table objects returned
371 by Whatpm::HTMLTable.
372
373 * HTMLTable.pm (form_table): Return table element node
374 as |$table->{element}|.
375 (assign_header): Support for the |headers=""| attribute.
376
377 2008-05-06 Wakaba <wakaba@suika.fam.cx>
378
379 * HTMLTable.pm (assign_header): New function; first version
380 with no support for headers="".
381 (form_table): Include table width and height to the returned
382 table object for covenience. Indexing in column assignement was wrong.
383 Set whether a data cell is empty or not for convenience.
384
385 2008-05-05 Wakaba <wakaba@suika.fam.cx>
386
387 * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
388
389 2008-05-05 Wakaba <wakaba@suika.fam.cx>
390
391 * HTMLTable.pm: How table model errors are detected is
392 changed (HTML5 revision 1387).
393
394 2008-05-05 Wakaba <wakaba@suika.fam.cx>
395
396 * HTMLTable.pm: The algorithm now moves |tfoot| elements
397 to the end of the table (HTML5 revision 1380).
398
399 2008-05-05 Wakaba <wakaba@suika.fam.cx>
400
401 * HTMLTable.pm: The algorithm is now 0-based indexing, instead
402 of 1-based (HTML5 revision 1376).
403
404 2008-05-05 Wakaba <wakaba@suika.fam.cx>
405
406 * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
407 and "Content-Encoding" no longer prevent sniffing (HTML5
408 revision 1288).
409
410 2008-05-05 Wakaba <wakaba@suika.fam.cx>
411
412 * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
413 1282).
414
415 2008-05-03 Wakaba <wakaba@suika.fam.cx>
416
417 * ContentChecker.pm: Support for global attributes.
418 Status of XML specs are added.
419
420 2008-05-03 Wakaba <wakaba@suika.fam.cx>
421
422 * ContentChecker.pm (check_element): Support for |template=""|
423 and |ref=""| attribute (referent element type checking).
424
425 2008-04-29 Wakaba <wakaba@suika.fam.cx>
426
427 * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
428 1500) is implemented (except for IDNA part and URI-scheme-specific
429 knowledge). Line number counting was wrong for LF-only
430 documents.
431
432 2008-04-13 Wakaba <wakaba@suika.fam.cx>
433
434 * HTML.pm.src: Raise an parse error for any disallowed
435 character (HTML5 revision 1263).
436
437 2008-04-13 Wakaba <wakaba@suika.fam.cx>
438
439 * mkentitylist.pl: Support for new HTML5 entity table format (the
440 definition for |AElig;| was missing).
441
442 2008-04-13 Wakaba <wakaba@suika.fam.cx>
443
444 * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
445 name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
446
447 2008-04-13 Wakaba <wakaba@suika.fam.cx>
448
449 * HTML.pm.src: List of element names that close foreign content
450 insertion mode is added (HTML5 revisions 1412 and 1418).
451
452 2008-04-13 Wakaba <wakaba@suika.fam.cx>
453
454 * HTML.pm.src: Support for |mglyph| and |malignmark|
455 elements (HTML5 revision 1410).
456
457 2008-04-13 Wakaba <wakaba@suika.fam.cx>
458
459 * HTML.pm.src: Support for new long MathML entities (HTML5
460 revision 1406).
461
462 2008-04-13 Wakaba <wakaba@suika.fam.cx>
463
464 * HTML.pm.src: CDATA section support for MathML and SVG
465 elements (HTML5 revision 1404 and 1420).
466
467 2008-04-12 Wakaba <wakaba@suika.fam.cx>
468
469 * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
470 elements (HTML5 revision 1404). Unused !!!macro definitions
471 are removed.
472
473 2008-04-12 Wakaba <wakaba@suika.fam.cx>
474
475 * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
476 are raised is changed (HTML5 revision 1404).
477
478 2008-04-06 Wakaba <wakaba@suika.fam.cx>
479
480 * HTML.pm.src: Category sets that are no longer used are removed.
481
482 2008-04-06 Wakaba <wakaba@suika.fam.cx>
483
484 * HTML.pm.src: The ->[1] property of stack entries are now
485 replaced by constants representing element category.
486
487 2008-04-06 Wakaba <wakaba@suika.fam.cx>
488
489 * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
490 for error reporting. (This is a preparation for using constant
491 value for ->[1].)
492
493 2008-03-22 Wakaba <wakaba@suika.fam.cx>
494
495 * RDFXML.pm: Typo fixed.
496
497 2008-03-22 Wakaba <wakaba@suika.fam.cx>
498
499 * ContentChecker.pm: |fact_level| is now treated
500 as same as |must_level|, i.e. level = |m|.
501 (check_element): Make list of URIs in the DOM.
502
503 2008-03-21 Wakaba <wakaba@suika.fam.cx>
504
505 * RDFXML.pm: Language accessor implemented.
506 Local (null-namespace) attribute support.
507
508 2008-03-21 Wakaba <wakaba@suika.fam.cx>
509
510 * RDFXML.pm: Factored out ID checking code.
511
512 2008-03-21 Wakaba <wakaba@suika.fam.cx>
513
514 * RDFXML.pm: TODO items noted. Validation of ID
515 and URI attributes is implemented. Warn if unknown
516 value is used in rdf:parseType="" attribute.
517
518 * URIChecker.pm (check_rdf_uri_reference): New function.
519
520 2008-03-21 Wakaba <wakaba@suika.fam.cx>
521
522 * RDFXML.pm: bnodeid implemented. Relative references
523 are now resolved.
524
525 2008-03-21 Wakaba <wakaba@suika.fam.cx>
526
527 * ContentChecker.pm: RDF reification implemented.
528
529 * RDFXML.pm: undef vs false bug fixed.
530 Reification implemented.
531
532 2008-03-21 Wakaba <wakaba@suika.fam.cx>
533
534 * RDFXML.pm: s/id/ID/ for attribute name.
535 The |node| arguments are added for |ontriple| calls.
536 Too many "attribute not allowed" errors were raised.
537
538 * ContentChecker.pm: Initial experimental support for rdf:RDF
539 element.
540
541 2008-03-21 Wakaba <wakaba@suika.fam.cx>
542
543 * RDFXML.pm: New module.
544
545 2008-03-20 Wakaba <wakaba@suika.fam.cx>
546
547 * HTML.pm.src (set_inner_html): Line/column number
548 code was old one yet.
549
550 2008-03-20 Wakaba <wakaba@suika.fam.cx>
551
552 * HTML.pm.src: Better line/column reporting for "duplicate attribute"
553 errors. Line/column markings for DOCTYPE, comment, and
554 character tokens are reintroduced; otherwise,
555 error location for "not HTML5" error and errors
556 for implied elements are not attached.
557
558 2008-03-20 Wakaba <wakaba@suika.fam.cx>
559
560 * HTML.pm.src: Set line/column numbers to attributes.
561
562 * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
563 (value): Setter implemented.
564
565 * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
566
567 2008-03-20 Wakaba <wakaba@suika.fam.cx>
568
569 * HTML.pm.src: Unused line/column markings are removed.
570
571 2008-03-20 Wakaba <wakaba@suika.fam.cx>
572
573 * HTML.pm.src (_get_next_token): Remove |first_start_tag|
574 flag, which is no longer used.
575
576 2008-03-17 Wakaba <wakaba@suika.fam.cx>
577
578 * HTML.pm.src: Set line/column information to element nodes.
579
580 * mkhtmlparser.pl (!!!create-element, !!!insert-element,
581 and !!!insert-element-t): Set line/column information to
582 element nodes.
583
584 2008-03-17 Wakaba <wakaba@suika.fam.cx>
585
586 * HTML.pm.src (_get_next_token): The first "<" character
587 in "<?", "<>", or "</>" should be the error point.
588
589 2008-03-16 Wakaba <wakaba@suika.fam.cx>
590
591 * HTML.pm.src: Some more fixes on error position reporting.
592
593 2008-03-16 Wakaba <wakaba@suika.fam.cx>
594
595 * HTML.pm.src: Token-level precious error reporting.
596
597 2008-03-16 Wakaba <wakaba@suika.fam.cx>
598
599 * HTML.pm.src: Preparation for more precious error point
600 reporting.
601
602 2008-03-11 Wakaba <wakaba@suika.fam.cx>
603
604 * HTML.pm.src: Error type revised.
605
606 2008-03-11 Wakaba <wakaba@suika.fam.cx>
607
608 * HTML.pm.src: Similar codes are merged together, again.
609
610 2008-03-11 Wakaba <wakaba@suika.fam.cx>
611
612 * HTML.pm.src: Similar codes are merged together.
613
614 2008-03-10 Wakaba <wakaba@suika.fam.cx>
615
616 * mkhtmlparser.pl: Set "level" parameter to parse errors.
617
618 * HTML.pm.src: Code refined.
619
620 2008-03-09 Wakaba <wakaba@suika.fam.cx>
621
622 * HTML.pm.src: |</body>| treatement has been changed (HTML5
623 revision 1348). Note that I really don't know this makes
624 any difference in the black-box behavior of the parser.
625
626 2008-03-09 Wakaba <wakaba@suika.fam.cx>
627
628 * HTML.pm.src: New end-of-file token implementation (HTML5
629 revision 1348).
630
631 2008-03-09 Wakaba <wakaba@suika.fam.cx>
632
633 * HTML.pm.src: |applet| support (HTML5 revision 1347).
634
635 2008-03-09 Wakaba <wakaba@suika.fam.cx>
636
637 * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
638
639 2008-03-09 Wakaba <wakaba@suika.fam.cx>
640
641 * HTML.pm.src: Support for |<input>| in the "in select" insertion
642 mode and support for the "in select in table" insertion mode (HTML5
643 revision 1342).
644
645 2008-03-09 Wakaba <wakaba@suika.fam.cx>
646
647 * HTML.pm.src: No foster parenting for <script> and <script>
648 in non-tainted <table>s (HTML5 revision 1336).
649
650 2008-03-09 Wakaba <wakaba@suika.fam.cx>
651
652 * HTML.pm.src: Ignore white space characters between <html>
653 and <head> (HTML5 revision 1332).
654
655 2008-03-09 Wakaba <wakaba@suika.fam.cx>
656
657 * HTML.pm.src: Treat <input type=hidden> as if it were a
658 white space (HTML5 revision 1331).
659
660 2008-03-08 Wakaba <wakaba@suika.fam.cx>
661
662 * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
663 element (HTML5 revision 1330).
664
665 2008-03-08 Wakaba <wakaba@suika.fam.cx>
666
667 * HTML.pm.src: <title> is always appended to the current
668 element (HTML5 revision 1328).
669
670 2008-03-08 Wakaba <wakaba@suika.fam.cx>
671
672 * HTML.pm.src: White space in tainted tables are moved
673 into foster parents (HTML5 revision 1326).
674
675 2008-03-08 Wakaba <wakaba@suika.fam.cx>
676
677 * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
678 revision 1321).
679
680 2008-03-08 Wakaba <wakaba@suika.fam.cx>
681
682 * HTML.pm.src: |</h/n/>| case code rearranged to align with
683 the spec (HTML5 revision 1320). Note that we finally complete
684 all of HTML5 revision 1320 changes.
685
686 2008-03-08 Wakaba <wakaba@suika.fam.cx>
687
688 * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
689 tags (HTML5 revision 1320).
690
691 2008-03-08 Wakaba <wakaba@suika.fam.cx>
692
693 * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
694
695 2008-03-08 Wakaba <wakaba@suika.fam.cx>
696
697 * HTML.pm.src: A "generate implied end tags" code (t409.1)
698 could not be reached so that it is now removed (HTML5 revision
699 1320).
700
701 2008-03-08 Wakaba <wakaba@suika.fam.cx>
702
703 * HTML.pm.src: Code for the case of |</div>| and so on are revised
704 to align with new spec text (HTML5 revision 1320).
705
706 2008-03-08 Wakaba <wakaba@suika.fam.cx>
707
708 * HTML.pm.src: Remove strange |if| condition; however, it should
709 have had no harm in theory.
710
711 2008-03-08 Wakaba <wakaba@suika.fam.cx>
712
713 * HTML.pm.src (_tree_construction_main): '</p>' in body
714 case is split from other end tags for the preparation
715 of implementing HTML revision 1320.
716
717 2008-03-07 Wakaba <wakaba@suika.fam.cx>
718
719 * HTML.pm.src: Simplified "generate implied end tag" (HTML5
720 revision 1320).
721
722 2008-03-07 Wakaba <wakaba@suika.fam.cx>
723
724 * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
725 and "div" (HTML5 revision 1318). Add comments to where
726 |form| pointer association codes should be inserted (HTML5 revision
727 1319).
728
729 2008-03-06 Wakaba <wakaba@suika.fam.cx>
730
731 * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
732
733 2008-03-05 Wakaba <wakaba@suika.fam.cx>
734
735 * HTML.pm.src: Since the case t268 should never be reached (no
736 other token type, there are), it is replaced by a |die| statement.
737
738 2008-03-05 Wakaba <wakaba@suika.fam.cx>
739
740 * HTML.pm.src: Typo fixed.
741
742 2008-03-04 Wakaba <wakaba@suika.fam.cx>
743
744 * HTML.pm.src (_tree_construction_initial): Some limited quirks
745 doctypes were not uppercased for comparison.
746
747 2008-03-03 Wakaba <wakaba@suika.fam.cx>
748
749 * HTML.pm.src (tree construction and set_inner_html): Checkpoints
750 are added.
751
752 2008-03-03 Wakaba <wakaba@suika.fam.cx>
753
754 * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
755 are set. Cases that are unlikely reached are noted as so.
756
757 2008-03-03 Wakaba <wakaba@suika.fam.cx>
758
759 * HTML.pm.src: Checkpoints for debugging are added.
760
761 * mkhtmlparser.pl: Support for |!!!cp| syntax.
762
763 2008-03-03 Wakaba <wakaba@suika.fam.cx>
764
765 * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
766 for simplicity.
767
768 2008-03-03 Wakaba <wakaba@suika.fam.cx>
769
770 * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
771 revision 1307).
772
773 2008-03-03 Wakaba <wakaba@suika.fam.cx>
774
775 * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
776 is set are changed (HTML5 revision 1305).
777
778 2008-03-02 Wakaba <wakaba@suika.fam.cx>
779
780 * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
781 1292). Entities are not parsed in comment-like part in RCDATA
782 elements (HTML5 revision 1294). Allow bare & at the end
783 of attribute value literals (HTML5 revision 1296). More
784 quirks mode doctypes (HTML5 revision 1302). Requires spaces
785 between attributes and ban attribute names or unquoted
786 attribute values containing single or double quotes (HTML5
787 revision 1303).
788
789 2008-03-02 Wakaba <wakaba@suika.fam.cx>
790
791 * ContentChecker.pm: Typo fixed. Don't raise "character encoding"
792 and related errors unless it is an HTML document (though the spec
793 is unclear on whether it is applied to XHTML document).
794
795 * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
796 are added.
797
798 2008-03-02 Wakaba <wakaba@suika.fam.cx>
799
800 * HTML.pm.src: s/local_name/manakai_local_name/g.
801
802 2008-03-01 Wakaba <wakaba@suika.fam.cx>
803
804 * _NamedEntityList.pm: Updated (HTML5 revision 1286).
805
806 * HTML.pm.src: |charset| in |content| attribute is
807 case-insensitive (HTML5 revision 1270).
808
809 2008-02-26 Wakaba <wakaba@suika.fam.cx>
810
811 * ContentChecker.pm: New status constants are added.
812 ($ElementDefault): |status| added.
813 (check_element): Err for non-standard or deprecated elements.
814 (_attr_status_info): For non-standard or deprecated attributes.
815
816 2008-02-24 Wakaba <wakaba@suika.fam.cx>
817
818 * ContentChecker.pm (_attr_status_info): New internal method.
819
820 2008-02-24 Wakaba <wakaba@suika.fam.cx>
821
822 * ContentChecker.pm (check_element): Element standardized
823 status information is now dispatched.
824
825 2008-02-24 Wakaba <wakaba@suika.fam.cx>
826
827 * ContentChecker.pm (check_element): Fix |del|-and-significant
828 problem by adding some more arguments.
829
830 2008-02-24 Wakaba <wakaba@suika.fam.cx>
831
832 * ContentChecker.pm (check_element): Use context of
833 container-for-the-purpose-of-content-model element (not transparent
834 element) for |check_child_element| calling and significant
835 text flag marking. This reintroduces |<del>|-and-significant
836 problem again.
837
838 2008-02-24 Wakaba <wakaba@suika.fam.cx>
839
840 * ContentChecker.pm (check_element): Make semi-transparent
841 elements ignored for the purpose of phase changes in
842 content model checking.
843
844 2008-02-23 Wakaba <wakaba@suika.fam.cx>
845
846 * ContentChecker.pm (check_element): In-element state
847 was not properly managed for transparent cases.
848
849 2008-02-23 Wakaba <wakaba@suika.fam.cx>
850
851 * ContentChecker.pm (check_element): Support for |video|
852 and |audio| as semi-transparent elements.
853
854 2008-02-23 Wakaba <wakaba@suika.fam.cx>
855
856 * ContentChecker.pm ($HTMLSemiTransparentElements): New.
857 (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
858 Support for |html:object| as a semi-transparent element.
859
860 2008-02-23 Wakaba <wakaba@suika.fam.cx>
861
862 * ContentChecker.pm (check_element): The way to traverse
863 the tree is entirely revised to make it easier to track
864 the state of ancestors/descendants. As a result of this
865 revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
866 support for content model checking for HTML elements |figure|,
867 |object|, |video|, and |audio| and checking for XML elements (and
868 some XMLNS checkings) are dropped for now. They will be
869 reimplemented in due cource.
870
871 2008-02-17 Wakaba <wakaba@suika.fam.cx>
872
873 * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
874 token (HTML5 revision 1225).
875
876 2008-02-17 Wakaba <wakaba@suika.fam.cx>
877
878 * ContentChecker.pm ({unsupported_level}): New value.
879
880 * HTML.pm.src: Save whether |meta| |content| attribute
881 contains character references or not.
882
883 2008-02-17 Wakaba <wakaba@suika.fam.cx>
884
885 * ContentChecker.pm (_get_children): (Incompleted) attempt to
886 imlement significant content checking for contents
887 with |del| elements.
888
889 2008-02-17 Wakaba <wakaba@suika.fam.cx>
890
891 * ContenteChecker.pm ($HTMLTransparentElements): More
892 elements are added.
893 (_get_children): HTML |object| elements are now semi-transparent.
894
895 * NanoDOM.pm (manakai_html, manakai_head): New methods.
896
897 2008-02-16 Wakaba <wakaba@suika.fam.cx>
898
899 * CacheManifest.pm: HTML5 revision 1211 implemented.
900
901 * CacheManifest.pod: Updated.
902
903 2008-02-10 Wakaba <wakaba@suika.fam.cx>
904
905 * ContentChecker.pm (check_document, check_element): Support
906 for second argument ($onsubdoc).
907 (_get_css_parser): Removed (now it is part of WDCC).
908
909 2008-02-09 Wakaba <wakaba@suika.fam.cx>
910
911 * ContentChecker.pm (_get_css_parser): New.
912
913 2007-11-25 Wakaba <wakaba@suika.fam.cx>
914
915 * ContentChecker.pm ($AnyChecker): Old way to add child elements
916 for checking had been used.
917
918 2007-11-25 Wakaba <wakaba@suika.fam.cx>
919
920 * ContentChecker.pm (check_element): New todo item type |descendant|.
921
922 2007-11-23 Wakaba <wakaba@suika.fam.cx>
923
924 * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
925 specced in RFC 2046 and RFC 4288.
926 (application/atom+xml): Definition added.
927
928 2007-11-23 Wakaba <wakaba@suika.fam.cx>
929
930 * URIChecker.pm: Make RFC 3986 should-level errors
931 warnings (rather than SHOULD-level errors).
932
933 2007-11-23 Wakaba <wakaba@suika.fam.cx>
934
935 * NanoDOM.pm (get_user_data, set_user_data): New methods.
936
937 * HTML.pm.src: A flag for character references in attribute
938 values are added. Set |manakai_has_reference| user data
939 to |charset| attribute.
940
941 2007-11-23 Wakaba <wakaba@suika.fam.cx>
942
943 * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
944 attributes.
945
946 * ContentChecker.pm (check_document): Warn if charset requirements
947 cannot be tested.
948
949 2007-11-19 Wakaba <wakaba@suika.fam.cx>
950
951 * HTML.pm.src (parse_byte_string): Detect charset
952 by universalchardet if charset parameter is not specified.
953
954 * Makefile (Charset-all, Charset-clean): New rules.
955
956 2007-11-18 Wakaba <wakaba@suika.fam.cx>
957
958 * ContentChecker.pm (check_document): Check the existence
959 of character encoding declaration and selection of encoding
960 for HTML document.
961
962 2007-11-18 Wakaba <wakaba@suika.fam.cx>
963
964 * ContentType.pm (get_sniffed_type): Return also the official
965 type in list context.
966
967 2007-11-18 Wakaba <wakaba@suika.fam.cx>
968
969 * ContentType.pm: Sniffing with leading white space
970 ignoring (HTML5 revisions 1013 and 1016).
971
972 2007-11-18 Wakaba <wakaba@suika.fam.cx>
973
974 * ContentType.pm: HTML5 revision 1013 changes, except for leading
975 white spaces, are implemented.
976
977 2007-11-11 Wakaba <wakaba@suika.fam.cx>
978
979 * HTML.pm.src (parse_char_string): Set |inner_encoding|
980 attribute if possible.
981
982 2007-11-11 Wakaba <wakaba@suika.fam.cx>
983
984 * HTML.pm.src (parse_byte_string): New method.
985 (parse_char_string): New alias for |parse_string|.
986 (main phase): Invoking "change the encoding" algorithm if desired.
987
988 * HTML.pod: Updated.
989
990 2007-11-11 Wakaba <wakaba@suika.fam.cx>
991
992 * HTML.pod (get_inner_html): Removed.
993
994 * Makefile (HTML-all, HTML-clean): New.
995
996 2007-11-11 Wakaba <wakaba@suika.fam.cx>
997
998 * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
999
1000 2007-11-08 Wakaba <wakaba@suika.fam.cx>
1001
1002 * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1003
1004 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1005
1006 * Makefile: |CacheManifest.html| is added.
1007
1008 * CacheManifest.pod: New file.
1009
1010 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1011
1012 * CacheManifest.pm: New module.
1013
1014 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1015
1016 * HTML.pm.src: Support for application cache selection algorithm
1017 callback.
1018
1019 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1020
1021 * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1022
1023 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1024
1025 * Makefile (clean): New rule.
1026
1027 * NanoDOM.pm (public_id, system_id): New attributes.
1028
1029 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1030
1031 * Makefile (CSS-all, CSS-clean, clean): New rules.
1032
1033 2007-10-14 Wakaba <wakaba@suika.fam.cx>
1034
1035 * ContentChecker.pm (check_document): Support for
1036 new |is_xml_root| flag.
1037 (check_element): Support for new |pluses| state.
1038 (_add_pluses): New method.
1039 (_remove_minuses): Support for new |minus| item.
1040
1041 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1042
1043 * ContentChecker.pm: Raise specific error for invalid
1044 root element.
1045
1046 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1047
1048 * ContentChecker.pm: Set level values for later uses.
1049
1050 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1051
1052 * ContentChecker.pm: Support for language tag validation.
1053
1054 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1055
1056 * LangTag.pm (check_rfc3066_language_tag): New method.
1057
1058 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1059
1060 * LangTag.pm: New module.
1061
1062 2007-09-04 Wakaba <wakaba@suika.fam.cx>
1063
1064 * HTML.pm.src: Some error types were wrong.
1065
1066 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1067
1068 * CSS/: New directory.
1069
1070 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1071
1072 * ContentChecker.pm (_check_get_children): Support
1073 for |noscript| in |head|.
1074
1075 2007-08-12 Wakaba <wakaba@suika.fam.cx>
1076
1077 * URI/: New directory.
1078
1079 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1080
1081 * HTML.pm.src: Tokenizer's states are now represented in
1082 number.
1083
1084 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1085
1086 * HTML.pm.src: |or|s for insertion modes are replaced
1087 by |&|s.
1088
1089 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1090
1091 * HTML.pm.src: Token types are now represented in number.
1092
1093 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1094
1095 * ContentType.pm (SEE ALSO): Updated.
1096
1097 * HTML.pm.src: Insertion modes are now represented in number.
1098
1099 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1100
1101 * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1102 999) is implemented.
1103
1104 2007-08-08 Wakaba <wakaba@suika.fam.cx>
1105
1106 * ContentType.pm: Sniffing for |<script| (HTML5 revision
1107 983) is implemented.
1108
1109 2007-08-06 Wakaba <wakaba@suika.fam.cx>
1110
1111 * ContentChecker.pod: New documentation.
1112
1113 * Makefile: A rule for |ContentChecker.html| is added.
1114
1115 * ContentChecker.pm: A pod "LICENSE" section is added.
1116
1117 * NanoDOM.pm ($VERSION): New variable.
1118
1119 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1120
1121 * H2H.pm: |b|, |i|, and |sub| are added to the
1122 list of allowed HTML elements.
1123
1124 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1125
1126 * H2H.pm: |samp| is added to the list of allowed
1127 HTML elements.
1128
1129 * URIChecker.pm (check_iri): New.
1130 (check_iri_reference): Error type for IRI reference
1131 syntax error is changed.
1132
1133 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1134
1135 * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1136 is added.
1137 (check_document): Load appropriate module before validation.
1138
1139 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1140
1141 * ContentChecker/: New directory.
1142
1143 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1144
1145 * ContentChecker.pm: HTML |time| element is implemented.
1146
1147 * HTMLTable.pm: Comments are updated as HTML5 is revised.
1148
1149 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1150
1151 * ContentChecker.pm (check_document): Return value
1152 even if no document element is found.
1153
1154 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1155
1156 * HTML.pm.src: |$in_body| is no longer a function.
1157
1158 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1159
1160 * HTML.pm.src: The |$in_body| code has been moved down.
1161
1162 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1163
1164 * HTML.pm.src: The "trailing end" insertion mode
1165 is split into "after html body" and "after html frameset"
1166 insertion modes. Their codes are merged with "after body"
1167 and "after frameset" codes. |$previous_insertion_mode|
1168 has been removed. "after frameset" code is
1169 merged with "in frameset" code.
1170
1171 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1172
1173 * HTML.pm.src: The "before head" insertion mode is
1174 merged with the "in head" insertion mode.
1175
1176 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1177
1178 * HTML.pm.src: Cases in "in head" insertion mode are
1179 reorganized.
1180
1181 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1182
1183 * HTML.pm.src: Some cases in "in table" insertion mode
1184 are merged.
1185
1186 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1187
1188 * HTML.pm.src: The "in row" insertion mode is merged
1189 with "in table" insertion mode.
1190
1191 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1192
1193 * HTML.pm.src: The "in table" and "in table body" insertion
1194 modes are merged.
1195
1196 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1197
1198 * HTML.pm.src: There is no "in table head" or "in table foot"
1199 insertion mode!
1200
1201 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1202
1203 * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
1204 now directly invoke the handler.
1205
1206 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1207
1208 * HTML.pm.src: Codes for "in cell" insertion mode
1209 is merged to the "in body" insertion mode code.
1210
1211 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1212
1213 * HTML.pm.src: Codes for "in body" and "in caption"
1214 insertion modes are merged.
1215
1216 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1217
1218 * HTML.pm.src: Two |!!!next-token|s were missing.
1219
1220 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1221
1222 * HTML.pm.src: Use numeric constant for |{content_mode}|
1223 instead of string constant for |{content_model_flag}|.
1224
1225 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1226
1227 * HTML.pm.src: Add the name of the attribute
1228 to the "duplicate attribute" error.
1229
1230 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1231
1232 * ContentChecker.pm: Return the |class| node list.
1233
1234 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1235
1236 * ContentChecker.pm: Return the |id| node list.
1237
1238 * HTML.pm.src: A typo is fixed.
1239
1240 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1241
1242 * ContentChecker.pm: Drop wrong |level => 'error'| specification
1243 from "in HTML:xml:lang" error. Character position
1244 is now the last part of the error type in the URI error
1245 description. Report "unsupported" status for language
1246 tags, media queries, script codes, and style sheets.
1247
1248 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1249
1250 * ContentChecker.pm: Report error if |xml:lang|
1251 in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
1252 in XML.
1253
1254 * NanoDOM.pm (Attr.owner_document): New attribute.
1255
1256 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1257
1258 * HTML.pm.src: The character immediately following
1259 a bare |hcro| was discarded. Fix handling of
1260 entity references in attribute values.
1261
1262 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1263
1264 * HTML.pm.src (main and trailin end phases): Token
1265 types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
1266 factored out. Error types |in html:#DOCTYPE|
1267 and |after html:#DOCTYPE| are merged
1268 into |DOCTYPE in the middle|. |</frameset>|
1269 in fragment parsing mode changed the
1270 insertion mode.
1271
1272 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1273
1274 * HTML.pm.src: |$phase| has been removed; The |trailing end|
1275 phase is now an insertion mode. Treatments for white
1276 space character tokens were incorrect for some
1277 insertion modes. An old |meta| case was not removed.
1278
1279 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1280
1281 * HTML.pm.src: |meta| charset declaration extraction
1282 implemented (but changing the encoding is not yet:-).
1283
1284 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1285
1286 * Charset/: New directory.
1287
1288 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1289
1290 * H2H.pm: New Perl module (created from
1291 manakai's H2H.dis).
1292
1293 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1294
1295 * XMLSerializer.pm: New Perl module (created from
1296 manakai's SimpleLS.dis).
1297
1298 2007-07-07 Wakaba <wakaba@suika.fam.cx>
1299
1300 * HTML.pm.src: |<!---x-->| was not processed correctly.
1301
1302 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1303
1304 * HTML.pm.src: Report correct error message
1305 for |<body></div></body>|.
1306
1307 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1308
1309 * HTMLTable.pm: An error description was incorrect.
1310
1311 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1312
1313 * ContentChecker.pm: Return |{term}| list.
1314
1315 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1316
1317 * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
1318 nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
1319 and <title> outside of head).
1320
1321 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1322
1323 * IMTChecker.pm: Report warning for unregistered
1324 and private types/subtypes.
1325
1326 * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
1327 URIChecker.pm, HTMLTable.pm: Error messages are now
1328 consistent; they are all listed in
1329 <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
1330
1331 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1332
1333 * ContentChecker.pm: |<img ismap>| not in |<a></a>|
1334 is now erred. |<datalist>| is implemented.
1335 Attribute checker for |<command>| and |<menu>| are
1336 added. Support for |contextmenu| global attribute
1337 is added.
1338
1339 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1340
1341 * HTML.pm.src (_reset_insertion_mode): Interpretation
1342 of Step 3 has been changed.
1343
1344 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1345
1346 * HTML.pm.src: Late |<html>| parse error is implemented.
1347
1348 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1349
1350 * URIChecker.pm (check_iri_reference): A |decode| method name was
1351 incorrect.
1352
1353 * ContentChecker.pm: Support for the |footer| element.
1354 Check URI syntax for space-separated URI attributes.
1355 Support for the |tabindex| attribute. Support
1356 for |datetime| attribute.
1357
1358 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1359
1360 * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
1361 character references). HTML5 revision 1.146 (white space
1362 characters before root start tag). HTML5 revision
1363 1.148 (named character references in attribute values).
1364 HTML5 revision 1.152 (<plaintext>.innerHTML get).
1365
1366 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1367
1368 * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
1369 in <head>).
1370
1371 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1372
1373 * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>
1374 in body).
1375
1376 * ContentChecker.pm: HTML5 revision 938 (scoped="").
1377
1378 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1379
1380 * HTML.pm.src: HTML5 revision 923 (matching end tag in CDATA
1381 or RCDATA in fragment parsing mode). HTML5 revision
1382 924 (<!--> and <!--->). HTML5 revision 926 (hn in hn).
1383
1384 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1385
1386 * HTML.pm.src (get_inner_html): HTML5 revision 922 (inner_html
1387 for <pre> and <textarea>).
1388
1389 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1390
1391 * HTML.pm.src: HTML5 revision 920 (<isindex>).
1392
1393 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1394
1395 * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).
1396
1397 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1398
1399 * HTML.pm.src: HTML5 revision 916 (</body>).
1400 HTML5 revision 917 (conforming bare &).
1401
1402 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1403
1404 * NanoDOM.pm (manakai_is_html): Setting to false did not work.
1405
1406 * HTML.pm.src: HTML5 revision 914 (</ in CDATA, RCDATA).
1407 HTML5 revision 915 (<nobr>).
1408
1409 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1410
1411 * HTML.pm.src: HTML5 revisions 908, 909, 912, and 913 (quirks mode).
1412
1413 * NanoDOM.pm (manakai_is_html, manakai_compat_mode, compat_mode):
1414 New attributes.
1415
1416 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1417
1418 * HTML.pm.src: HTML5 revisions 900, 901, 902, and 911 (<
1419 in tags).
1420
1421 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1422
1423 * .cvsignore: |Entities.html| is added.
1424
1425 * HTML.pm.src: |$entity_char| is removed and
1426 requires |Whatpm::_NamedEntityList| instead.
1427 HTML5 revision 898 (refc), except that lack of refc
1428 is parse error.
1429
1430 * mkentitylist.pl: New script.
1431
1432 * Makefile (all): |_NamedEntityList.pm| is added.
1433 (_NamedEntityList.pm, Entities.html): New rules.
1434
1435 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1436
1437 * HTML.pm.src: Parse errors immediately after U+000D
1438 were ignored and U+000D immediately following another
1439 U+000D was not converted to U+000A.
1440
1441 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1442
1443 * HTML.pm.src (set_inner_html): HTML5 revision 892 (adopt
1444 nodes before appended). Parser was not ready for NULL
1445 parse error and escape flag.
1446
1447 * NanoDOM.pm (adopt_node): New.
1448
1449 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1450
1451 * HTML.pm.src: HTML5 revision 886 (insane comment in
1452 CDATA and RCDATA). Note that current implementation
1453 is simply repeating what the spec says and it is maybe not
1454 a best way to do it.
1455
1456 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1457
1458 * HTML.pm.src: HTML5 revision 884 (</form> don't close
1459 the form element if a descendant element without implied end tag has
1460 still been open).
1461
1462 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1463
1464 * ContentChecker.pm: HTML5 revision 881 (Make |id| attribute with
1465 space characters non-conforming).
1466
1467 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1468
1469 * HTML.pm.src: An error message was incorrect.
1470 HTML5 revision 869 (C1 character references).
1471
1472 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1473
1474 * HTML.pm.src: HTML5 revision 867 (a LF at the beginning of
1475 a |textarea| is removed).
1476
1477 2007-06-05 Wakaba <wakaba@suika.fam.cx>
1478
1479 * NanoDOM.pm (get_attribute_node_ns): New method.
1480
1481 * ContentChecker.pm: |script| |async| and |defer|
1482 no longer require |src|. |async| MUST NOT be
1483 specified if |defer|. (HTML5 revision 858).
1484
1485 2007-05-30 Wakaba <wakaba@suika.fam.cx>
1486
1487 * HTML.pm.src: |<form><form>| went to inifinite loop.
1488
1489 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1490
1491 * ContentChecker.pm (html): Set |is_root| (allowed
1492 as a document element) flag on.
1493 (new): Removed.
1494 (check_document): New method.
1495
1496 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1497
1498 * ContentChecker.pm (thead, tfoot): Checker specifications
1499 were incorrect.
1500
1501 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1502
1503 * ContentChecker.pm ($HTMLURIAttrChecker): Include
1504 error position in the |type| option of the error.
1505
1506 * HTMLTable.pm (form_table): The |$onerror| parameter
1507 is now optional. Some bugs are fixed.
1508
1509 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1510
1511 * HTMLTable.pm: New module.
1512
1513 * ContentChecker.pm (table): Invoke table model error checker.
1514
1515 * NanoDOM.pm (first_child, get_attribute_ns): New.
1516
1517 2007-05-26 Wakaba <wakaba@suika.fam.cx>
1518
1519 * ContentChecker.pm ($HTMLLinkTypesAttrChecker): New checker.
1520 (link/@rel, a/@rel, area/@rel): Use new checker.
1521
1522 * Makefile (_LinkTypeList.pm, RelExtensions.html): New rules.
1523
1524 * _LinkTypeList.pm: New file.
1525
1526 * mklinktypelist.pl: New file.
1527
1528 * .cvsignore: |RelExtensions.html| added.
1529
1530 * NanoDOM.pm (child_nodes): Returns an empty array
1531 for non-child-containing node types.
1532 (text_content): New attribute.
1533
1534 2007-05-26 Wakaba <wakaba@suika.fam.cx>
1535
1536 * IMTChecker.pm: New module.
1537
1538 * ContentChecker.pm ($HTMLIMTAttrChecker): Call IMTChecker
1539 to test parameter value validity.
1540
1541 * HTML.pm.src ($style_start_tag): Attributes were
1542 discarded.
1543
1544 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1545
1546 * ContentChecker.pm ($HTMLURIAttrChecker): Implemented.
1547
1548 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1549
1550 * URIChecker.pm: All recommendations from RFC 3986
1551 and RFC 3987 are listed (not all testable items are checked yet).
1552
1553 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1554
1555 * URIChecker.pm: New module.
1556
1557 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1558
1559 * ContentChecker.pm: Now most attributes are implemented
1560 or associated to some placeholder.
1561 ($ElementDefault): Warn unknown attributes
1562 for unknown elements as "attribute not supported".
1563 ($HTMLLanguageTagAttrChecker, $HTMLMQAttrChecker): New placeholders.
1564 ($HTMLUsemapAttrChecker, $HTMLTargetAttrChecker): New checkers.
1565 (|a| attribute checker): Reimplemented.
1566
1567 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1568
1569 * ContentChecker.pm ($HTMLEventHandlerAttrChecker): New placeholder.
1570 ($HTMLAttrChecker): Event handler content attributes
1571 are added.
1572 (link, embed): Required attribute is now checked.
1573 (embed): Unknown local attributes are no longer warned.
1574
1575 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1576
1577 * ContentChecker.pm ($HTMLSpaceURIsAttrChecker): New placeholder.
1578 ($HTMLIMTAttrChecker): New checker.
1579 (link@rel, link@href, link@type, style@type,
1580 a@href, a@ping, a@ping, a@type, embed@src, embed@type,
1581 object@data, object@type, source@src, source@type, area@alt,
1582 area@shape, area@coords, area@href,
1583 area@ping, area@rel, area@type, script@src,
1584 script@defer, script@async, script@type): Checkers added.
1585
1586 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1587
1588 * ContentChecker.pm: Descendant checking was incorrect.
1589
1590 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1591
1592 * ContentChecker.pm: Support |xml:*| and |xmlns:*|
1593 attributes. Report an error if |Element.prefix|
1594 is |xmlns|.
1595
1596 * NanoDOM.pm (prefix): New attribute.
1597
1598 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1599
1600 * HTML.pm.src: In |main| phase, |in body| insertion
1601 mode, action for |<iframe>| was missing.
1602
1603 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1604
1605 * ContentChecker.pm: Support for many of HTML5 elements.
1606 ($GetHTMLNonNegativeIntegerAttrChecker): New.
1607
1608 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1609
1610 * ContentChecker.pm: Support for most elements up to |progress|.
1611 ($HTMLURIAttrChecker): Placeholder.
1612 ($HTMLIntegerAttrChecker, $GetHTMLFloatingPointNumberAttrChecker):
1613 New.
1614
1615 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1616
1617 * ContentChecker.pm: Attribute checkers
1618 for global attributes, |html|, |base|, |style|, and |meta|.
1619
1620 * NanoDOM.pm (insert_before): Weaken reference
1621 to the parent node.
1622 (Attr::new): Set |owner_element| attribute.
1623 (namespace_uri, manakai_local_name): New attribute implementations.
1624 (owner_element): New attribute.
1625
1626 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1627
1628 * ContentChecker.pm ($AttrChecker, $HTMLAttrChecker,
1629 $AnyChecker->{attr_checker}, $HTMLAttrsChecker,
1630 $Element->{$HTML_NS}->{''}): New.
1631 (check_element): Invoke attrs_checker for each element.
1632
1633 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1634
1635 * ContentChecker.pm: Don't use |manakai_element_type_match|.
1636
1637 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1638
1639 * ContentChecker.pm: Use hashs rather than lists for
1640 element type testings.
1641
1642 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1643
1644 * ContentChecker.pm: Don't generate duplicate
1645 error when an element type is put in the "minus" list
1646 and the element type is not allowed explicitly in the particular
1647 element content model.
1648 (html:a checker): New checker.
1649 (html:details, html:datagrid): New checkers.
1650 (html:legend): New checker.
1651
1652 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1653
1654 * ContentChecker.pm (html:li checker): Implemented.
1655
1656 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1657
1658 * ContentChecker.pm ($HTMLInlineOrStriclyInlineChecker): New
1659 checker.
1660 (html:dd checker): New checker.
1661 (html:q, html:em, html:strong, html:small,
1662 html:m, html:dfn, html:code, html:samp, html:span): New checkers.
1663
1664 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1665
1666 * ContentChecker.pm ($AnyChecker): Renamed
1667 from |$ElementDefault->{checker}|.
1668 ($ElementDefault->{checker}): Throw an error that
1669 the element type is not supported by the checker.
1670 ($HTMLMetadataElement): |html:base| was missing.
1671 ($HTMLEmptyChecker): Don't throw an error
1672 for inter-element whitespace nodes.
1673 (html:html checker): Errors were not
1674 thrown even if |html:head| and/or |html:body|
1675 children were missing.
1676 (html:head checker): An error was not
1677 thrown if <meta charset> appered after other
1678 elements.
1679
1680 2007-05-05 Wakaba <wakaba@suika.fam.cx>
1681
1682 * ContentChecker.pm: |footer|, |video|, |audio|, |script|,
1683 and |noscript| elements are implemented.
1684 (new): New method.
1685
1686 2007-05-04 Wakaba <wakaba@suika.fam.cx>
1687
1688 * ContentChecker.pm: New module.
1689
1690 2007-05-04 Wakaba <wakaba@suika.fam.cx>
1691
1692 * NanoDOM.pm (manakai_parent_element,
1693 document_element, manakai_local_name,
1694 manakai_element_type_match): New method.
1695
1696 2007-05-03 Wakaba <wakaba@suika.fam.cx>
1697
1698 * HTML.pm.src: Replace decimal and hexadecimal numeric
1699 entities in C1 range using Windows-1252 mapping. Bare LF
1700 did not count as new line for error reporting.
1701
1702 2007-05-02 Wakaba <wakaba@suika.fam.cx>
1703
1704 * NanoDOM.pm (DOMImplementation): New class.
1705 (append_child): Weaken the |parent_node| reference.
1706 (create_element_ns, Element new): Set the |owner_document|
1707 reference.
1708 (implementation): New attribute.
1709 (owner_document, local_name, namespace_uri): New attributes.
1710
1711 * HTML.pm.src (parse_string): Line and column numbers
1712 are now provided to error handler.
1713 (!!!parse-error): Short descriptions are added.
1714 (_construct_tree): Split into three methods; support
1715 for innerHTML mode.
1716 (set_inner_html): New method.
1717
1718 2007-05-01 Wakaba <wakaba@suika.fam.cx>
1719
1720 * NanoDOM.html: Documentation is added.
1721
1722 * HTML.pod, ContentType.html: Documentation is revised.
1723
1724 * .cvsignore: Pod2html temporary files are added.
1725
1726 * Makefile: Make |NanoDOM.html|.
1727
1728 2007-05-01 Wakaba <wakaba@suika.fam.cx>
1729
1730 * HTML.pm.src (parse_string): New method.
1731 (get_inner_html): Renamed from |inner_html|.
1732
1733 * Makefile: A rule for |HTML.html| is added.
1734
1735 * HTML.pod: New documentation.
1736
1737 2007-05-01 Wakaba <wakaba@suika.fam.cx>
1738
1739 * NanoDOM.pm (last_child, previous_sibling): New attributes.
1740 (clone_node): Attribute nodes were not completely copied.
1741
1742 * HTML.pm.src: Many bugs are fixed.
1743
1744 2007-04-30 Wakaba <wakaba@suika.fam.cx>
1745
1746 * HTML.pm.src: Some typos are fixed.
1747
1748 2007-04-30 Wakaba <wakaba@suika.fam.cx>
1749
1750 * mkhtmlparser.pl, Makefile: References to the |HTML-consume-entity.src|
1751 are removed.
1752
1753 * HTML.pm.src: Tokenizer's handling on named entities are rewritten.
1754
1755 * HTML-consume-entity.src: Removed.
1756
1757 2007-04-30 Wakaba <wakaba@suika.fam.cx>
1758
1759 * HTML.pm.src: Tokenizer's handling on hexadecimal
1760 numeric entities are rewritten.
1761
1762 2007-04-30 Wakaba <wakaba@suika.fam.cx>
1763
1764 * HTML.pm.src: Some tokenizer bugs are fixed.
1765
1766 2007-04-30 Wakaba <wakaba@suika.fam.cx>
1767
1768 * HTML.pm.src: The tree construction stage is implemented.
1769
1770 * mkhtmlparser.pl: New macros are added.
1771
1772 2007-04-28 Wakaba <wakaba@suika.fam.cx>
1773
1774 * ContentType.pm: A note on bug in the specification
1775 is removed since it's been now fixed.
1776
1777 * .cvsignore: New file.
1778
1779 2007-04-28 Wakaba <wakaba@suika.fam.cx>
1780
1781 * HTML.pm.src, HTML-consume-entity.src: New files.
1782
1783 * Makefile (HTML.pm): New rule.
1784
1785 * mkhtmlparser.pl: New script.
1786
1787 2007-04-25 Wakaba <wakaba@suika.fam.cx>
1788
1789 * Makefile: New file.
1790
1791 2007-04-24 Wakaba <wakaba@suika.fam.cx>
1792
1793 * ContentType.pm: An error in pod is fixed.
1794
1795 2007-04-24 Wakaba <wakaba@suika.fam.cx>
1796
1797 * ContentType.pm: Documentation is added.
1798
1799 2007-04-24 Wakaba <wakaba@suika.fam.cx>
1800
1801 * ContentType.pm: New Perl module.
1802
1803 * ChangeLog: New file.
1804
1805

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24