/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Contents of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.362 - (show annotations) (download)
Thu Nov 6 06:56:59 2008 UTC (16 years, 8 months ago) by wakaba
Branch: MAIN
Changes since 1.361: +4 -0 lines
++ whatpm/Whatpm/ChangeLog	6 Nov 2008 04:01:55 -0000
2008-11-06  Wakaba  <wakaba@suika.fam.cx>

	* SWML/: New directory.

++ whatpm/Whatpm/SWML/ChangeLog	6 Nov 2008 06:56:18 -0000
2008-11-06  Wakaba  <wakaba@suika.fam.cx>

	* Parser.pm: New file.  Tokenizer impelmented, but it does not
	work yet.

	* ChangeLog: New file.

1 2008-11-06 Wakaba <wakaba@suika.fam.cx>
2
3 * SWML/: New directory.
4
5 2008-10-20 Wakaba <wakaba@suika.fam.cx>
6
7 * NanoDOM.pm (specified, all_declarations_processed,
8 manakai_attribute_type): New attributes.
9
10 2008-10-19 Wakaba <wakaba@suika.fam.cx>
11
12 * NanoDOM.pm (Entity->new): Initialize ->child_nodes as an empty
13 array.
14
15 2008-10-19 Wakaba <wakaba@suika.fam.cx>
16
17 * NanoDOM.pm (notation_name): New attribute.
18
19 2008-10-18 Wakaba <wakaba@suika.fam.cx>
20
21 * NanoDOM.pm (public_id, system_id): New attributes.a
22
23 2008-10-18 Wakaba <wakaba@suika.fam.cx>
24
25 * NanoDOM.pm (text_content): Moved to Node from Element. Setter
26 implemented.
27 (allowed_tokens, default_type, declared_type): Implemented.
28
29 2008-10-17 Wakaba <wakaba@suika.fam.cx>
30
31 * NanoDOM.pm (node_name): New attribute.
32 (ELEMENT_TYPE_DEFINITION_NODE, ATTRIBUTE_DEFINITION_NODE): New
33 constants.
34 (create_element_type_definition_node, create_attribute_definition,
35 create_notation, create_general_entity,
36 get_element_type_definition_node,
37 set_element_type_definition_node, get_general_entity_node,
38 set_general_entity_node, get_notation_node, set_notation_node,
39 get_attribute_definition_node, set_attribute_definition_node): New
40 methods.
41 (element_types, entities, notations, attribute_definitions): New
42 attributes.
43 (DocumentType): Support for child nodes, entities, notations, and
44 element types.
45 (Entity, Notation, ElementTypeDefinition, AttributeDefinition):
46 New classes.
47
48 * Dumper.pm: Support for general entities, notations, element type
49 definitions, and attribute definitions.
50
51 2008-10-15 Wakaba <wakaba@suika.fam.cx>
52
53 * NanoDOM.pm (create_processing_instruction): New method.
54 (xml_version, xml_encoding, xml_standalone): New attributes.
55 (ProcessingInstruction): New class.
56
57 2008-10-14 Wakaba <wakaba@suika.fam.cx>
58
59 * HTML.pm.src: Handling of end tags in the foreign content
60 insertion mode was partially wrong, because of wrong bit
61 operations.
62
63 2008-10-14 Wakaba <wakaba@suika.fam.cx>
64
65 * NanoDOM.pm (dom_config): New attribute (do nothing), for
66 Whatpm::XML::Parser support.
67
68 2008-10-14 Wakaba <wakaba@suika.fam.cx>
69
70 * Makefile: New rule to make HTML/Tokenizer.pm is added.
71
72 * HTML.pm.src: Tokenizer part moved to another file.
73
74 2008-10-13 Wakaba <wakaba@suika.fam.cx>
75
76 * HTML.pm.src: Merge |DT_EL| and |DD_EL| as |DTDD_EL|.
77
78 2008-10-13 Wakaba <wakaba@suika.fam.cx>
79
80 * HTML.pm.src: Element category constants redefined.
81
82 2008-10-13 Wakaba <wakaba@suika.fam.cx>
83
84 * HTML.pm.src: Steps for CDATA/RCDATA elements in tree
85 construction stage synced with the spec (HTML5 revisions 2139 and
86 2302).
87
88 2008-10-07 Wakaba <wakaba@suika.fam.cx>
89
90 * ContentChecker.pm: New error level "html5_fact" added, which
91 should be tentatively used until all of requirements are properly
92 specced as RFC 2119 "MUST" in HTML5.
93
94 2008-10-05 Wakaba <wakaba@suika.fam.cx>
95
96 * ContentChecker.pod: Note on internal flags is added.
97
98 2008-10-05 Wakaba <wakaba@suika.fam.cx>
99
100 * HTML.pm.src: An AAA bug fixed.
101
102 2008-10-04 Wakaba <wakaba@suika.fam.cx>
103
104 * HTML.pm.src: If another node is inserted by the parser, don't
105 reuse existing Text node to append a character (HTML5 revision
106 2124).
107
108 2008-10-04 Wakaba <wakaba@suika.fam.cx>
109
110 * HTML.pm.src: Support for <option> and <optgroup> in body (HTML5
111 revisions 1731 and 2128).
112
113 2008-10-04 Wakaba <wakaba@suika.fam.cx>
114
115 * HTML.pm.src: Make <foreignObject> scoping (HTML5 revision 1837).
116 Support for end tags of camelCase SVG elements were broken. A
117 wrong error type text fixed.
118
119 2008-10-04 Wakaba <wakaba@suika.fam.cx>
120
121 * HTML.pm.src: Drop redundant code (HTML5 revision 1731).
122
123 2008-10-04 Wakaba <wakaba@suika.fam.cx>
124
125 * HTML.pm.src: Support for new definition of |param| and |source|
126 start tag parsing (HTML5 revision 1731).
127
128 2008-10-04 Wakaba <wakaba@suika.fam.cx>
129
130 * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
131
132 2008-10-04 Wakaba <wakaba@suika.fam.cx>
133
134 * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
135 revisions 1731 and 1831).
136
137 2008-10-04 Wakaba <wakaba@suika.fam.cx>
138
139 * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
140 revisions 1731 and 1778). Support for the </sarcasm> end tag (HTML5
141 revision 1731).
142
143 2008-10-04 Wakaba <wakaba@suika.fam.cx>
144
145 * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
146 revision 1731). End tags of |option| and |optgroup| elements are
147 now optional (HTML5 revision 1731).
148
149 2008-10-04 Wakaba <wakaba@suika.fam.cx>
150
151 * HTML.pm.src: New "special" elements added to the list (HTML5
152 revision 1778). "strile" -> "strike".
153
154 2008-10-02 Wakaba <wakaba@suika.fam.cx>
155
156 * ContentType.pm (get_sniffed_type): Support for the "better"
157 content sniffing (HTML5 revision 1927). In a case the official
158 type was not returned when the method is invoked in the list
159 context.
160
161 2008-09-22 Wakaba <wakaba@suika.fam.cx>
162
163 * HTML.pm.src: Character references for non-space C0 characters,
164 including U+000B VT, DEL character, noncharacter code points, are
165 now converted to the U+FFFD character (cf. HTML5 revision 2138).
166
167 2008-09-21 Wakaba <wakaba@suika.fam.cx>
168
169 * ContentChecker.pm: |form=""| check support added.
170
171 2008-09-21 Wakaba <wakaba@suika.fam.cx>
172
173 * ContentChecker.pm: |contextmenu| validness is now checked using
174 |id| and |id_type| properties, and |menu| property is removed.
175
176 2008-09-21 Wakaba <wakaba@suika.fam.cx>
177
178 * ContentChecker.pm: Prepare for |form| |name| attribute's
179 duplication checking.
180
181 2008-09-21 Wakaba <wakaba@suika.fam.cx>
182
183 * HTML.pm.src (parse_byte_stream): Support (or non-support) for
184 unsupported charset="" parameter value (HTML5 revision 2131).
185
186 2008-09-20 Wakaba <wakaba@suika.fam.cx>
187
188 * HTML.pm.src: Reminding places where U+000B is allowed as a space
189 character is fixed (cf. HTML5 revision 1738).
190
191 * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
192 space characters (HTML5 revision 1738).
193
194 2008-09-20 Wakaba <wakaba@suika.fam.cx>
195
196 * HTML.pm.src: The "anything else" case for the "after after body"
197 insertion mode was not updated to swtich to the "in body"
198 insertion mode. U+000B is no longer a space character for the
199 purpose of tree construction phase (HTML5 revision 1738).
200
201 2008-09-20 Wakaba <wakaba@suika.fam.cx>
202
203 * HTML.pm.src: U+000B is no longer a space character (HTML5
204 revision 1738).
205
206 2008-09-20 Wakaba <wakaba@suika.fam.cx>
207
208 * ContentType.pm: 0x0B is no longer a space character (HTML5
209 revision 1738).
210
211 * HTML.pm.src: U+000B is no longer a space character for the
212 algorithm for extracting an encoding from a Content-Type (HTML5
213 revision 1738).
214
215 2008-09-20 Wakaba <wakaba@suika.fam.cx>
216
217 * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
218
219 2008-09-18 Wakaba <wakaba@suika.fam.cx>
220
221 * LangTag.pm: Add checks for remaining requirements from RFC 4646.
222
223 * mklangreg.pl: Sort 'Prefix' values by their length, to ease
224 matching.
225
226 2008-09-18 Wakaba <wakaba@suika.fam.cx>
227
228 * LangTag.pm: Warn for private use language subtags. Error level
229 typos fixed. Support for Suppress-Script field.
230
231 * mklangreg.pl: Support for dumping of nested structure.
232
233 2008-09-18 Wakaba <wakaba@suika.fam.cx>
234
235 * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
236 recommended case as per RFC 4646.
237
238 2008-09-18 Wakaba <wakaba@suika.fam.cx>
239
240 * LangTag.pm (check_rfc4646_langtag): New method.
241
242 2008-09-18 Wakaba <wakaba@suika.fam.cx>
243
244 * mklangreg.pl: New script.
245
246 * Makefile: Updated for creation of the module for language subtag
247 registry.
248
249 2008-09-16 Wakaba <wakaba@suika.fam.cx>
250
251 * Makefile: WebIDL.html added.
252
253 * WebIDL.pod: New documentation.
254
255 2008-09-16 Wakaba <wakaba@suika.fam.cx>
256
257 * WebIDL.pm: Checker's error types are redefined.
258
259 2008-09-16 Wakaba <wakaba@suika.fam.cx>
260
261 * WebIDL.pm: Parser's error types are redefined. Some forward
262 compatible parsing bugs are fixed. Some unreachable codes are
263 commented out.
264
265 2008-09-16 Wakaba <wakaba@suika.fam.cx>
266
267 * WebIDL.pm: Support for the reminding extended attributes are
268 added. It does not satisfy the definition that a forward
269 interface declaration has an extended attribute. It seems that
270 unless explicitly allowed multiple extended attributes with the
271 same name is not allowed, though it is not explicitly mentioned in
272 the spec.
273
274 2008-09-16 Wakaba <wakaba@suika.fam.cx>
275
276 * WebIDL.pm: Unescapes extended attribute names and extended
277 attribute identifiers. Preserve whether an extended attribute has
278 an argument list of not. Support for extended attributes:
279 Constructor, ExceptionConsts, IndexGetter, IndexSetter,
280 NameGetter, NameSetter, and Null.
281 (has_argument_list): New attribute.
282 (idl_text): Stringifies argument lists, if any, even if it is
283 empty.
284
285 2008-09-15 Wakaba <wakaba@suika.fam.cx>
286
287 * HTML.pm.src: New state |PCDATA_STATE|. Use an empty string for
288 |{s_kwd}| in DATA_STATE as default.
289
290 2008-09-15 Wakaba <wakaba@suika.fam.cx>
291
292 * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
293 by |{s_kwd}| in DATA_STATE.
294
295 2008-09-15 Wakaba <wakaba@suika.fam.cx>
296
297 * HTML.pm.src: Shorten keys.
298
299 2008-09-15 Wakaba <wakaba@suika.fam.cx>
300
301 * HTML.pm.src: Remove checking for control character, surrogate
302 pair, or noncharacter code points and non-Unicode code
303 points (they should be handled by Whatpm::Charset::UnicodeChecker).
304 (parse_char_stream): Support for the |$get_wrapper| argument and
305 character stream error handlers.
306
307 2008-09-15 Wakaba <wakaba@suika.fam.cx>
308
309 * ContentChecker.pm: Don't call |loda_ns_module|
310 for null-namespace elements/attributes.
311
312 * HTML.pm.src: Fact out $disallowed_control_chars
313 as a hash.
314
315 2008-09-14 Wakaba <wakaba@suika.fam.cx>
316
317 * HTML.pm.src: Regexp typo fixed. |{prev_char}|
318 and |{next_char}| initializations are moved to initialization
319 method. |{read_until}| now supports buffering. Sync |set_inner_html|
320 with |parse_char_stream|.
321
322 2008-09-14 Wakaba <wakaba@suika.fam.cx>
323
324 * HTML.pm.src (parse_char_stream): Make |set_next_char|
325 invoke |manakai_read_until|, not only |read|, where
326 possible, to decrease the number of |read| method calls.
327
328 * mkhtmlparser.pl: Related changes to the aforementioned
329 modification.
330
331 2008-09-14 Wakaba <wakaba@suika.fam.cx>
332
333 * HTML.pm.src: Use |read| instead of |getc|. |set_inner_html|
334 would report character error from now.
335
336 2008-09-14 Wakaba <wakaba@suika.fam.cx>
337
338 * HTML.pm.src: White-space-leaded non-white-space character
339 tokens in "before head insertion mode" was not
340 correctly handled.
341 (set_inner_html): Reimplemented using CharString decodehandle
342 class. Support for $get_wrapper argument. Support
343 for |{read_until}| feature.
344
345 2008-09-14 Wakaba <wakaba@suika.fam.cx>
346
347 * HTML.pm.src: Make a "bare ero" error for unknown
348 entities point the "&" character.
349
350 2008-09-14 Wakaba <wakaba@suika.fam.cx>
351
352 * HTML.pm.src: It turns out that U+FFFD don't have to
353 be added to the list of excluded characters.
354
355 2008-09-14 Wakaba <wakaba@suika.fam.cx>
356
357 * HTML.pm.src ($char_onerror): Have character decoder's |line|
358 and |column| a higher priority than the one set by the
359 tokenizer's input handler.
360 ($self->{read_until}): Exclude U+FFFD (but this might
361 not be necessary, since now we do line/column fixup in
362 the character decode handle).
363
364 2008-09-14 Wakaba <wakaba@suika.fam.cx>
365
366 * HTML.pm.src: Use |{read_until}| where possible.
367
368 2008-09-14 Wakaba <wakaba@suika.fam.cx>
369
370 * HTML.pm.src: Change |{getc_until}| to |{read_until}|
371 and |manakai_getc_until| to |manakai_read_until| to
372 reduce the number of string copies.
373
374 2008-09-14 Wakaba <wakaba@suika.fam.cx>
375
376 * HTML.pm.src (parse_char_string): Use newly created
377 |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
378 standard feature to |open| a string as a filehandle,
379 since Perl's string filehandle seems not supporting |ungetc|
380 method correctly.
381 (parse_char_stream): Define |{getc_until}| method.
382 (DATA_STATE): Experimental support for |getc_until| feature.
383
384 2008-09-13 Wakaba <wakaba@suika.fam.cx>
385
386 * HTML.pm.src: Check points added to newly added branches.
387
388 2008-09-13 Wakaba <wakaba@suika.fam.cx>
389
390 * HTML.pm.src: Remove |{char}|, which is no longer used.
391 Remove |{entity_in_attr}| and |{last_attribute_value_state}|
392 and replaced by |{prev_state}|.
393
394 * mkhtmlparser.pl: Remove |{char}| feature.
395 Remove |!!!back-next-input-character;| macro.
396
397 2008-09-13 Wakaba <wakaba@suika.fam.cx>
398
399 * HTML.pm.src: Finally we get rid of all the inner loops. Remove
400 entity related tokenizer states in favor of new states
401 implementing the consume character reference algorithm.
402
403 2008-09-13 Wakaba <wakaba@suika.fam.cx>
404
405 * HTML.pm.src: "Consume a character reference" algorithm is
406 now implemented as a tokenizer's state, rather than
407 a method, with minimum changes (more changes will
408 be made, in due course). "Bogus comment state"'s inner
409 loop gets removed.
410
411 2008-09-13 Wakaba <wakaba@suika.fam.cx>
412
413 * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
414 into their own tokenizer states.
415
416 2008-09-13 Wakaba <wakaba@suika.fam.cx>
417
418 * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
419 is split into three states.
420
421 2008-09-13 Wakaba <wakaba@suika.fam.cx>
422
423 * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
424 itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
425 no longer does the tokenizer have to push back next input
426 characters in those states.
427
428 2008-09-13 Wakaba <wakaba@suika.fam.cx>
429
430 * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
431 into four states so that no longer does the tokenizer have to push
432 back next input characters in that state.
433
434 2008-09-11 Wakaba <wakaba@suika.fam.cx>
435
436 * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
437 which can be used to insert some wrapper between the character
438 stream handle and the tokenizer. (It is currently not supported
439 for |set_inner_html| for |Element|s).
440
441 2008-09-10 Wakaba <wakaba@suika.fam.cx>
442
443 * HTML.pm.src: Ignore punctuations in charset names.
444
445 2008-09-10 Wakaba <wakaba@suika.fam.cx>
446
447 * ContentChecker.pm: Support for charset-layer error levels.
448
449 * HTML.pm.src: Don't specify |text| argument for the
450 |chardecode:fallback| error, since it is not the encoding
451 being used alternatively.
452
453 2008-09-06 Wakaba <wakaba@suika.fam.cx>
454
455 * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
456
457 2008-08-31 Wakaba <wakaba@suika.fam.cx>
458
459 * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
460
461 2008-08-31 Wakaba <wakaba@suika.fam.cx>
462
463 * HTML.pm.src: Bug fix and sync with the spec with regard
464 to after after frameset insertion mode processing (HTML5
465 revision 1909). Note that the implementation was wrong
466 per the old spec before the r1909 changes.
467
468 2008-08-30 Wakaba <wakaba@suika.fam.cx>
469
470 * HTMLTable.pm: scope=auto algorithm fix synced with the
471 spec (HTML5 revision 2093).
472 ($process_row): Algorithm step numbers synced with the
473 spec (HTML5 revision 2092).
474
475 2008-08-30 Wakaba <wakaba@suika.fam.cx>
476
477 * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
478 revision 2094).
479
480 2008-08-30 Wakaba <wakaba@suika.fam.cx>
481
482 * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
483
484 2008-08-30 Wakaba <wakaba@suika.fam.cx>
485
486 * HTML.pm.src: '"' and "'" at the end of attribute
487 name (after another attribute) now raise parse error (HTML5
488 revision 2123). Empty unquoted attribute values are no
489 longer allowed (HTML5 revision 2122).
490
491 2008-08-30 Wakaba <wakaba@suika.fam.cx>
492
493 * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
494 revision 2130).
495
496 2008-08-30 Wakaba <wakaba@suika.fam.cx>
497
498 * ContentChecker.pm: |xml:lang| attribute value must be same
499 as |lang| attribute value for HTML elements (HTML5 revision 2062
500 and so on).
501
502 2008-08-30 Wakaba <wakaba@suika.fam.cx>
503
504 * ContentChecker.pm: Error level definition for |xml_id_error|
505 was missing.
506
507 * URIChecker.pm: The end of the URL should be marked as the
508 error location for an empty path error. The position
509 between the userinfo and the port components should be
510 marked as the error location for an empty host error.
511
512 2008-08-30 Wakaba <wakaba@suika.fam.cx>
513
514 * URIChecker.pm: Set parameters representing where in the
515 value the error occurs for errors. Report unknown
516 address format error in warning level, since address
517 formats are rarely added. Path segments starting with "/.."
518 were misinterpreted as a dot-segment.
519
520 2008-08-30 Wakaba <wakaba@suika.fam.cx>
521
522 * URIChecker.pm (check_iri_reference): Requires
523 |Message::DOM::DOMImplementation|.
524
525 2008-08-29 Wakaba <wakaba@suika.fam.cx>
526
527 * IMTChecker.pm: Updated for the new error reporting architecture.
528
529 * ContentChecker.pm: Error levels for IMTs are added.
530
531 2008-08-17 Wakaba <wakaba@suika.fam.cx>
532
533 * H2H.pm (_shift_token): Support for unquoted HTML attribute
534 values.
535
536 2008-08-16 Wakaba <wakaba@suika.fam.cx>
537
538 * CacheManifest.pm: Support for new style of error
539 reports.
540
541 * HTML.pm.src: Set line=1, column=1 to the document node.
542
543 2008-08-16 Wakaba <wakaba@suika.fam.cx>
544
545 * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
546 and URL checkers. Support for more error levels for bogus
547 langauge tag and URL "standards".
548
549 * LangTag.pm, URIChecker.pm: Support for new style error
550 level reporting.
551
552 2008-08-15 Wakaba <wakaba@suika.fam.cx>
553
554 * ContentChecker.pm: Support for RDF/XML error levels.
555
556 * HTMLTable.pm, RDFXML.pm: Support for new style of error level
557 specifying. Error types are revised.
558
559 2008-08-15 Wakaba <wakaba@suika.fam.cx>
560
561 * ContentChecker.pm: All error reporting method calls are
562 renewed.
563
564 2008-08-15 Wakaba <wakaba@suika.fam.cx>
565
566 * HTML.pm.src: All error type names and "text" parameters
567 are revised. Use new style for "level" specification.
568
569 * mkhtmlparser.pl: Use new style for "level" specification.
570
571 2008-08-03 Wakaba <wakaba@suika.fam.cx>
572
573 * WebIDL.pm (parse_char_string): Simplified error
574 reporting process for broken ignored valuetype definition.
575 (Valuetype idl_text): Support for special "DOMString" name.
576
577 2008-08-03 Wakaba <wakaba@suika.fam.cx>
578
579 * WebIDL.pm ($get_scoped_name): Append "::::" if the last
580 terminal of the ScopedName is "DOMString", such that whether
581 the last part of the scoped name is "DOMString" or "_DOMString"
582 later. It is necessary to determine whether a |typedef|
583 definition should be ignored or not.
584 (parse_char_string): Unescape the identifier of
585 exception members.
586 ($resolve): Return undef for builtin types and sequence<T>
587 types (we might not have to do this, however...).
588 (check): Support checking for Exceptions, Valuetypes,
589 and Typedefs.
590 ($serialize_type): Support for "DOMString::::" syntax.
591 (Typedef idl_text): Output Type as "DOMString" if it
592 is really "DOMString" (i.e. its internal representation
593 is "::DOMString::").
594
595 2008-08-03 Wakaba <wakaba@suika.fam.cx>
596
597 * WebIDL.pm ($resolve): New code, based on resolve code
598 for constant types in the |check| method.
599 (check): Support for checking of attributes, operations, and
600 arguments.
601 (Attribute/Operation idl_text): Exception names in getraises,
602 setraises, and raises clauses is serizlied by |$serialize_type|
603 code.
604
605 2008-08-02 Wakaba <wakaba@suika.fam.cx>
606
607 * WebIDL.pm ($integer): Order of selections are changed to match
608 hexadecimal numbers (the original pattern, taken from the spec,
609 was not work for hexadecimal numbers, because the "0" prefix
610 matches to the [0-7]* part (as an empty string) and therefore
611 it does not match with remaining "x..." part of a "0x..." integer
612 literal.
613 ($get_type): It now returns a string, not an array reference,
614 for regular types and |sequence| types (i.e. it in any case
615 returns a string).
616 ($get_next_token): The second item in the array that represents
617 a integer or float token is now a Perl number value, not the
618 original string representation of the number.
619 (check): Support for const value consistency checking.
620 No extended attribute is defined for constants.
621 (Node subclasses): Use simple strings rather than array references
622 for default data type values.
623 ($serialize_type): Type values are now simple strings.
624 (value): If the new attribute value is a false value, then
625 a FALSE value is set to the attribute.
626
627 2008-08-02 Wakaba <wakaba@suika.fam.cx>
628
629 * WebIDL.pm ($get_scoped_name): Now scoped names are stored
630 in its stringified format ("scoped name" as defined in the
631 spec). Note that future version of this module should not use
632 array references for type values and the |type_text| attribute
633 should be made obsolete.
634 (parse_char_string): Unescape attribute names.
635 (check): Support for checking of whether inherited interfaces
636 are actually defined or not. Support for checking of whether
637 interface member identifiers are duplicated or not.
638 ($serialize_type): Scoped names are returned as is. A future
639 version of this code should escape identifiers other than "DOMString",
640 otherwise the idl_text would be non-conforming.
641
642 2008-08-02 Wakaba <wakaba@suika.fam.cx>
643
644 * WebIDL.pm (parse_char_string): Set line/column numbers
645 to generated nodes. Unescape identifiers. Extended attributes
646 for Definition's were ignored.
647 (append_child): Set |parent_node| attribute.
648 (parent_node): New attribute.
649 (check): Support interface/exception members. Support
650 extended attributes. Support definition identifier uniqueness
651 constraint.
652 (qualified_name): New attribute.
653 (Interface/Exception idl_text): Extended attributes were
654 not prepended to the returned text.
655
656 2008-08-02 Wakaba <wakaba@suika.fam.cx>
657
658 * WebIDL.pm (parse_char_string): Set line/column numbers
659 to interface object experimentally. s/shift/pop/g, shift
660 would make things wrong. Support for interface forward
661 declarations was missing. Broken interface declarations
662 with no block were not ignored entirely.
663 (Whatpm::WebIDL::Node): New abstract class. This class
664 makes things easier.
665 (child_nodes): New attribute. Unlike DOM's attribute with
666 same name, this attribute returns a dead list of nodes for
667 simplicity.
668 (get_user_data, set_user_data): New methods.
669 (Module idl_text): A SPACE character should be inserted
670 before the |{| character.
671 (Interface idl_text): Support for interface forward declarations.
672 (is_forward_declaration): New attribute.
673
674 2008-07-19 Wakaba <wakaba@suika.fam.cx>
675
676 * WebIDL.pm (type_text): Better serializer.
677
678 2008-07-19 Wakaba <wakaba@suika.fam.cx>
679
680 * WebIDL.pm: Revise forward-compatible parsing so that
681 it now can handle broken extended attributes and as such.
682
683 2008-07-19 Wakaba <wakaba@suika.fam.cx>
684
685 * WebIDL.pm: Real support for extended attributes.
686 Support for extended attributes with arguments.
687
688 2008-07-19 Wakaba <wakaba@suika.fam.cx>
689
690 * WebIDL.pm: Support for |exception| syntax.
691 (Interface->idl_text): Tentative support for inheritances.
692
693 2008-07-19 Wakaba <wakaba@suika.fam.cx>
694
695 * WebIDL.pm: Hierarchical scoped name support was broken.
696 Support for raises, setraises, and getraises syntaxes.
697
698 2008-07-18 Wakaba <wakaba@suika.fam.cx>
699
700 * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
701 proper support for types, extended attributes, and exceptions yet).
702 WebIDL parser, version 1 (no support for exceptions yet,
703 no proper support for extended attributes yet).
704
705 2008-07-09 Wakaba <wakaba@suika.fam.cx>
706
707 * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
708
709 2008-06-29 Wakaba <wakaba@suika.fam.cx>
710
711 * WebIDL.pm: Support for valuetype and const.
712
713 2008-06-29 Wakaba <wakaba@suika.fam.cx>
714
715 * WebIDL.pm: New module.
716
717 2008-06-15 Wakaba <wakaba@suika.fam.cx>
718
719 * Makefile (Entities.html): URI changed.
720
721 2008-06-08 Wakaba <wakaba@suika.fam.cx>
722
723 * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
724
725 2008-06-01 Wakaba <wakaba@suika.fam.cx>
726
727 * HTML.pm.src (_get_next_token): A parse error was missing.
728
729 2008-06-01 Wakaba <wakaba@suika.fam.cx>
730
731 * mklinktypelist.pl: rel=contact is no longer part of the HTML5
732 spec (commented out). (HTML5 revision 1711).
733
734 2008-05-25 Wakaba <wakaba@suika.fam.cx>
735
736 * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
737
738 * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
739 as UTF-16 (HTML5 revision 1701).
740
741 2008-05-25 Wakaba <wakaba@suika.fam.cx>
742
743 * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
744 1692).
745
746 2008-05-25 Wakaba <wakaba@suika.fam.cx>
747
748 * HTML.pm.src: The secondary insertion mode used when switching
749 to foreign content is the "in body" insertion mode (HTML5 revision
750 1696).
751
752 2008-05-25 Wakaba <wakaba@suika.fam.cx>
753
754 * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
755 revision 1697).
756
757 2008-05-24 Wakaba <wakaba@suika.fam.cx>
758
759 * HTML.pm.src: Support for end-of-file token in foreign content
760 insertion mode (HTML5 revision 1693). Update SVG camelCase
761 attribute list (HTML5 revision 1700). <textarea> closes
762 </select> (HTML5 revision 1699). More start tags close in
763 foreign content insertion mode (HTML5 revision 1698).
764
765 2008-05-24 Wakaba <wakaba@suika.fam.cx>
766
767 * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
768
769 2008-05-24 Wakaba <wakaba@suika.fam.cx>
770
771 * HTML.pm.src: More robust charset parameter detection (HTML5
772 revision 1674).
773
774 2008-05-24 Wakaba <wakaba@suika.fam.cx>
775
776 * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
777 revision 1676).
778
779 2008-05-24 Wakaba <wakaba@suika.fam.cx>
780
781 * HTML.pm.src: Ignore language part of public identifiers for
782 quriks mode detection (HTML5 revision 1679).
783
784 2008-05-24 Wakaba <wakaba@suika.fam.cx>
785
786 * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
787 revision 1685).
788
789 2008-05-24 Wakaba <wakaba@suika.fam.cx>
790
791 * HTML.pm.src: Support for EOF in new states for tags (HTML5
792 revision 1684).
793
794 2008-05-24 Wakaba <wakaba@suika.fam.cx>
795
796 * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
797 work (HTML5 revision 1690).
798
799 2008-05-24 Wakaba <wakaba@suika.fam.cx>
800
801 * HTML.pm.src (_tree_construction_main): Change handling of
802 end tags in head insertion modes (HTML5 revision 1686).
803 (parse_char_string): Bug fix for non-utf8 character string handlings.
804 (parse_char_stream): |ungetc| does not work well for this context.
805
806 2008-05-18 Wakaba <wakaba@suika.fam.cx>
807
808 * HTML.pm.src (parse_byte_string): Redefined to invoke
809 |parse_byte_stream|.
810 (parse_byte_stream): New method.
811
812 2008-05-18 Wakaba <wakaba@suika.fam.cx>
813
814 * HTML.pm.src (parse_byte_string): Fix the column number reported
815 by encoding layer error reporter.
816
817 2008-05-17 Wakaba <wakaba@suika.fam.cx>
818
819 * HTML.pm.src (parse_byte_string): Use streaming decoder
820 rather than converting the whole byte string and then parsing.
821 Propagate errors in character encoding layer.
822 (get_next_token): Precise error reporting for |bare stago| error.
823
824 2008-05-17 Wakaba <wakaba@suika.fam.cx>
825
826 * HTML.pm.src (parse_char_stream): New method.
827 (parse_char_string): This method is now defined as an invocation
828 of the |parse_char_stream| method.
829
830 2008-05-17 Wakaba <wakaba@suika.fam.cx>
831
832 * HTML.pm.src (parse_byte_string): Report various status
833 of the sniffing as info-level errors. Support for new
834 decoding framework in parser resestting.
835 (new): Various default error levels were not set.
836
837 2008-05-17 Wakaba <wakaba@suika.fam.cx>
838
839 * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
840 algorithm, except for the actual sniffing, is implemented
841 with new framework with Message::Charset::Info.
842
843 2008-05-16 Wakaba <wakaba@suika.fam.cx>
844
845 * CacheManifest.pm (_parse): Drop fragment identifiers from
846 URIs in fallback section (HTML5 revision 1596).
847
848 2008-05-10 Wakaba <wakaba@suika.fam.cx>
849
850 * Makefile (Entities.html): URI has changed.
851
852 2008-05-10 Wakaba <wakaba@suika.fam.cx>
853
854 * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
855 1553).
856
857 2008-05-06 Wakaba <wakaba@suika.fam.cx>
858
859 * ContentChecker.pm: Noted that those returned in |table| are
860 no longer table elements, but table objects returned
861 by Whatpm::HTMLTable.
862
863 * HTMLTable.pm (form_table): Return table element node
864 as |$table->{element}|.
865 (assign_header): Support for the |headers=""| attribute.
866
867 2008-05-06 Wakaba <wakaba@suika.fam.cx>
868
869 * HTMLTable.pm (assign_header): New function; first version
870 with no support for headers="".
871 (form_table): Include table width and height to the returned
872 table object for covenience. Indexing in column assignement was wrong.
873 Set whether a data cell is empty or not for convenience.
874
875 2008-05-05 Wakaba <wakaba@suika.fam.cx>
876
877 * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
878
879 2008-05-05 Wakaba <wakaba@suika.fam.cx>
880
881 * HTMLTable.pm: How table model errors are detected is
882 changed (HTML5 revision 1387).
883
884 2008-05-05 Wakaba <wakaba@suika.fam.cx>
885
886 * HTMLTable.pm: The algorithm now moves |tfoot| elements
887 to the end of the table (HTML5 revision 1380).
888
889 2008-05-05 Wakaba <wakaba@suika.fam.cx>
890
891 * HTMLTable.pm: The algorithm is now 0-based indexing, instead
892 of 1-based (HTML5 revision 1376).
893
894 2008-05-05 Wakaba <wakaba@suika.fam.cx>
895
896 * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
897 and "Content-Encoding" no longer prevent sniffing (HTML5
898 revision 1288).
899
900 2008-05-05 Wakaba <wakaba@suika.fam.cx>
901
902 * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
903 1282).
904
905 2008-05-03 Wakaba <wakaba@suika.fam.cx>
906
907 * ContentChecker.pm: Support for global attributes.
908 Status of XML specs are added.
909
910 2008-05-03 Wakaba <wakaba@suika.fam.cx>
911
912 * ContentChecker.pm (check_element): Support for |template=""|
913 and |ref=""| attribute (referent element type checking).
914
915 2008-04-29 Wakaba <wakaba@suika.fam.cx>
916
917 * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
918 1500) is implemented (except for IDNA part and URI-scheme-specific
919 knowledge). Line number counting was wrong for LF-only
920 documents.
921
922 2008-04-13 Wakaba <wakaba@suika.fam.cx>
923
924 * HTML.pm.src: Raise an parse error for any disallowed
925 character (HTML5 revision 1263).
926
927 2008-04-13 Wakaba <wakaba@suika.fam.cx>
928
929 * mkentitylist.pl: Support for new HTML5 entity table format (the
930 definition for |AElig;| was missing).
931
932 2008-04-13 Wakaba <wakaba@suika.fam.cx>
933
934 * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
935 name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
936
937 2008-04-13 Wakaba <wakaba@suika.fam.cx>
938
939 * HTML.pm.src: List of element names that close foreign content
940 insertion mode is added (HTML5 revisions 1412 and 1418).
941
942 2008-04-13 Wakaba <wakaba@suika.fam.cx>
943
944 * HTML.pm.src: Support for |mglyph| and |malignmark|
945 elements (HTML5 revision 1410).
946
947 2008-04-13 Wakaba <wakaba@suika.fam.cx>
948
949 * HTML.pm.src: Support for new long MathML entities (HTML5
950 revision 1406).
951
952 2008-04-13 Wakaba <wakaba@suika.fam.cx>
953
954 * HTML.pm.src: CDATA section support for MathML and SVG
955 elements (HTML5 revision 1404 and 1420).
956
957 2008-04-12 Wakaba <wakaba@suika.fam.cx>
958
959 * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
960 elements (HTML5 revision 1404). Unused !!!macro definitions
961 are removed.
962
963 2008-04-12 Wakaba <wakaba@suika.fam.cx>
964
965 * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
966 are raised is changed (HTML5 revision 1404).
967
968 2008-04-06 Wakaba <wakaba@suika.fam.cx>
969
970 * HTML.pm.src: Category sets that are no longer used are removed.
971
972 2008-04-06 Wakaba <wakaba@suika.fam.cx>
973
974 * HTML.pm.src: The ->[1] property of stack entries are now
975 replaced by constants representing element category.
976
977 2008-04-06 Wakaba <wakaba@suika.fam.cx>
978
979 * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
980 for error reporting. (This is a preparation for using constant
981 value for ->[1].)
982
983 2008-03-22 Wakaba <wakaba@suika.fam.cx>
984
985 * RDFXML.pm: Typo fixed.
986
987 2008-03-22 Wakaba <wakaba@suika.fam.cx>
988
989 * ContentChecker.pm: |fact_level| is now treated
990 as same as |must_level|, i.e. level = |m|.
991 (check_element): Make list of URIs in the DOM.
992
993 2008-03-21 Wakaba <wakaba@suika.fam.cx>
994
995 * RDFXML.pm: Language accessor implemented.
996 Local (null-namespace) attribute support.
997
998 2008-03-21 Wakaba <wakaba@suika.fam.cx>
999
1000 * RDFXML.pm: Factored out ID checking code.
1001
1002 2008-03-21 Wakaba <wakaba@suika.fam.cx>
1003
1004 * RDFXML.pm: TODO items noted. Validation of ID
1005 and URI attributes is implemented. Warn if unknown
1006 value is used in rdf:parseType="" attribute.
1007
1008 * URIChecker.pm (check_rdf_uri_reference): New function.
1009
1010 2008-03-21 Wakaba <wakaba@suika.fam.cx>
1011
1012 * RDFXML.pm: bnodeid implemented. Relative references
1013 are now resolved.
1014
1015 2008-03-21 Wakaba <wakaba@suika.fam.cx>
1016
1017 * ContentChecker.pm: RDF reification implemented.
1018
1019 * RDFXML.pm: undef vs false bug fixed.
1020 Reification implemented.
1021
1022 2008-03-21 Wakaba <wakaba@suika.fam.cx>
1023
1024 * RDFXML.pm: s/id/ID/ for attribute name.
1025 The |node| arguments are added for |ontriple| calls.
1026 Too many "attribute not allowed" errors were raised.
1027
1028 * ContentChecker.pm: Initial experimental support for rdf:RDF
1029 element.
1030
1031 2008-03-21 Wakaba <wakaba@suika.fam.cx>
1032
1033 * RDFXML.pm: New module.
1034
1035 2008-03-20 Wakaba <wakaba@suika.fam.cx>
1036
1037 * HTML.pm.src (set_inner_html): Line/column number
1038 code was old one yet.
1039
1040 2008-03-20 Wakaba <wakaba@suika.fam.cx>
1041
1042 * HTML.pm.src: Better line/column reporting for "duplicate attribute"
1043 errors. Line/column markings for DOCTYPE, comment, and
1044 character tokens are reintroduced; otherwise,
1045 error location for "not HTML5" error and errors
1046 for implied elements are not attached.
1047
1048 2008-03-20 Wakaba <wakaba@suika.fam.cx>
1049
1050 * HTML.pm.src: Set line/column numbers to attributes.
1051
1052 * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
1053 (value): Setter implemented.
1054
1055 * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
1056
1057 2008-03-20 Wakaba <wakaba@suika.fam.cx>
1058
1059 * HTML.pm.src: Unused line/column markings are removed.
1060
1061 2008-03-20 Wakaba <wakaba@suika.fam.cx>
1062
1063 * HTML.pm.src (_get_next_token): Remove |first_start_tag|
1064 flag, which is no longer used.
1065
1066 2008-03-17 Wakaba <wakaba@suika.fam.cx>
1067
1068 * HTML.pm.src: Set line/column information to element nodes.
1069
1070 * mkhtmlparser.pl (!!!create-element, !!!insert-element,
1071 and !!!insert-element-t): Set line/column information to
1072 element nodes.
1073
1074 2008-03-17 Wakaba <wakaba@suika.fam.cx>
1075
1076 * HTML.pm.src (_get_next_token): The first "<" character
1077 in "<?", "<>", or "</>" should be the error point.
1078
1079 2008-03-16 Wakaba <wakaba@suika.fam.cx>
1080
1081 * HTML.pm.src: Some more fixes on error position reporting.
1082
1083 2008-03-16 Wakaba <wakaba@suika.fam.cx>
1084
1085 * HTML.pm.src: Token-level precious error reporting.
1086
1087 2008-03-16 Wakaba <wakaba@suika.fam.cx>
1088
1089 * HTML.pm.src: Preparation for more precious error point
1090 reporting.
1091
1092 2008-03-11 Wakaba <wakaba@suika.fam.cx>
1093
1094 * HTML.pm.src: Error type revised.
1095
1096 2008-03-11 Wakaba <wakaba@suika.fam.cx>
1097
1098 * HTML.pm.src: Similar codes are merged together, again.
1099
1100 2008-03-11 Wakaba <wakaba@suika.fam.cx>
1101
1102 * HTML.pm.src: Similar codes are merged together.
1103
1104 2008-03-10 Wakaba <wakaba@suika.fam.cx>
1105
1106 * mkhtmlparser.pl: Set "level" parameter to parse errors.
1107
1108 * HTML.pm.src: Code refined.
1109
1110 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1111
1112 * HTML.pm.src: |</body>| treatement has been changed (HTML5
1113 revision 1348). Note that I really don't know this makes
1114 any difference in the black-box behavior of the parser.
1115
1116 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1117
1118 * HTML.pm.src: New end-of-file token implementation (HTML5
1119 revision 1348).
1120
1121 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1122
1123 * HTML.pm.src: |applet| support (HTML5 revision 1347).
1124
1125 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1126
1127 * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
1128
1129 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1130
1131 * HTML.pm.src: Support for |<input>| in the "in select" insertion
1132 mode and support for the "in select in table" insertion mode (HTML5
1133 revision 1342).
1134
1135 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1136
1137 * HTML.pm.src: No foster parenting for <script> and <script>
1138 in non-tainted <table>s (HTML5 revision 1336).
1139
1140 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1141
1142 * HTML.pm.src: Ignore white space characters between <html>
1143 and <head> (HTML5 revision 1332).
1144
1145 2008-03-09 Wakaba <wakaba@suika.fam.cx>
1146
1147 * HTML.pm.src: Treat <input type=hidden> as if it were a
1148 white space (HTML5 revision 1331).
1149
1150 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1151
1152 * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
1153 element (HTML5 revision 1330).
1154
1155 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1156
1157 * HTML.pm.src: <title> is always appended to the current
1158 element (HTML5 revision 1328).
1159
1160 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1161
1162 * HTML.pm.src: White space in tainted tables are moved
1163 into foster parents (HTML5 revision 1326).
1164
1165 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1166
1167 * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
1168 revision 1321).
1169
1170 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1171
1172 * HTML.pm.src: |</h/n/>| case code rearranged to align with
1173 the spec (HTML5 revision 1320). Note that we finally complete
1174 all of HTML5 revision 1320 changes.
1175
1176 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1177
1178 * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
1179 tags (HTML5 revision 1320).
1180
1181 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1182
1183 * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
1184
1185 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1186
1187 * HTML.pm.src: A "generate implied end tags" code (t409.1)
1188 could not be reached so that it is now removed (HTML5 revision
1189 1320).
1190
1191 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1192
1193 * HTML.pm.src: Code for the case of |</div>| and so on are revised
1194 to align with new spec text (HTML5 revision 1320).
1195
1196 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1197
1198 * HTML.pm.src: Remove strange |if| condition; however, it should
1199 have had no harm in theory.
1200
1201 2008-03-08 Wakaba <wakaba@suika.fam.cx>
1202
1203 * HTML.pm.src (_tree_construction_main): '</p>' in body
1204 case is split from other end tags for the preparation
1205 of implementing HTML revision 1320.
1206
1207 2008-03-07 Wakaba <wakaba@suika.fam.cx>
1208
1209 * HTML.pm.src: Simplified "generate implied end tag" (HTML5
1210 revision 1320).
1211
1212 2008-03-07 Wakaba <wakaba@suika.fam.cx>
1213
1214 * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
1215 and "div" (HTML5 revision 1318). Add comments to where
1216 |form| pointer association codes should be inserted (HTML5 revision
1217 1319).
1218
1219 2008-03-06 Wakaba <wakaba@suika.fam.cx>
1220
1221 * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
1222
1223 2008-03-05 Wakaba <wakaba@suika.fam.cx>
1224
1225 * HTML.pm.src: Since the case t268 should never be reached (no
1226 other token type, there are), it is replaced by a |die| statement.
1227
1228 2008-03-05 Wakaba <wakaba@suika.fam.cx>
1229
1230 * HTML.pm.src: Typo fixed.
1231
1232 2008-03-04 Wakaba <wakaba@suika.fam.cx>
1233
1234 * HTML.pm.src (_tree_construction_initial): Some limited quirks
1235 doctypes were not uppercased for comparison.
1236
1237 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1238
1239 * HTML.pm.src (tree construction and set_inner_html): Checkpoints
1240 are added.
1241
1242 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1243
1244 * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
1245 are set. Cases that are unlikely reached are noted as so.
1246
1247 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1248
1249 * HTML.pm.src: Checkpoints for debugging are added.
1250
1251 * mkhtmlparser.pl: Support for |!!!cp| syntax.
1252
1253 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1254
1255 * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
1256 for simplicity.
1257
1258 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1259
1260 * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
1261 revision 1307).
1262
1263 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1264
1265 * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
1266 is set are changed (HTML5 revision 1305).
1267
1268 2008-03-02 Wakaba <wakaba@suika.fam.cx>
1269
1270 * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
1271 1292). Entities are not parsed in comment-like part in RCDATA
1272 elements (HTML5 revision 1294). Allow bare & at the end
1273 of attribute value literals (HTML5 revision 1296). More
1274 quirks mode doctypes (HTML5 revision 1302). Requires spaces
1275 between attributes and ban attribute names or unquoted
1276 attribute values containing single or double quotes (HTML5
1277 revision 1303).
1278
1279 2008-03-02 Wakaba <wakaba@suika.fam.cx>
1280
1281 * ContentChecker.pm: Typo fixed. Don't raise "character encoding"
1282 and related errors unless it is an HTML document (though the spec
1283 is unclear on whether it is applied to XHTML document).
1284
1285 * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
1286 are added.
1287
1288 2008-03-02 Wakaba <wakaba@suika.fam.cx>
1289
1290 * HTML.pm.src: s/local_name/manakai_local_name/g.
1291
1292 2008-03-01 Wakaba <wakaba@suika.fam.cx>
1293
1294 * _NamedEntityList.pm: Updated (HTML5 revision 1286).
1295
1296 * HTML.pm.src: |charset| in |content| attribute is
1297 case-insensitive (HTML5 revision 1270).
1298
1299 2008-02-26 Wakaba <wakaba@suika.fam.cx>
1300
1301 * ContentChecker.pm: New status constants are added.
1302 ($ElementDefault): |status| added.
1303 (check_element): Err for non-standard or deprecated elements.
1304 (_attr_status_info): For non-standard or deprecated attributes.
1305
1306 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1307
1308 * ContentChecker.pm (_attr_status_info): New internal method.
1309
1310 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1311
1312 * ContentChecker.pm (check_element): Element standardized
1313 status information is now dispatched.
1314
1315 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1316
1317 * ContentChecker.pm (check_element): Fix |del|-and-significant
1318 problem by adding some more arguments.
1319
1320 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1321
1322 * ContentChecker.pm (check_element): Use context of
1323 container-for-the-purpose-of-content-model element (not transparent
1324 element) for |check_child_element| calling and significant
1325 text flag marking. This reintroduces |<del>|-and-significant
1326 problem again.
1327
1328 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1329
1330 * ContentChecker.pm (check_element): Make semi-transparent
1331 elements ignored for the purpose of phase changes in
1332 content model checking.
1333
1334 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1335
1336 * ContentChecker.pm (check_element): In-element state
1337 was not properly managed for transparent cases.
1338
1339 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1340
1341 * ContentChecker.pm (check_element): Support for |video|
1342 and |audio| as semi-transparent elements.
1343
1344 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1345
1346 * ContentChecker.pm ($HTMLSemiTransparentElements): New.
1347 (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
1348 Support for |html:object| as a semi-transparent element.
1349
1350 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1351
1352 * ContentChecker.pm (check_element): The way to traverse
1353 the tree is entirely revised to make it easier to track
1354 the state of ancestors/descendants. As a result of this
1355 revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1356 support for content model checking for HTML elements |figure|,
1357 |object|, |video|, and |audio| and checking for XML elements (and
1358 some XMLNS checkings) are dropped for now. They will be
1359 reimplemented in due cource.
1360
1361 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1362
1363 * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1364 token (HTML5 revision 1225).
1365
1366 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1367
1368 * ContentChecker.pm ({unsupported_level}): New value.
1369
1370 * HTML.pm.src: Save whether |meta| |content| attribute
1371 contains character references or not.
1372
1373 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1374
1375 * ContentChecker.pm (_get_children): (Incompleted) attempt to
1376 imlement significant content checking for contents
1377 with |del| elements.
1378
1379 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1380
1381 * ContenteChecker.pm ($HTMLTransparentElements): More
1382 elements are added.
1383 (_get_children): HTML |object| elements are now semi-transparent.
1384
1385 * NanoDOM.pm (manakai_html, manakai_head): New methods.
1386
1387 2008-02-16 Wakaba <wakaba@suika.fam.cx>
1388
1389 * CacheManifest.pm: HTML5 revision 1211 implemented.
1390
1391 * CacheManifest.pod: Updated.
1392
1393 2008-02-10 Wakaba <wakaba@suika.fam.cx>
1394
1395 * ContentChecker.pm (check_document, check_element): Support
1396 for second argument ($onsubdoc).
1397 (_get_css_parser): Removed (now it is part of WDCC).
1398
1399 2008-02-09 Wakaba <wakaba@suika.fam.cx>
1400
1401 * ContentChecker.pm (_get_css_parser): New.
1402
1403 2007-11-25 Wakaba <wakaba@suika.fam.cx>
1404
1405 * ContentChecker.pm ($AnyChecker): Old way to add child elements
1406 for checking had been used.
1407
1408 2007-11-25 Wakaba <wakaba@suika.fam.cx>
1409
1410 * ContentChecker.pm (check_element): New todo item type |descendant|.
1411
1412 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1413
1414 * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1415 specced in RFC 2046 and RFC 4288.
1416 (application/atom+xml): Definition added.
1417
1418 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1419
1420 * URIChecker.pm: Make RFC 3986 should-level errors
1421 warnings (rather than SHOULD-level errors).
1422
1423 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1424
1425 * NanoDOM.pm (get_user_data, set_user_data): New methods.
1426
1427 * HTML.pm.src: A flag for character references in attribute
1428 values are added. Set |manakai_has_reference| user data
1429 to |charset| attribute.
1430
1431 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1432
1433 * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1434 attributes.
1435
1436 * ContentChecker.pm (check_document): Warn if charset requirements
1437 cannot be tested.
1438
1439 2007-11-19 Wakaba <wakaba@suika.fam.cx>
1440
1441 * HTML.pm.src (parse_byte_string): Detect charset
1442 by universalchardet if charset parameter is not specified.
1443
1444 * Makefile (Charset-all, Charset-clean): New rules.
1445
1446 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1447
1448 * ContentChecker.pm (check_document): Check the existence
1449 of character encoding declaration and selection of encoding
1450 for HTML document.
1451
1452 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1453
1454 * ContentType.pm (get_sniffed_type): Return also the official
1455 type in list context.
1456
1457 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1458
1459 * ContentType.pm: Sniffing with leading white space
1460 ignoring (HTML5 revisions 1013 and 1016).
1461
1462 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1463
1464 * ContentType.pm: HTML5 revision 1013 changes, except for leading
1465 white spaces, are implemented.
1466
1467 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1468
1469 * HTML.pm.src (parse_char_string): Set |inner_encoding|
1470 attribute if possible.
1471
1472 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1473
1474 * HTML.pm.src (parse_byte_string): New method.
1475 (parse_char_string): New alias for |parse_string|.
1476 (main phase): Invoking "change the encoding" algorithm if desired.
1477
1478 * HTML.pod: Updated.
1479
1480 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1481
1482 * HTML.pod (get_inner_html): Removed.
1483
1484 * Makefile (HTML-all, HTML-clean): New.
1485
1486 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1487
1488 * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1489
1490 2007-11-08 Wakaba <wakaba@suika.fam.cx>
1491
1492 * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1493
1494 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1495
1496 * Makefile: |CacheManifest.html| is added.
1497
1498 * CacheManifest.pod: New file.
1499
1500 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1501
1502 * CacheManifest.pm: New module.
1503
1504 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1505
1506 * HTML.pm.src: Support for application cache selection algorithm
1507 callback.
1508
1509 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1510
1511 * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1512
1513 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1514
1515 * Makefile (clean): New rule.
1516
1517 * NanoDOM.pm (public_id, system_id): New attributes.
1518
1519 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1520
1521 * Makefile (CSS-all, CSS-clean, clean): New rules.
1522
1523 2007-10-14 Wakaba <wakaba@suika.fam.cx>
1524
1525 * ContentChecker.pm (check_document): Support for
1526 new |is_xml_root| flag.
1527 (check_element): Support for new |pluses| state.
1528 (_add_pluses): New method.
1529 (_remove_minuses): Support for new |minus| item.
1530
1531 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1532
1533 * ContentChecker.pm: Raise specific error for invalid
1534 root element.
1535
1536 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1537
1538 * ContentChecker.pm: Set level values for later uses.
1539
1540 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1541
1542 * ContentChecker.pm: Support for language tag validation.
1543
1544 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1545
1546 * LangTag.pm (check_rfc3066_language_tag): New method.
1547
1548 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1549
1550 * LangTag.pm: New module.
1551
1552 2007-09-04 Wakaba <wakaba@suika.fam.cx>
1553
1554 * HTML.pm.src: Some error types were wrong.
1555
1556 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1557
1558 * CSS/: New directory.
1559
1560 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1561
1562 * ContentChecker.pm (_check_get_children): Support
1563 for |noscript| in |head|.
1564
1565 2007-08-12 Wakaba <wakaba@suika.fam.cx>
1566
1567 * URI/: New directory.
1568
1569 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1570
1571 * HTML.pm.src: Tokenizer's states are now represented in
1572 number.
1573
1574 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1575
1576 * HTML.pm.src: |or|s for insertion modes are replaced
1577 by |&|s.
1578
1579 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1580
1581 * HTML.pm.src: Token types are now represented in number.
1582
1583 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1584
1585 * ContentType.pm (SEE ALSO): Updated.
1586
1587 * HTML.pm.src: Insertion modes are now represented in number.
1588
1589 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1590
1591 * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1592 999) is implemented.
1593
1594 2007-08-08 Wakaba <wakaba@suika.fam.cx>
1595
1596 * ContentType.pm: Sniffing for |<script| (HTML5 revision
1597 983) is implemented.
1598
1599 2007-08-06 Wakaba <wakaba@suika.fam.cx>
1600
1601 * ContentChecker.pod: New documentation.
1602
1603 * Makefile: A rule for |ContentChecker.html| is added.
1604
1605 * ContentChecker.pm: A pod "LICENSE" section is added.
1606
1607 * NanoDOM.pm ($VERSION): New variable.
1608
1609 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1610
1611 * H2H.pm: |b|, |i|, and |sub| are added to the
1612 list of allowed HTML elements.
1613
1614 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1615
1616 * H2H.pm: |samp| is added to the list of allowed
1617 HTML elements.
1618
1619 * URIChecker.pm (check_iri): New.
1620 (check_iri_reference): Error type for IRI reference
1621 syntax error is changed.
1622
1623 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1624
1625 * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1626 is added.
1627 (check_document): Load appropriate module before validation.
1628
1629 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1630
1631 * ContentChecker/: New directory.
1632
1633 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1634
1635 * ContentChecker.pm: HTML |time| element is implemented.
1636
1637 * HTMLTable.pm: Comments are updated as HTML5 is revised.
1638
1639 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1640
1641 * ContentChecker.pm (check_document): Return value
1642 even if no document element is found.
1643
1644 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1645
1646 * HTML.pm.src: |$in_body| is no longer a function.
1647
1648 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1649
1650 * HTML.pm.src: The |$in_body| code has been moved down.
1651
1652 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1653
1654 * HTML.pm.src: The "trailing end" insertion mode
1655 is split into "after html body" and "after html frameset"
1656 insertion modes. Their codes are merged with "after body"
1657 and "after frameset" codes. |$previous_insertion_mode|
1658 has been removed. "after frameset" code is
1659 merged with "in frameset" code.
1660
1661 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1662
1663 * HTML.pm.src: The "before head" insertion mode is
1664 merged with the "in head" insertion mode.
1665
1666 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1667
1668 * HTML.pm.src: Cases in "in head" insertion mode are
1669 reorganized.
1670
1671 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1672
1673 * HTML.pm.src: Some cases in "in table" insertion mode
1674 are merged.
1675
1676 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1677
1678 * HTML.pm.src: The "in row" insertion mode is merged
1679 with "in table" insertion mode.
1680
1681 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1682
1683 * HTML.pm.src: The "in table" and "in table body" insertion
1684 modes are merged.
1685
1686 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1687
1688 * HTML.pm.src: There is no "in table head" or "in table foot"
1689 insertion mode!
1690
1691 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1692
1693 * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
1694 now directly invoke the handler.
1695
1696 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1697
1698 * HTML.pm.src: Codes for "in cell" insertion mode
1699 is merged to the "in body" insertion mode code.
1700
1701 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1702
1703 * HTML.pm.src: Codes for "in body" and "in caption"
1704 insertion modes are merged.
1705
1706 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1707
1708 * HTML.pm.src: Two |!!!next-token|s were missing.
1709
1710 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1711
1712 * HTML.pm.src: Use numeric constant for |{content_mode}|
1713 instead of string constant for |{content_model_flag}|.
1714
1715 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1716
1717 * HTML.pm.src: Add the name of the attribute
1718 to the "duplicate attribute" error.
1719
1720 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1721
1722 * ContentChecker.pm: Return the |class| node list.
1723
1724 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1725
1726 * ContentChecker.pm: Return the |id| node list.
1727
1728 * HTML.pm.src: A typo is fixed.
1729
1730 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1731
1732 * ContentChecker.pm: Drop wrong |level => 'error'| specification
1733 from "in HTML:xml:lang" error. Character position
1734 is now the last part of the error type in the URI error
1735 description. Report "unsupported" status for language
1736 tags, media queries, script codes, and style sheets.
1737
1738 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1739
1740 * ContentChecker.pm: Report error if |xml:lang|
1741 in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
1742 in XML.
1743
1744 * NanoDOM.pm (Attr.owner_document): New attribute.
1745
1746 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1747
1748 * HTML.pm.src: The character immediately following
1749 a bare |hcro| was discarded. Fix handling of
1750 entity references in attribute values.
1751
1752 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1753
1754 * HTML.pm.src (main and trailin end phases): Token
1755 types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
1756 factored out. Error types |in html:#DOCTYPE|
1757 and |after html:#DOCTYPE| are merged
1758 into |DOCTYPE in the middle|. |</frameset>|
1759 in fragment parsing mode changed the
1760 insertion mode.
1761
1762 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1763
1764 * HTML.pm.src: |$phase| has been removed; The |trailing end|
1765 phase is now an insertion mode. Treatments for white
1766 space character tokens were incorrect for some
1767 insertion modes. An old |meta| case was not removed.
1768
1769 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1770
1771 * HTML.pm.src: |meta| charset declaration extraction
1772 implemented (but changing the encoding is not yet:-).
1773
1774 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1775
1776 * Charset/: New directory.
1777
1778 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1779
1780 * H2H.pm: New Perl module (created from
1781 manakai's H2H.dis).
1782
1783 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1784
1785 * XMLSerializer.pm: New Perl module (created from
1786 manakai's SimpleLS.dis).
1787
1788 2007-07-07 Wakaba <wakaba@suika.fam.cx>
1789
1790 * HTML.pm.src: |<!---x-->| was not processed correctly.
1791
1792 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1793
1794 * HTML.pm.src: Report correct error message
1795 for |<body></div></body>|.
1796
1797 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1798
1799 * HTMLTable.pm: An error description was incorrect.
1800
1801 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1802
1803 * ContentChecker.pm: Return |{term}| list.
1804
1805 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1806
1807 * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
1808 nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
1809 and <title> outside of head).
1810
1811 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1812
1813 * IMTChecker.pm: Report warning for unregistered
1814 and private types/subtypes.
1815
1816 * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
1817 URIChecker.pm, HTMLTable.pm: Error messages are now
1818 consistent; they are all listed in
1819 <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
1820
1821 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1822
1823 * ContentChecker.pm: |<img ismap>| not in |<a></a>|
1824 is now erred. |<datalist>| is implemented.
1825 Attribute checker for |<command>| and |<menu>| are
1826 added. Support for |contextmenu| global attribute
1827 is added.
1828
1829 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1830
1831 * HTML.pm.src (_reset_insertion_mode): Interpretation
1832 of Step 3 has been changed.
1833
1834 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1835
1836 * HTML.pm.src: Late |<html>| parse error is implemented.
1837
1838 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1839
1840 * URIChecker.pm (check_iri_reference): A |decode| method name was
1841 incorrect.
1842
1843 * ContentChecker.pm: Support for the |footer| element.
1844 Check URI syntax for space-separated URI attributes.
1845 Support for the |tabindex| attribute. Support
1846 for |datetime| attribute.
1847
1848 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1849
1850 * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
1851 character references). HTML5 revision 1.146 (white space
1852 characters before root start tag). HTML5 revision
1853 1.148 (named character references in attribute values).
1854 HTML5 revision 1.152 (<plaintext>.innerHTML get).
1855
1856 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1857
1858 * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
1859 in <head>).
1860
1861 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1862
1863 * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>
1864 in body).
1865
1866 * ContentChecker.pm: HTML5 revision 938 (scoped="").
1867
1868 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1869
1870 * HTML.pm.src: HTML5 revision 923 (matching end tag in CDATA
1871 or RCDATA in fragment parsing mode). HTML5 revision
1872 924 (<!--> and <!--->). HTML5 revision 926 (hn in hn).
1873
1874 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1875
1876 * HTML.pm.src (get_inner_html): HTML5 revision 922 (inner_html
1877 for <pre> and <textarea>).
1878
1879 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1880
1881 * HTML.pm.src: HTML5 revision 920 (<isindex>).
1882
1883 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1884
1885 * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).
1886
1887 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1888
1889 * HTML.pm.src: HTML5 revision 916 (</body>).
1890 HTML5 revision 917 (conforming bare &).
1891
1892 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1893
1894 * NanoDOM.pm (manakai_is_html): Setting to false did not work.
1895
1896 * HTML.pm.src: HTML5 revision 914 (</ in CDATA, RCDATA).
1897 HTML5 revision 915 (<nobr>).
1898
1899 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1900
1901 * HTML.pm.src: HTML5 revisions 908, 909, 912, and 913 (quirks mode).
1902
1903 * NanoDOM.pm (manakai_is_html, manakai_compat_mode, compat_mode):
1904 New attributes.
1905
1906 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1907
1908 * HTML.pm.src: HTML5 revisions 900, 901, 902, and 911 (<
1909 in tags).
1910
1911 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1912
1913 * .cvsignore: |Entities.html| is added.
1914
1915 * HTML.pm.src: |$entity_char| is removed and
1916 requires |Whatpm::_NamedEntityList| instead.
1917 HTML5 revision 898 (refc), except that lack of refc
1918 is parse error.
1919
1920 * mkentitylist.pl: New script.
1921
1922 * Makefile (all): |_NamedEntityList.pm| is added.
1923 (_NamedEntityList.pm, Entities.html): New rules.
1924
1925 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1926
1927 * HTML.pm.src: Parse errors immediately after U+000D
1928 were ignored and U+000D immediately following another
1929 U+000D was not converted to U+000A.
1930
1931 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1932
1933 * HTML.pm.src (set_inner_html): HTML5 revision 892 (adopt
1934 nodes before appended). Parser was not ready for NULL
1935 parse error and escape flag.
1936
1937 * NanoDOM.pm (adopt_node): New.
1938
1939 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1940
1941 * HTML.pm.src: HTML5 revision 886 (insane comment in
1942 CDATA and RCDATA). Note that current implementation
1943 is simply repeating what the spec says and it is maybe not
1944 a best way to do it.
1945
1946 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1947
1948 * HTML.pm.src: HTML5 revision 884 (</form> don't close
1949 the form element if a descendant element without implied end tag has
1950 still been open).
1951
1952 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1953
1954 * ContentChecker.pm: HTML5 revision 881 (Make |id| attribute with
1955 space characters non-conforming).
1956
1957 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1958
1959 * HTML.pm.src: An error message was incorrect.
1960 HTML5 revision 869 (C1 character references).
1961
1962 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1963
1964 * HTML.pm.src: HTML5 revision 867 (a LF at the beginning of
1965 a |textarea| is removed).
1966
1967 2007-06-05 Wakaba <wakaba@suika.fam.cx>
1968
1969 * NanoDOM.pm (get_attribute_node_ns): New method.
1970
1971 * ContentChecker.pm: |script| |async| and |defer|
1972 no longer require |src|. |async| MUST NOT be
1973 specified if |defer|. (HTML5 revision 858).
1974
1975 2007-05-30 Wakaba <wakaba@suika.fam.cx>
1976
1977 * HTML.pm.src: |<form><form>| went to inifinite loop.
1978
1979 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1980
1981 * ContentChecker.pm (html): Set |is_root| (allowed
1982 as a document element) flag on.
1983 (new): Removed.
1984 (check_document): New method.
1985
1986 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1987
1988 * ContentChecker.pm (thead, tfoot): Checker specifications
1989 were incorrect.
1990
1991 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1992
1993 * ContentChecker.pm ($HTMLURIAttrChecker): Include
1994 error position in the |type| option of the error.
1995
1996 * HTMLTable.pm (form_table): The |$onerror| parameter
1997 is now optional. Some bugs are fixed.
1998
1999 2007-05-27 Wakaba <wakaba@suika.fam.cx>
2000
2001 * HTMLTable.pm: New module.
2002
2003 * ContentChecker.pm (table): Invoke table model error checker.
2004
2005 * NanoDOM.pm (first_child, get_attribute_ns): New.
2006
2007 2007-05-26 Wakaba <wakaba@suika.fam.cx>
2008
2009 * ContentChecker.pm ($HTMLLinkTypesAttrChecker): New checker.
2010 (link/@rel, a/@rel, area/@rel): Use new checker.
2011
2012 * Makefile (_LinkTypeList.pm, RelExtensions.html): New rules.
2013
2014 * _LinkTypeList.pm: New file.
2015
2016 * mklinktypelist.pl: New file.
2017
2018 * .cvsignore: |RelExtensions.html| added.
2019
2020 * NanoDOM.pm (child_nodes): Returns an empty array
2021 for non-child-containing node types.
2022 (text_content): New attribute.
2023
2024 2007-05-26 Wakaba <wakaba@suika.fam.cx>
2025
2026 * IMTChecker.pm: New module.
2027
2028 * ContentChecker.pm ($HTMLIMTAttrChecker): Call IMTChecker
2029 to test parameter value validity.
2030
2031 * HTML.pm.src ($style_start_tag): Attributes were
2032 discarded.
2033
2034 2007-05-25 Wakaba <wakaba@suika.fam.cx>
2035
2036 * ContentChecker.pm ($HTMLURIAttrChecker): Implemented.
2037
2038 2007-05-25 Wakaba <wakaba@suika.fam.cx>
2039
2040 * URIChecker.pm: All recommendations from RFC 3986
2041 and RFC 3987 are listed (not all testable items are checked yet).
2042
2043 2007-05-25 Wakaba <wakaba@suika.fam.cx>
2044
2045 * URIChecker.pm: New module.
2046
2047 2007-05-20 Wakaba <wakaba@suika.fam.cx>
2048
2049 * ContentChecker.pm: Now most attributes are implemented
2050 or associated to some placeholder.
2051 ($ElementDefault): Warn unknown attributes
2052 for unknown elements as "attribute not supported".
2053 ($HTMLLanguageTagAttrChecker, $HTMLMQAttrChecker): New placeholders.
2054 ($HTMLUsemapAttrChecker, $HTMLTargetAttrChecker): New checkers.
2055 (|a| attribute checker): Reimplemented.
2056
2057 2007-05-20 Wakaba <wakaba@suika.fam.cx>
2058
2059 * ContentChecker.pm ($HTMLEventHandlerAttrChecker): New placeholder.
2060 ($HTMLAttrChecker): Event handler content attributes
2061 are added.
2062 (link, embed): Required attribute is now checked.
2063 (embed): Unknown local attributes are no longer warned.
2064
2065 2007-05-20 Wakaba <wakaba@suika.fam.cx>
2066
2067 * ContentChecker.pm ($HTMLSpaceURIsAttrChecker): New placeholder.
2068 ($HTMLIMTAttrChecker): New checker.
2069 (link@rel, link@href, link@type, style@type,
2070 a@href, a@ping, a@ping, a@type, embed@src, embed@type,
2071 object@data, object@type, source@src, source@type, area@alt,
2072 area@shape, area@coords, area@href,
2073 area@ping, area@rel, area@type, script@src,
2074 script@defer, script@async, script@type): Checkers added.
2075
2076 2007-05-20 Wakaba <wakaba@suika.fam.cx>
2077
2078 * ContentChecker.pm: Descendant checking was incorrect.
2079
2080 2007-05-19 Wakaba <wakaba@suika.fam.cx>
2081
2082 * ContentChecker.pm: Support |xml:*| and |xmlns:*|
2083 attributes. Report an error if |Element.prefix|
2084 is |xmlns|.
2085
2086 * NanoDOM.pm (prefix): New attribute.
2087
2088 2007-05-19 Wakaba <wakaba@suika.fam.cx>
2089
2090 * HTML.pm.src: In |main| phase, |in body| insertion
2091 mode, action for |<iframe>| was missing.
2092
2093 2007-05-19 Wakaba <wakaba@suika.fam.cx>
2094
2095 * ContentChecker.pm: Support for many of HTML5 elements.
2096 ($GetHTMLNonNegativeIntegerAttrChecker): New.
2097
2098 2007-05-19 Wakaba <wakaba@suika.fam.cx>
2099
2100 * ContentChecker.pm: Support for most elements up to |progress|.
2101 ($HTMLURIAttrChecker): Placeholder.
2102 ($HTMLIntegerAttrChecker, $GetHTMLFloatingPointNumberAttrChecker):
2103 New.
2104
2105 2007-05-19 Wakaba <wakaba@suika.fam.cx>
2106
2107 * ContentChecker.pm: Attribute checkers
2108 for global attributes, |html|, |base|, |style|, and |meta|.
2109
2110 * NanoDOM.pm (insert_before): Weaken reference
2111 to the parent node.
2112 (Attr::new): Set |owner_element| attribute.
2113 (namespace_uri, manakai_local_name): New attribute implementations.
2114 (owner_element): New attribute.
2115
2116 2007-05-19 Wakaba <wakaba@suika.fam.cx>
2117
2118 * ContentChecker.pm ($AttrChecker, $HTMLAttrChecker,
2119 $AnyChecker->{attr_checker}, $HTMLAttrsChecker,
2120 $Element->{$HTML_NS}->{''}): New.
2121 (check_element): Invoke attrs_checker for each element.
2122
2123 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2124
2125 * ContentChecker.pm: Don't use |manakai_element_type_match|.
2126
2127 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2128
2129 * ContentChecker.pm: Use hashs rather than lists for
2130 element type testings.
2131
2132 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2133
2134 * ContentChecker.pm: Don't generate duplicate
2135 error when an element type is put in the "minus" list
2136 and the element type is not allowed explicitly in the particular
2137 element content model.
2138 (html:a checker): New checker.
2139 (html:details, html:datagrid): New checkers.
2140 (html:legend): New checker.
2141
2142 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2143
2144 * ContentChecker.pm (html:li checker): Implemented.
2145
2146 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2147
2148 * ContentChecker.pm ($HTMLInlineOrStriclyInlineChecker): New
2149 checker.
2150 (html:dd checker): New checker.
2151 (html:q, html:em, html:strong, html:small,
2152 html:m, html:dfn, html:code, html:samp, html:span): New checkers.
2153
2154 2007-05-13 Wakaba <wakaba@suika.fam.cx>
2155
2156 * ContentChecker.pm ($AnyChecker): Renamed
2157 from |$ElementDefault->{checker}|.
2158 ($ElementDefault->{checker}): Throw an error that
2159 the element type is not supported by the checker.
2160 ($HTMLMetadataElement): |html:base| was missing.
2161 ($HTMLEmptyChecker): Don't throw an error
2162 for inter-element whitespace nodes.
2163 (html:html checker): Errors were not
2164 thrown even if |html:head| and/or |html:body|
2165 children were missing.
2166 (html:head checker): An error was not
2167 thrown if <meta charset> appered after other
2168 elements.
2169
2170 2007-05-05 Wakaba <wakaba@suika.fam.cx>
2171
2172 * ContentChecker.pm: |footer|, |video|, |audio|, |script|,
2173 and |noscript| elements are implemented.
2174 (new): New method.
2175
2176 2007-05-04 Wakaba <wakaba@suika.fam.cx>
2177
2178 * ContentChecker.pm: New module.
2179
2180 2007-05-04 Wakaba <wakaba@suika.fam.cx>
2181
2182 * NanoDOM.pm (manakai_parent_element,
2183 document_element, manakai_local_name,
2184 manakai_element_type_match): New method.
2185
2186 2007-05-03 Wakaba <wakaba@suika.fam.cx>
2187
2188 * HTML.pm.src: Replace decimal and hexadecimal numeric
2189 entities in C1 range using Windows-1252 mapping. Bare LF
2190 did not count as new line for error reporting.
2191
2192 2007-05-02 Wakaba <wakaba@suika.fam.cx>
2193
2194 * NanoDOM.pm (DOMImplementation): New class.
2195 (append_child): Weaken the |parent_node| reference.
2196 (create_element_ns, Element new): Set the |owner_document|
2197 reference.
2198 (implementation): New attribute.
2199 (owner_document, local_name, namespace_uri): New attributes.
2200
2201 * HTML.pm.src (parse_string): Line and column numbers
2202 are now provided to error handler.
2203 (!!!parse-error): Short descriptions are added.
2204 (_construct_tree): Split into three methods; support
2205 for innerHTML mode.
2206 (set_inner_html): New method.
2207
2208 2007-05-01 Wakaba <wakaba@suika.fam.cx>
2209
2210 * NanoDOM.html: Documentation is added.
2211
2212 * HTML.pod, ContentType.html: Documentation is revised.
2213
2214 * .cvsignore: Pod2html temporary files are added.
2215
2216 * Makefile: Make |NanoDOM.html|.
2217
2218 2007-05-01 Wakaba <wakaba@suika.fam.cx>
2219
2220 * HTML.pm.src (parse_string): New method.
2221 (get_inner_html): Renamed from |inner_html|.
2222
2223 * Makefile: A rule for |HTML.html| is added.
2224
2225 * HTML.pod: New documentation.
2226
2227 2007-05-01 Wakaba <wakaba@suika.fam.cx>
2228
2229 * NanoDOM.pm (last_child, previous_sibling): New attributes.
2230 (clone_node): Attribute nodes were not completely copied.
2231
2232 * HTML.pm.src: Many bugs are fixed.
2233
2234 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2235
2236 * HTML.pm.src: Some typos are fixed.
2237
2238 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2239
2240 * mkhtmlparser.pl, Makefile: References to the |HTML-consume-entity.src|
2241 are removed.
2242
2243 * HTML.pm.src: Tokenizer's handling on named entities are rewritten.
2244
2245 * HTML-consume-entity.src: Removed.
2246
2247 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2248
2249 * HTML.pm.src: Tokenizer's handling on hexadecimal
2250 numeric entities are rewritten.
2251
2252 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2253
2254 * HTML.pm.src: Some tokenizer bugs are fixed.
2255
2256 2007-04-30 Wakaba <wakaba@suika.fam.cx>
2257
2258 * HTML.pm.src: The tree construction stage is implemented.
2259
2260 * mkhtmlparser.pl: New macros are added.
2261
2262 2007-04-28 Wakaba <wakaba@suika.fam.cx>
2263
2264 * ContentType.pm: A note on bug in the specification
2265 is removed since it's been now fixed.
2266
2267 * .cvsignore: New file.
2268
2269 2007-04-28 Wakaba <wakaba@suika.fam.cx>
2270
2271 * HTML.pm.src, HTML-consume-entity.src: New files.
2272
2273 * Makefile (HTML.pm): New rule.
2274
2275 * mkhtmlparser.pl: New script.
2276
2277 2007-04-25 Wakaba <wakaba@suika.fam.cx>
2278
2279 * Makefile: New file.
2280
2281 2007-04-24 Wakaba <wakaba@suika.fam.cx>
2282
2283 * ContentType.pm: An error in pod is fixed.
2284
2285 2007-04-24 Wakaba <wakaba@suika.fam.cx>
2286
2287 * ContentType.pm: Documentation is added.
2288
2289 2007-04-24 Wakaba <wakaba@suika.fam.cx>
2290
2291 * ContentType.pm: New Perl module.
2292
2293 * ChangeLog: New file.
2294
2295

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24