/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Contents of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.306 - (show annotations) (download)
Sun Sep 14 09:05:54 2008 UTC (16 years, 10 months ago) by wakaba
Branch: MAIN
Changes since 1.305: +9 -0 lines
++ whatpm/Whatpm/ChangeLog	14 Sep 2008 09:05:44 -0000
	* HTML.pm.src: White-space-leaded non-white-space character
	tokens in "before head insertion mode" was not
	correctly handled.
	(set_inner_html): Reimplemented using CharString decodehandle
	class.  Support for $get_wrapper argument.  Support
	for |{read_until}| feature.

2008-09-14  Wakaba  <wakaba@suika.fam.cx>

1 2008-09-14 Wakaba <wakaba@suika.fam.cx>
2
3 * HTML.pm.src: White-space-leaded non-white-space character
4 tokens in "before head insertion mode" was not
5 correctly handled.
6 (set_inner_html): Reimplemented using CharString decodehandle
7 class. Support for $get_wrapper argument. Support
8 for |{read_until}| feature.
9
10 2008-09-14 Wakaba <wakaba@suika.fam.cx>
11
12 * HTML.pm.src: Make a "bare ero" error for unknown
13 entities point the "&" character.
14
15 2008-09-14 Wakaba <wakaba@suika.fam.cx>
16
17 * HTML.pm.src: It turns out that U+FFFD don't have to
18 be added to the list of excluded characters.
19
20 2008-09-14 Wakaba <wakaba@suika.fam.cx>
21
22 * HTML.pm.src ($char_onerror): Have character decoder's |line|
23 and |column| a higher priority than the one set by the
24 tokenizer's input handler.
25 ($self->{read_until}): Exclude U+FFFD (but this might
26 not be necessary, since now we do line/column fixup in
27 the character decode handle).
28
29 2008-09-14 Wakaba <wakaba@suika.fam.cx>
30
31 * HTML.pm.src: Use |{read_until}| where possible.
32
33 2008-09-14 Wakaba <wakaba@suika.fam.cx>
34
35 * HTML.pm.src: Change |{getc_until}| to |{read_until}|
36 and |manakai_getc_until| to |manakai_read_until| to
37 reduce the number of string copies.
38
39 2008-09-14 Wakaba <wakaba@suika.fam.cx>
40
41 * HTML.pm.src (parse_char_string): Use newly created
42 |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
43 standard feature to |open| a string as a filehandle,
44 since Perl's string filehandle seems not supporting |ungetc|
45 method correctly.
46 (parse_char_stream): Define |{getc_until}| method.
47 (DATA_STATE): Experimental support for |getc_until| feature.
48
49 2008-09-13 Wakaba <wakaba@suika.fam.cx>
50
51 * HTML.pm.src: Check points added to newly added branches.
52
53 2008-09-13 Wakaba <wakaba@suika.fam.cx>
54
55 * HTML.pm.src: Remove |{char}|, which is no longer used.
56 Remove |{entity_in_attr}| and |{last_attribute_value_state}|
57 and replaced by |{prev_state}|.
58
59 * mkhtmlparser.pl: Remove |{char}| feature.
60 Remove |!!!back-next-input-character;| macro.
61
62 2008-09-13 Wakaba <wakaba@suika.fam.cx>
63
64 * HTML.pm.src: Finally we get rid of all the inner loops. Remove
65 entity related tokenizer states in favor of new states
66 implementing the consume character reference algorithm.
67
68 2008-09-13 Wakaba <wakaba@suika.fam.cx>
69
70 * HTML.pm.src: "Consume a character reference" algorithm is
71 now implemented as a tokenizer's state, rather than
72 a method, with minimum changes (more changes will
73 be made, in due course). "Bogus comment state"'s inner
74 loop gets removed.
75
76 2008-09-13 Wakaba <wakaba@suika.fam.cx>
77
78 * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
79 into their own tokenizer states.
80
81 2008-09-13 Wakaba <wakaba@suika.fam.cx>
82
83 * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
84 is split into three states.
85
86 2008-09-13 Wakaba <wakaba@suika.fam.cx>
87
88 * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
89 itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
90 no longer does the tokenizer have to push back next input
91 characters in those states.
92
93 2008-09-13 Wakaba <wakaba@suika.fam.cx>
94
95 * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
96 into four states so that no longer does the tokenizer have to push
97 back next input characters in that state.
98
99 2008-09-11 Wakaba <wakaba@suika.fam.cx>
100
101 * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
102 which can be used to insert some wrapper between the character
103 stream handle and the tokenizer. (It is currently not supported
104 for |set_inner_html| for |Element|s).
105
106 2008-09-10 Wakaba <wakaba@suika.fam.cx>
107
108 * HTML.pm.src: Ignore punctuations in charset names.
109
110 2008-09-10 Wakaba <wakaba@suika.fam.cx>
111
112 * ContentChecker.pm: Support for charset-layer error levels.
113
114 * HTML.pm.src: Don't specify |text| argument for the
115 |chardecode:fallback| error, since it is not the encoding
116 being used alternatively.
117
118 2008-09-06 Wakaba <wakaba@suika.fam.cx>
119
120 * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
121
122 2008-08-31 Wakaba <wakaba@suika.fam.cx>
123
124 * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
125
126 2008-08-31 Wakaba <wakaba@suika.fam.cx>
127
128 * HTML.pm.src: Bug fix and sync with the spec with regard
129 to after after frameset insertion mode processing (HTML5
130 revision 1909). Note that the implementation was wrong
131 per the old spec before the r1909 changes.
132
133 2008-08-30 Wakaba <wakaba@suika.fam.cx>
134
135 * HTMLTable.pm: scope=auto algorithm fix synced with the
136 spec (HTML5 revision 2093).
137 ($process_row): Algorithm step numbers synced with the
138 spec (HTML5 revision 2092).
139
140 2008-08-30 Wakaba <wakaba@suika.fam.cx>
141
142 * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
143 revision 2094).
144
145 2008-08-30 Wakaba <wakaba@suika.fam.cx>
146
147 * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
148
149 2008-08-30 Wakaba <wakaba@suika.fam.cx>
150
151 * HTML.pm.src: '"' and "'" at the end of attribute
152 name (after another attribute) now raise parse error (HTML5
153 revision 2123). Empty unquoted attribute values are no
154 longer allowed (HTML5 revision 2122).
155
156 2008-08-30 Wakaba <wakaba@suika.fam.cx>
157
158 * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
159 revision 2130).
160
161 2008-08-30 Wakaba <wakaba@suika.fam.cx>
162
163 * ContentChecker.pm: |xml:lang| attribute value must be same
164 as |lang| attribute value for HTML elements (HTML5 revision 2062
165 and so on).
166
167 2008-08-30 Wakaba <wakaba@suika.fam.cx>
168
169 * ContentChecker.pm: Error level definition for |xml_id_error|
170 was missing.
171
172 * URIChecker.pm: The end of the URL should be marked as the
173 error location for an empty path error. The position
174 between the userinfo and the port components should be
175 marked as the error location for an empty host error.
176
177 2008-08-30 Wakaba <wakaba@suika.fam.cx>
178
179 * URIChecker.pm: Set parameters representing where in the
180 value the error occurs for errors. Report unknown
181 address format error in warning level, since address
182 formats are rarely added. Path segments starting with "/.."
183 were misinterpreted as a dot-segment.
184
185 2008-08-30 Wakaba <wakaba@suika.fam.cx>
186
187 * URIChecker.pm (check_iri_reference): Requires
188 |Message::DOM::DOMImplementation|.
189
190 2008-08-29 Wakaba <wakaba@suika.fam.cx>
191
192 * IMTChecker.pm: Updated for the new error reporting architecture.
193
194 * ContentChecker.pm: Error levels for IMTs are added.
195
196 2008-08-17 Wakaba <wakaba@suika.fam.cx>
197
198 * H2H.pm (_shift_token): Support for unquoted HTML attribute
199 values.
200
201 2008-08-16 Wakaba <wakaba@suika.fam.cx>
202
203 * CacheManifest.pm: Support for new style of error
204 reports.
205
206 * HTML.pm.src: Set line=1, column=1 to the document node.
207
208 2008-08-16 Wakaba <wakaba@suika.fam.cx>
209
210 * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
211 and URL checkers. Support for more error levels for bogus
212 langauge tag and URL "standards".
213
214 * LangTag.pm, URIChecker.pm: Support for new style error
215 level reporting.
216
217 2008-08-15 Wakaba <wakaba@suika.fam.cx>
218
219 * ContentChecker.pm: Support for RDF/XML error levels.
220
221 * HTMLTable.pm, RDFXML.pm: Support for new style of error level
222 specifying. Error types are revised.
223
224 2008-08-15 Wakaba <wakaba@suika.fam.cx>
225
226 * ContentChecker.pm: All error reporting method calls are
227 renewed.
228
229 2008-08-15 Wakaba <wakaba@suika.fam.cx>
230
231 * HTML.pm.src: All error type names and "text" parameters
232 are revised. Use new style for "level" specification.
233
234 * mkhtmlparser.pl: Use new style for "level" specification.
235
236 2008-08-03 Wakaba <wakaba@suika.fam.cx>
237
238 * WebIDL.pm (parse_char_string): Simplified error
239 reporting process for broken ignored valuetype definition.
240 (Valuetype idl_text): Support for special "DOMString" name.
241
242 2008-08-03 Wakaba <wakaba@suika.fam.cx>
243
244 * WebIDL.pm ($get_scoped_name): Append "::::" if the last
245 terminal of the ScopedName is "DOMString", such that whether
246 the last part of the scoped name is "DOMString" or "_DOMString"
247 later. It is necessary to determine whether a |typedef|
248 definition should be ignored or not.
249 (parse_char_string): Unescape the identifier of
250 exception members.
251 ($resolve): Return undef for builtin types and sequence<T>
252 types (we might not have to do this, however...).
253 (check): Support checking for Exceptions, Valuetypes,
254 and Typedefs.
255 ($serialize_type): Support for "DOMString::::" syntax.
256 (Typedef idl_text): Output Type as "DOMString" if it
257 is really "DOMString" (i.e. its internal representation
258 is "::DOMString::").
259
260 2008-08-03 Wakaba <wakaba@suika.fam.cx>
261
262 * WebIDL.pm ($resolve): New code, based on resolve code
263 for constant types in the |check| method.
264 (check): Support for checking of attributes, operations, and
265 arguments.
266 (Attribute/Operation idl_text): Exception names in getraises,
267 setraises, and raises clauses is serizlied by |$serialize_type|
268 code.
269
270 2008-08-02 Wakaba <wakaba@suika.fam.cx>
271
272 * WebIDL.pm ($integer): Order of selections are changed to match
273 hexadecimal numbers (the original pattern, taken from the spec,
274 was not work for hexadecimal numbers, because the "0" prefix
275 matches to the [0-7]* part (as an empty string) and therefore
276 it does not match with remaining "x..." part of a "0x..." integer
277 literal.
278 ($get_type): It now returns a string, not an array reference,
279 for regular types and |sequence| types (i.e. it in any case
280 returns a string).
281 ($get_next_token): The second item in the array that represents
282 a integer or float token is now a Perl number value, not the
283 original string representation of the number.
284 (check): Support for const value consistency checking.
285 No extended attribute is defined for constants.
286 (Node subclasses): Use simple strings rather than array references
287 for default data type values.
288 ($serialize_type): Type values are now simple strings.
289 (value): If the new attribute value is a false value, then
290 a FALSE value is set to the attribute.
291
292 2008-08-02 Wakaba <wakaba@suika.fam.cx>
293
294 * WebIDL.pm ($get_scoped_name): Now scoped names are stored
295 in its stringified format ("scoped name" as defined in the
296 spec). Note that future version of this module should not use
297 array references for type values and the |type_text| attribute
298 should be made obsolete.
299 (parse_char_string): Unescape attribute names.
300 (check): Support for checking of whether inherited interfaces
301 are actually defined or not. Support for checking of whether
302 interface member identifiers are duplicated or not.
303 ($serialize_type): Scoped names are returned as is. A future
304 version of this code should escape identifiers other than "DOMString",
305 otherwise the idl_text would be non-conforming.
306
307 2008-08-02 Wakaba <wakaba@suika.fam.cx>
308
309 * WebIDL.pm (parse_char_string): Set line/column numbers
310 to generated nodes. Unescape identifiers. Extended attributes
311 for Definition's were ignored.
312 (append_child): Set |parent_node| attribute.
313 (parent_node): New attribute.
314 (check): Support interface/exception members. Support
315 extended attributes. Support definition identifier uniqueness
316 constraint.
317 (qualified_name): New attribute.
318 (Interface/Exception idl_text): Extended attributes were
319 not prepended to the returned text.
320
321 2008-08-02 Wakaba <wakaba@suika.fam.cx>
322
323 * WebIDL.pm (parse_char_string): Set line/column numbers
324 to interface object experimentally. s/shift/pop/g, shift
325 would make things wrong. Support for interface forward
326 declarations was missing. Broken interface declarations
327 with no block were not ignored entirely.
328 (Whatpm::WebIDL::Node): New abstract class. This class
329 makes things easier.
330 (child_nodes): New attribute. Unlike DOM's attribute with
331 same name, this attribute returns a dead list of nodes for
332 simplicity.
333 (get_user_data, set_user_data): New methods.
334 (Module idl_text): A SPACE character should be inserted
335 before the |{| character.
336 (Interface idl_text): Support for interface forward declarations.
337 (is_forward_declaration): New attribute.
338
339 2008-07-19 Wakaba <wakaba@suika.fam.cx>
340
341 * WebIDL.pm (type_text): Better serializer.
342
343 2008-07-19 Wakaba <wakaba@suika.fam.cx>
344
345 * WebIDL.pm: Revise forward-compatible parsing so that
346 it now can handle broken extended attributes and as such.
347
348 2008-07-19 Wakaba <wakaba@suika.fam.cx>
349
350 * WebIDL.pm: Real support for extended attributes.
351 Support for extended attributes with arguments.
352
353 2008-07-19 Wakaba <wakaba@suika.fam.cx>
354
355 * WebIDL.pm: Support for |exception| syntax.
356 (Interface->idl_text): Tentative support for inheritances.
357
358 2008-07-19 Wakaba <wakaba@suika.fam.cx>
359
360 * WebIDL.pm: Hierarchical scoped name support was broken.
361 Support for raises, setraises, and getraises syntaxes.
362
363 2008-07-18 Wakaba <wakaba@suika.fam.cx>
364
365 * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
366 proper support for types, extended attributes, and exceptions yet).
367 WebIDL parser, version 1 (no support for exceptions yet,
368 no proper support for extended attributes yet).
369
370 2008-07-09 Wakaba <wakaba@suika.fam.cx>
371
372 * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
373
374 2008-06-29 Wakaba <wakaba@suika.fam.cx>
375
376 * WebIDL.pm: Support for valuetype and const.
377
378 2008-06-29 Wakaba <wakaba@suika.fam.cx>
379
380 * WebIDL.pm: New module.
381
382 2008-06-15 Wakaba <wakaba@suika.fam.cx>
383
384 * Makefile (Entities.html): URI changed.
385
386 2008-06-08 Wakaba <wakaba@suika.fam.cx>
387
388 * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
389
390 2008-06-01 Wakaba <wakaba@suika.fam.cx>
391
392 * HTML.pm.src (_get_next_token): A parse error was missing.
393
394 2008-06-01 Wakaba <wakaba@suika.fam.cx>
395
396 * mklinktypelist.pl: rel=contact is no longer part of the HTML5
397 spec (commented out). (HTML5 revision 1711).
398
399 2008-05-25 Wakaba <wakaba@suika.fam.cx>
400
401 * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
402
403 * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
404 as UTF-16 (HTML5 revision 1701).
405
406 2008-05-25 Wakaba <wakaba@suika.fam.cx>
407
408 * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
409 1692).
410
411 2008-05-25 Wakaba <wakaba@suika.fam.cx>
412
413 * HTML.pm.src: The secondary insertion mode used when switching
414 to foreign content is the "in body" insertion mode (HTML5 revision
415 1696).
416
417 2008-05-25 Wakaba <wakaba@suika.fam.cx>
418
419 * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
420 revision 1697).
421
422 2008-05-24 Wakaba <wakaba@suika.fam.cx>
423
424 * HTML.pm.src: Support for end-of-file token in foreign content
425 insertion mode (HTML5 revision 1693). Update SVG camelCase
426 attribute list (HTML5 revision 1700). <textarea> closes
427 </select> (HTML5 revision 1699). More start tags close in
428 foreign content insertion mode (HTML5 revision 1698).
429
430 2008-05-24 Wakaba <wakaba@suika.fam.cx>
431
432 * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
433
434 2008-05-24 Wakaba <wakaba@suika.fam.cx>
435
436 * HTML.pm.src: More robust charset parameter detection (HTML5
437 revision 1674).
438
439 2008-05-24 Wakaba <wakaba@suika.fam.cx>
440
441 * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
442 revision 1676).
443
444 2008-05-24 Wakaba <wakaba@suika.fam.cx>
445
446 * HTML.pm.src: Ignore language part of public identifiers for
447 quriks mode detection (HTML5 revision 1679).
448
449 2008-05-24 Wakaba <wakaba@suika.fam.cx>
450
451 * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
452 revision 1685).
453
454 2008-05-24 Wakaba <wakaba@suika.fam.cx>
455
456 * HTML.pm.src: Support for EOF in new states for tags (HTML5
457 revision 1684).
458
459 2008-05-24 Wakaba <wakaba@suika.fam.cx>
460
461 * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
462 work (HTML5 revision 1690).
463
464 2008-05-24 Wakaba <wakaba@suika.fam.cx>
465
466 * HTML.pm.src (_tree_construction_main): Change handling of
467 end tags in head insertion modes (HTML5 revision 1686).
468 (parse_char_string): Bug fix for non-utf8 character string handlings.
469 (parse_char_stream): |ungetc| does not work well for this context.
470
471 2008-05-18 Wakaba <wakaba@suika.fam.cx>
472
473 * HTML.pm.src (parse_byte_string): Redefined to invoke
474 |parse_byte_stream|.
475 (parse_byte_stream): New method.
476
477 2008-05-18 Wakaba <wakaba@suika.fam.cx>
478
479 * HTML.pm.src (parse_byte_string): Fix the column number reported
480 by encoding layer error reporter.
481
482 2008-05-17 Wakaba <wakaba@suika.fam.cx>
483
484 * HTML.pm.src (parse_byte_string): Use streaming decoder
485 rather than converting the whole byte string and then parsing.
486 Propagate errors in character encoding layer.
487 (get_next_token): Precise error reporting for |bare stago| error.
488
489 2008-05-17 Wakaba <wakaba@suika.fam.cx>
490
491 * HTML.pm.src (parse_char_stream): New method.
492 (parse_char_string): This method is now defined as an invocation
493 of the |parse_char_stream| method.
494
495 2008-05-17 Wakaba <wakaba@suika.fam.cx>
496
497 * HTML.pm.src (parse_byte_string): Report various status
498 of the sniffing as info-level errors. Support for new
499 decoding framework in parser resestting.
500 (new): Various default error levels were not set.
501
502 2008-05-17 Wakaba <wakaba@suika.fam.cx>
503
504 * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
505 algorithm, except for the actual sniffing, is implemented
506 with new framework with Message::Charset::Info.
507
508 2008-05-16 Wakaba <wakaba@suika.fam.cx>
509
510 * CacheManifest.pm (_parse): Drop fragment identifiers from
511 URIs in fallback section (HTML5 revision 1596).
512
513 2008-05-10 Wakaba <wakaba@suika.fam.cx>
514
515 * Makefile (Entities.html): URI has changed.
516
517 2008-05-10 Wakaba <wakaba@suika.fam.cx>
518
519 * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
520 1553).
521
522 2008-05-06 Wakaba <wakaba@suika.fam.cx>
523
524 * ContentChecker.pm: Noted that those returned in |table| are
525 no longer table elements, but table objects returned
526 by Whatpm::HTMLTable.
527
528 * HTMLTable.pm (form_table): Return table element node
529 as |$table->{element}|.
530 (assign_header): Support for the |headers=""| attribute.
531
532 2008-05-06 Wakaba <wakaba@suika.fam.cx>
533
534 * HTMLTable.pm (assign_header): New function; first version
535 with no support for headers="".
536 (form_table): Include table width and height to the returned
537 table object for covenience. Indexing in column assignement was wrong.
538 Set whether a data cell is empty or not for convenience.
539
540 2008-05-05 Wakaba <wakaba@suika.fam.cx>
541
542 * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
543
544 2008-05-05 Wakaba <wakaba@suika.fam.cx>
545
546 * HTMLTable.pm: How table model errors are detected is
547 changed (HTML5 revision 1387).
548
549 2008-05-05 Wakaba <wakaba@suika.fam.cx>
550
551 * HTMLTable.pm: The algorithm now moves |tfoot| elements
552 to the end of the table (HTML5 revision 1380).
553
554 2008-05-05 Wakaba <wakaba@suika.fam.cx>
555
556 * HTMLTable.pm: The algorithm is now 0-based indexing, instead
557 of 1-based (HTML5 revision 1376).
558
559 2008-05-05 Wakaba <wakaba@suika.fam.cx>
560
561 * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
562 and "Content-Encoding" no longer prevent sniffing (HTML5
563 revision 1288).
564
565 2008-05-05 Wakaba <wakaba@suika.fam.cx>
566
567 * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
568 1282).
569
570 2008-05-03 Wakaba <wakaba@suika.fam.cx>
571
572 * ContentChecker.pm: Support for global attributes.
573 Status of XML specs are added.
574
575 2008-05-03 Wakaba <wakaba@suika.fam.cx>
576
577 * ContentChecker.pm (check_element): Support for |template=""|
578 and |ref=""| attribute (referent element type checking).
579
580 2008-04-29 Wakaba <wakaba@suika.fam.cx>
581
582 * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
583 1500) is implemented (except for IDNA part and URI-scheme-specific
584 knowledge). Line number counting was wrong for LF-only
585 documents.
586
587 2008-04-13 Wakaba <wakaba@suika.fam.cx>
588
589 * HTML.pm.src: Raise an parse error for any disallowed
590 character (HTML5 revision 1263).
591
592 2008-04-13 Wakaba <wakaba@suika.fam.cx>
593
594 * mkentitylist.pl: Support for new HTML5 entity table format (the
595 definition for |AElig;| was missing).
596
597 2008-04-13 Wakaba <wakaba@suika.fam.cx>
598
599 * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
600 name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
601
602 2008-04-13 Wakaba <wakaba@suika.fam.cx>
603
604 * HTML.pm.src: List of element names that close foreign content
605 insertion mode is added (HTML5 revisions 1412 and 1418).
606
607 2008-04-13 Wakaba <wakaba@suika.fam.cx>
608
609 * HTML.pm.src: Support for |mglyph| and |malignmark|
610 elements (HTML5 revision 1410).
611
612 2008-04-13 Wakaba <wakaba@suika.fam.cx>
613
614 * HTML.pm.src: Support for new long MathML entities (HTML5
615 revision 1406).
616
617 2008-04-13 Wakaba <wakaba@suika.fam.cx>
618
619 * HTML.pm.src: CDATA section support for MathML and SVG
620 elements (HTML5 revision 1404 and 1420).
621
622 2008-04-12 Wakaba <wakaba@suika.fam.cx>
623
624 * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
625 elements (HTML5 revision 1404). Unused !!!macro definitions
626 are removed.
627
628 2008-04-12 Wakaba <wakaba@suika.fam.cx>
629
630 * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
631 are raised is changed (HTML5 revision 1404).
632
633 2008-04-06 Wakaba <wakaba@suika.fam.cx>
634
635 * HTML.pm.src: Category sets that are no longer used are removed.
636
637 2008-04-06 Wakaba <wakaba@suika.fam.cx>
638
639 * HTML.pm.src: The ->[1] property of stack entries are now
640 replaced by constants representing element category.
641
642 2008-04-06 Wakaba <wakaba@suika.fam.cx>
643
644 * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
645 for error reporting. (This is a preparation for using constant
646 value for ->[1].)
647
648 2008-03-22 Wakaba <wakaba@suika.fam.cx>
649
650 * RDFXML.pm: Typo fixed.
651
652 2008-03-22 Wakaba <wakaba@suika.fam.cx>
653
654 * ContentChecker.pm: |fact_level| is now treated
655 as same as |must_level|, i.e. level = |m|.
656 (check_element): Make list of URIs in the DOM.
657
658 2008-03-21 Wakaba <wakaba@suika.fam.cx>
659
660 * RDFXML.pm: Language accessor implemented.
661 Local (null-namespace) attribute support.
662
663 2008-03-21 Wakaba <wakaba@suika.fam.cx>
664
665 * RDFXML.pm: Factored out ID checking code.
666
667 2008-03-21 Wakaba <wakaba@suika.fam.cx>
668
669 * RDFXML.pm: TODO items noted. Validation of ID
670 and URI attributes is implemented. Warn if unknown
671 value is used in rdf:parseType="" attribute.
672
673 * URIChecker.pm (check_rdf_uri_reference): New function.
674
675 2008-03-21 Wakaba <wakaba@suika.fam.cx>
676
677 * RDFXML.pm: bnodeid implemented. Relative references
678 are now resolved.
679
680 2008-03-21 Wakaba <wakaba@suika.fam.cx>
681
682 * ContentChecker.pm: RDF reification implemented.
683
684 * RDFXML.pm: undef vs false bug fixed.
685 Reification implemented.
686
687 2008-03-21 Wakaba <wakaba@suika.fam.cx>
688
689 * RDFXML.pm: s/id/ID/ for attribute name.
690 The |node| arguments are added for |ontriple| calls.
691 Too many "attribute not allowed" errors were raised.
692
693 * ContentChecker.pm: Initial experimental support for rdf:RDF
694 element.
695
696 2008-03-21 Wakaba <wakaba@suika.fam.cx>
697
698 * RDFXML.pm: New module.
699
700 2008-03-20 Wakaba <wakaba@suika.fam.cx>
701
702 * HTML.pm.src (set_inner_html): Line/column number
703 code was old one yet.
704
705 2008-03-20 Wakaba <wakaba@suika.fam.cx>
706
707 * HTML.pm.src: Better line/column reporting for "duplicate attribute"
708 errors. Line/column markings for DOCTYPE, comment, and
709 character tokens are reintroduced; otherwise,
710 error location for "not HTML5" error and errors
711 for implied elements are not attached.
712
713 2008-03-20 Wakaba <wakaba@suika.fam.cx>
714
715 * HTML.pm.src: Set line/column numbers to attributes.
716
717 * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
718 (value): Setter implemented.
719
720 * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
721
722 2008-03-20 Wakaba <wakaba@suika.fam.cx>
723
724 * HTML.pm.src: Unused line/column markings are removed.
725
726 2008-03-20 Wakaba <wakaba@suika.fam.cx>
727
728 * HTML.pm.src (_get_next_token): Remove |first_start_tag|
729 flag, which is no longer used.
730
731 2008-03-17 Wakaba <wakaba@suika.fam.cx>
732
733 * HTML.pm.src: Set line/column information to element nodes.
734
735 * mkhtmlparser.pl (!!!create-element, !!!insert-element,
736 and !!!insert-element-t): Set line/column information to
737 element nodes.
738
739 2008-03-17 Wakaba <wakaba@suika.fam.cx>
740
741 * HTML.pm.src (_get_next_token): The first "<" character
742 in "<?", "<>", or "</>" should be the error point.
743
744 2008-03-16 Wakaba <wakaba@suika.fam.cx>
745
746 * HTML.pm.src: Some more fixes on error position reporting.
747
748 2008-03-16 Wakaba <wakaba@suika.fam.cx>
749
750 * HTML.pm.src: Token-level precious error reporting.
751
752 2008-03-16 Wakaba <wakaba@suika.fam.cx>
753
754 * HTML.pm.src: Preparation for more precious error point
755 reporting.
756
757 2008-03-11 Wakaba <wakaba@suika.fam.cx>
758
759 * HTML.pm.src: Error type revised.
760
761 2008-03-11 Wakaba <wakaba@suika.fam.cx>
762
763 * HTML.pm.src: Similar codes are merged together, again.
764
765 2008-03-11 Wakaba <wakaba@suika.fam.cx>
766
767 * HTML.pm.src: Similar codes are merged together.
768
769 2008-03-10 Wakaba <wakaba@suika.fam.cx>
770
771 * mkhtmlparser.pl: Set "level" parameter to parse errors.
772
773 * HTML.pm.src: Code refined.
774
775 2008-03-09 Wakaba <wakaba@suika.fam.cx>
776
777 * HTML.pm.src: |</body>| treatement has been changed (HTML5
778 revision 1348). Note that I really don't know this makes
779 any difference in the black-box behavior of the parser.
780
781 2008-03-09 Wakaba <wakaba@suika.fam.cx>
782
783 * HTML.pm.src: New end-of-file token implementation (HTML5
784 revision 1348).
785
786 2008-03-09 Wakaba <wakaba@suika.fam.cx>
787
788 * HTML.pm.src: |applet| support (HTML5 revision 1347).
789
790 2008-03-09 Wakaba <wakaba@suika.fam.cx>
791
792 * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
793
794 2008-03-09 Wakaba <wakaba@suika.fam.cx>
795
796 * HTML.pm.src: Support for |<input>| in the "in select" insertion
797 mode and support for the "in select in table" insertion mode (HTML5
798 revision 1342).
799
800 2008-03-09 Wakaba <wakaba@suika.fam.cx>
801
802 * HTML.pm.src: No foster parenting for <script> and <script>
803 in non-tainted <table>s (HTML5 revision 1336).
804
805 2008-03-09 Wakaba <wakaba@suika.fam.cx>
806
807 * HTML.pm.src: Ignore white space characters between <html>
808 and <head> (HTML5 revision 1332).
809
810 2008-03-09 Wakaba <wakaba@suika.fam.cx>
811
812 * HTML.pm.src: Treat <input type=hidden> as if it were a
813 white space (HTML5 revision 1331).
814
815 2008-03-08 Wakaba <wakaba@suika.fam.cx>
816
817 * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
818 element (HTML5 revision 1330).
819
820 2008-03-08 Wakaba <wakaba@suika.fam.cx>
821
822 * HTML.pm.src: <title> is always appended to the current
823 element (HTML5 revision 1328).
824
825 2008-03-08 Wakaba <wakaba@suika.fam.cx>
826
827 * HTML.pm.src: White space in tainted tables are moved
828 into foster parents (HTML5 revision 1326).
829
830 2008-03-08 Wakaba <wakaba@suika.fam.cx>
831
832 * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
833 revision 1321).
834
835 2008-03-08 Wakaba <wakaba@suika.fam.cx>
836
837 * HTML.pm.src: |</h/n/>| case code rearranged to align with
838 the spec (HTML5 revision 1320). Note that we finally complete
839 all of HTML5 revision 1320 changes.
840
841 2008-03-08 Wakaba <wakaba@suika.fam.cx>
842
843 * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
844 tags (HTML5 revision 1320).
845
846 2008-03-08 Wakaba <wakaba@suika.fam.cx>
847
848 * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
849
850 2008-03-08 Wakaba <wakaba@suika.fam.cx>
851
852 * HTML.pm.src: A "generate implied end tags" code (t409.1)
853 could not be reached so that it is now removed (HTML5 revision
854 1320).
855
856 2008-03-08 Wakaba <wakaba@suika.fam.cx>
857
858 * HTML.pm.src: Code for the case of |</div>| and so on are revised
859 to align with new spec text (HTML5 revision 1320).
860
861 2008-03-08 Wakaba <wakaba@suika.fam.cx>
862
863 * HTML.pm.src: Remove strange |if| condition; however, it should
864 have had no harm in theory.
865
866 2008-03-08 Wakaba <wakaba@suika.fam.cx>
867
868 * HTML.pm.src (_tree_construction_main): '</p>' in body
869 case is split from other end tags for the preparation
870 of implementing HTML revision 1320.
871
872 2008-03-07 Wakaba <wakaba@suika.fam.cx>
873
874 * HTML.pm.src: Simplified "generate implied end tag" (HTML5
875 revision 1320).
876
877 2008-03-07 Wakaba <wakaba@suika.fam.cx>
878
879 * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
880 and "div" (HTML5 revision 1318). Add comments to where
881 |form| pointer association codes should be inserted (HTML5 revision
882 1319).
883
884 2008-03-06 Wakaba <wakaba@suika.fam.cx>
885
886 * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
887
888 2008-03-05 Wakaba <wakaba@suika.fam.cx>
889
890 * HTML.pm.src: Since the case t268 should never be reached (no
891 other token type, there are), it is replaced by a |die| statement.
892
893 2008-03-05 Wakaba <wakaba@suika.fam.cx>
894
895 * HTML.pm.src: Typo fixed.
896
897 2008-03-04 Wakaba <wakaba@suika.fam.cx>
898
899 * HTML.pm.src (_tree_construction_initial): Some limited quirks
900 doctypes were not uppercased for comparison.
901
902 2008-03-03 Wakaba <wakaba@suika.fam.cx>
903
904 * HTML.pm.src (tree construction and set_inner_html): Checkpoints
905 are added.
906
907 2008-03-03 Wakaba <wakaba@suika.fam.cx>
908
909 * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
910 are set. Cases that are unlikely reached are noted as so.
911
912 2008-03-03 Wakaba <wakaba@suika.fam.cx>
913
914 * HTML.pm.src: Checkpoints for debugging are added.
915
916 * mkhtmlparser.pl: Support for |!!!cp| syntax.
917
918 2008-03-03 Wakaba <wakaba@suika.fam.cx>
919
920 * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
921 for simplicity.
922
923 2008-03-03 Wakaba <wakaba@suika.fam.cx>
924
925 * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
926 revision 1307).
927
928 2008-03-03 Wakaba <wakaba@suika.fam.cx>
929
930 * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
931 is set are changed (HTML5 revision 1305).
932
933 2008-03-02 Wakaba <wakaba@suika.fam.cx>
934
935 * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
936 1292). Entities are not parsed in comment-like part in RCDATA
937 elements (HTML5 revision 1294). Allow bare & at the end
938 of attribute value literals (HTML5 revision 1296). More
939 quirks mode doctypes (HTML5 revision 1302). Requires spaces
940 between attributes and ban attribute names or unquoted
941 attribute values containing single or double quotes (HTML5
942 revision 1303).
943
944 2008-03-02 Wakaba <wakaba@suika.fam.cx>
945
946 * ContentChecker.pm: Typo fixed. Don't raise "character encoding"
947 and related errors unless it is an HTML document (though the spec
948 is unclear on whether it is applied to XHTML document).
949
950 * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
951 are added.
952
953 2008-03-02 Wakaba <wakaba@suika.fam.cx>
954
955 * HTML.pm.src: s/local_name/manakai_local_name/g.
956
957 2008-03-01 Wakaba <wakaba@suika.fam.cx>
958
959 * _NamedEntityList.pm: Updated (HTML5 revision 1286).
960
961 * HTML.pm.src: |charset| in |content| attribute is
962 case-insensitive (HTML5 revision 1270).
963
964 2008-02-26 Wakaba <wakaba@suika.fam.cx>
965
966 * ContentChecker.pm: New status constants are added.
967 ($ElementDefault): |status| added.
968 (check_element): Err for non-standard or deprecated elements.
969 (_attr_status_info): For non-standard or deprecated attributes.
970
971 2008-02-24 Wakaba <wakaba@suika.fam.cx>
972
973 * ContentChecker.pm (_attr_status_info): New internal method.
974
975 2008-02-24 Wakaba <wakaba@suika.fam.cx>
976
977 * ContentChecker.pm (check_element): Element standardized
978 status information is now dispatched.
979
980 2008-02-24 Wakaba <wakaba@suika.fam.cx>
981
982 * ContentChecker.pm (check_element): Fix |del|-and-significant
983 problem by adding some more arguments.
984
985 2008-02-24 Wakaba <wakaba@suika.fam.cx>
986
987 * ContentChecker.pm (check_element): Use context of
988 container-for-the-purpose-of-content-model element (not transparent
989 element) for |check_child_element| calling and significant
990 text flag marking. This reintroduces |<del>|-and-significant
991 problem again.
992
993 2008-02-24 Wakaba <wakaba@suika.fam.cx>
994
995 * ContentChecker.pm (check_element): Make semi-transparent
996 elements ignored for the purpose of phase changes in
997 content model checking.
998
999 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1000
1001 * ContentChecker.pm (check_element): In-element state
1002 was not properly managed for transparent cases.
1003
1004 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1005
1006 * ContentChecker.pm (check_element): Support for |video|
1007 and |audio| as semi-transparent elements.
1008
1009 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1010
1011 * ContentChecker.pm ($HTMLSemiTransparentElements): New.
1012 (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
1013 Support for |html:object| as a semi-transparent element.
1014
1015 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1016
1017 * ContentChecker.pm (check_element): The way to traverse
1018 the tree is entirely revised to make it easier to track
1019 the state of ancestors/descendants. As a result of this
1020 revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1021 support for content model checking for HTML elements |figure|,
1022 |object|, |video|, and |audio| and checking for XML elements (and
1023 some XMLNS checkings) are dropped for now. They will be
1024 reimplemented in due cource.
1025
1026 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1027
1028 * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1029 token (HTML5 revision 1225).
1030
1031 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1032
1033 * ContentChecker.pm ({unsupported_level}): New value.
1034
1035 * HTML.pm.src: Save whether |meta| |content| attribute
1036 contains character references or not.
1037
1038 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1039
1040 * ContentChecker.pm (_get_children): (Incompleted) attempt to
1041 imlement significant content checking for contents
1042 with |del| elements.
1043
1044 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1045
1046 * ContenteChecker.pm ($HTMLTransparentElements): More
1047 elements are added.
1048 (_get_children): HTML |object| elements are now semi-transparent.
1049
1050 * NanoDOM.pm (manakai_html, manakai_head): New methods.
1051
1052 2008-02-16 Wakaba <wakaba@suika.fam.cx>
1053
1054 * CacheManifest.pm: HTML5 revision 1211 implemented.
1055
1056 * CacheManifest.pod: Updated.
1057
1058 2008-02-10 Wakaba <wakaba@suika.fam.cx>
1059
1060 * ContentChecker.pm (check_document, check_element): Support
1061 for second argument ($onsubdoc).
1062 (_get_css_parser): Removed (now it is part of WDCC).
1063
1064 2008-02-09 Wakaba <wakaba@suika.fam.cx>
1065
1066 * ContentChecker.pm (_get_css_parser): New.
1067
1068 2007-11-25 Wakaba <wakaba@suika.fam.cx>
1069
1070 * ContentChecker.pm ($AnyChecker): Old way to add child elements
1071 for checking had been used.
1072
1073 2007-11-25 Wakaba <wakaba@suika.fam.cx>
1074
1075 * ContentChecker.pm (check_element): New todo item type |descendant|.
1076
1077 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1078
1079 * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1080 specced in RFC 2046 and RFC 4288.
1081 (application/atom+xml): Definition added.
1082
1083 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1084
1085 * URIChecker.pm: Make RFC 3986 should-level errors
1086 warnings (rather than SHOULD-level errors).
1087
1088 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1089
1090 * NanoDOM.pm (get_user_data, set_user_data): New methods.
1091
1092 * HTML.pm.src: A flag for character references in attribute
1093 values are added. Set |manakai_has_reference| user data
1094 to |charset| attribute.
1095
1096 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1097
1098 * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1099 attributes.
1100
1101 * ContentChecker.pm (check_document): Warn if charset requirements
1102 cannot be tested.
1103
1104 2007-11-19 Wakaba <wakaba@suika.fam.cx>
1105
1106 * HTML.pm.src (parse_byte_string): Detect charset
1107 by universalchardet if charset parameter is not specified.
1108
1109 * Makefile (Charset-all, Charset-clean): New rules.
1110
1111 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1112
1113 * ContentChecker.pm (check_document): Check the existence
1114 of character encoding declaration and selection of encoding
1115 for HTML document.
1116
1117 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1118
1119 * ContentType.pm (get_sniffed_type): Return also the official
1120 type in list context.
1121
1122 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1123
1124 * ContentType.pm: Sniffing with leading white space
1125 ignoring (HTML5 revisions 1013 and 1016).
1126
1127 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1128
1129 * ContentType.pm: HTML5 revision 1013 changes, except for leading
1130 white spaces, are implemented.
1131
1132 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1133
1134 * HTML.pm.src (parse_char_string): Set |inner_encoding|
1135 attribute if possible.
1136
1137 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1138
1139 * HTML.pm.src (parse_byte_string): New method.
1140 (parse_char_string): New alias for |parse_string|.
1141 (main phase): Invoking "change the encoding" algorithm if desired.
1142
1143 * HTML.pod: Updated.
1144
1145 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1146
1147 * HTML.pod (get_inner_html): Removed.
1148
1149 * Makefile (HTML-all, HTML-clean): New.
1150
1151 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1152
1153 * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1154
1155 2007-11-08 Wakaba <wakaba@suika.fam.cx>
1156
1157 * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1158
1159 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1160
1161 * Makefile: |CacheManifest.html| is added.
1162
1163 * CacheManifest.pod: New file.
1164
1165 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1166
1167 * CacheManifest.pm: New module.
1168
1169 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1170
1171 * HTML.pm.src: Support for application cache selection algorithm
1172 callback.
1173
1174 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1175
1176 * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1177
1178 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1179
1180 * Makefile (clean): New rule.
1181
1182 * NanoDOM.pm (public_id, system_id): New attributes.
1183
1184 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1185
1186 * Makefile (CSS-all, CSS-clean, clean): New rules.
1187
1188 2007-10-14 Wakaba <wakaba@suika.fam.cx>
1189
1190 * ContentChecker.pm (check_document): Support for
1191 new |is_xml_root| flag.
1192 (check_element): Support for new |pluses| state.
1193 (_add_pluses): New method.
1194 (_remove_minuses): Support for new |minus| item.
1195
1196 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1197
1198 * ContentChecker.pm: Raise specific error for invalid
1199 root element.
1200
1201 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1202
1203 * ContentChecker.pm: Set level values for later uses.
1204
1205 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1206
1207 * ContentChecker.pm: Support for language tag validation.
1208
1209 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1210
1211 * LangTag.pm (check_rfc3066_language_tag): New method.
1212
1213 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1214
1215 * LangTag.pm: New module.
1216
1217 2007-09-04 Wakaba <wakaba@suika.fam.cx>
1218
1219 * HTML.pm.src: Some error types were wrong.
1220
1221 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1222
1223 * CSS/: New directory.
1224
1225 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1226
1227 * ContentChecker.pm (_check_get_children): Support
1228 for |noscript| in |head|.
1229
1230 2007-08-12 Wakaba <wakaba@suika.fam.cx>
1231
1232 * URI/: New directory.
1233
1234 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1235
1236 * HTML.pm.src: Tokenizer's states are now represented in
1237 number.
1238
1239 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1240
1241 * HTML.pm.src: |or|s for insertion modes are replaced
1242 by |&|s.
1243
1244 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1245
1246 * HTML.pm.src: Token types are now represented in number.
1247
1248 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1249
1250 * ContentType.pm (SEE ALSO): Updated.
1251
1252 * HTML.pm.src: Insertion modes are now represented in number.
1253
1254 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1255
1256 * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1257 999) is implemented.
1258
1259 2007-08-08 Wakaba <wakaba@suika.fam.cx>
1260
1261 * ContentType.pm: Sniffing for |<script| (HTML5 revision
1262 983) is implemented.
1263
1264 2007-08-06 Wakaba <wakaba@suika.fam.cx>
1265
1266 * ContentChecker.pod: New documentation.
1267
1268 * Makefile: A rule for |ContentChecker.html| is added.
1269
1270 * ContentChecker.pm: A pod "LICENSE" section is added.
1271
1272 * NanoDOM.pm ($VERSION): New variable.
1273
1274 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1275
1276 * H2H.pm: |b|, |i|, and |sub| are added to the
1277 list of allowed HTML elements.
1278
1279 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1280
1281 * H2H.pm: |samp| is added to the list of allowed
1282 HTML elements.
1283
1284 * URIChecker.pm (check_iri): New.
1285 (check_iri_reference): Error type for IRI reference
1286 syntax error is changed.
1287
1288 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1289
1290 * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1291 is added.
1292 (check_document): Load appropriate module before validation.
1293
1294 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1295
1296 * ContentChecker/: New directory.
1297
1298 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1299
1300 * ContentChecker.pm: HTML |time| element is implemented.
1301
1302 * HTMLTable.pm: Comments are updated as HTML5 is revised.
1303
1304 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1305
1306 * ContentChecker.pm (check_document): Return value
1307 even if no document element is found.
1308
1309 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1310
1311 * HTML.pm.src: |$in_body| is no longer a function.
1312
1313 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1314
1315 * HTML.pm.src: The |$in_body| code has been moved down.
1316
1317 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1318
1319 * HTML.pm.src: The "trailing end" insertion mode
1320 is split into "after html body" and "after html frameset"
1321 insertion modes. Their codes are merged with "after body"
1322 and "after frameset" codes. |$previous_insertion_mode|
1323 has been removed. "after frameset" code is
1324 merged with "in frameset" code.
1325
1326 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1327
1328 * HTML.pm.src: The "before head" insertion mode is
1329 merged with the "in head" insertion mode.
1330
1331 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1332
1333 * HTML.pm.src: Cases in "in head" insertion mode are
1334 reorganized.
1335
1336 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1337
1338 * HTML.pm.src: Some cases in "in table" insertion mode
1339 are merged.
1340
1341 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1342
1343 * HTML.pm.src: The "in row" insertion mode is merged
1344 with "in table" insertion mode.
1345
1346 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1347
1348 * HTML.pm.src: The "in table" and "in table body" insertion
1349 modes are merged.
1350
1351 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1352
1353 * HTML.pm.src: There is no "in table head" or "in table foot"
1354 insertion mode!
1355
1356 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1357
1358 * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
1359 now directly invoke the handler.
1360
1361 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1362
1363 * HTML.pm.src: Codes for "in cell" insertion mode
1364 is merged to the "in body" insertion mode code.
1365
1366 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1367
1368 * HTML.pm.src: Codes for "in body" and "in caption"
1369 insertion modes are merged.
1370
1371 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1372
1373 * HTML.pm.src: Two |!!!next-token|s were missing.
1374
1375 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1376
1377 * HTML.pm.src: Use numeric constant for |{content_mode}|
1378 instead of string constant for |{content_model_flag}|.
1379
1380 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1381
1382 * HTML.pm.src: Add the name of the attribute
1383 to the "duplicate attribute" error.
1384
1385 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1386
1387 * ContentChecker.pm: Return the |class| node list.
1388
1389 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1390
1391 * ContentChecker.pm: Return the |id| node list.
1392
1393 * HTML.pm.src: A typo is fixed.
1394
1395 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1396
1397 * ContentChecker.pm: Drop wrong |level => 'error'| specification
1398 from "in HTML:xml:lang" error. Character position
1399 is now the last part of the error type in the URI error
1400 description. Report "unsupported" status for language
1401 tags, media queries, script codes, and style sheets.
1402
1403 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1404
1405 * ContentChecker.pm: Report error if |xml:lang|
1406 in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
1407 in XML.
1408
1409 * NanoDOM.pm (Attr.owner_document): New attribute.
1410
1411 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1412
1413 * HTML.pm.src: The character immediately following
1414 a bare |hcro| was discarded. Fix handling of
1415 entity references in attribute values.
1416
1417 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1418
1419 * HTML.pm.src (main and trailin end phases): Token
1420 types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
1421 factored out. Error types |in html:#DOCTYPE|
1422 and |after html:#DOCTYPE| are merged
1423 into |DOCTYPE in the middle|. |</frameset>|
1424 in fragment parsing mode changed the
1425 insertion mode.
1426
1427 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1428
1429 * HTML.pm.src: |$phase| has been removed; The |trailing end|
1430 phase is now an insertion mode. Treatments for white
1431 space character tokens were incorrect for some
1432 insertion modes. An old |meta| case was not removed.
1433
1434 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1435
1436 * HTML.pm.src: |meta| charset declaration extraction
1437 implemented (but changing the encoding is not yet:-).
1438
1439 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1440
1441 * Charset/: New directory.
1442
1443 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1444
1445 * H2H.pm: New Perl module (created from
1446 manakai's H2H.dis).
1447
1448 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1449
1450 * XMLSerializer.pm: New Perl module (created from
1451 manakai's SimpleLS.dis).
1452
1453 2007-07-07 Wakaba <wakaba@suika.fam.cx>
1454
1455 * HTML.pm.src: |<!---x-->| was not processed correctly.
1456
1457 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1458
1459 * HTML.pm.src: Report correct error message
1460 for |<body></div></body>|.
1461
1462 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1463
1464 * HTMLTable.pm: An error description was incorrect.
1465
1466 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1467
1468 * ContentChecker.pm: Return |{term}| list.
1469
1470 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1471
1472 * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
1473 nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
1474 and <title> outside of head).
1475
1476 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1477
1478 * IMTChecker.pm: Report warning for unregistered
1479 and private types/subtypes.
1480
1481 * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
1482 URIChecker.pm, HTMLTable.pm: Error messages are now
1483 consistent; they are all listed in
1484 <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
1485
1486 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1487
1488 * ContentChecker.pm: |<img ismap>| not in |<a></a>|
1489 is now erred. |<datalist>| is implemented.
1490 Attribute checker for |<command>| and |<menu>| are
1491 added. Support for |contextmenu| global attribute
1492 is added.
1493
1494 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1495
1496 * HTML.pm.src (_reset_insertion_mode): Interpretation
1497 of Step 3 has been changed.
1498
1499 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1500
1501 * HTML.pm.src: Late |<html>| parse error is implemented.
1502
1503 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1504
1505 * URIChecker.pm (check_iri_reference): A |decode| method name was
1506 incorrect.
1507
1508 * ContentChecker.pm: Support for the |footer| element.
1509 Check URI syntax for space-separated URI attributes.
1510 Support for the |tabindex| attribute. Support
1511 for |datetime| attribute.
1512
1513 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1514
1515 * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
1516 character references). HTML5 revision 1.146 (white space
1517 characters before root start tag). HTML5 revision
1518 1.148 (named character references in attribute values).
1519 HTML5 revision 1.152 (<plaintext>.innerHTML get).
1520
1521 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1522
1523 * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
1524 in <head>).
1525
1526 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1527
1528 * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>
1529 in body).
1530
1531 * ContentChecker.pm: HTML5 revision 938 (scoped="").
1532
1533 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1534
1535 * HTML.pm.src: HTML5 revision 923 (matching end tag in CDATA
1536 or RCDATA in fragment parsing mode). HTML5 revision
1537 924 (<!--> and <!--->). HTML5 revision 926 (hn in hn).
1538
1539 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1540
1541 * HTML.pm.src (get_inner_html): HTML5 revision 922 (inner_html
1542 for <pre> and <textarea>).
1543
1544 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1545
1546 * HTML.pm.src: HTML5 revision 920 (<isindex>).
1547
1548 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1549
1550 * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).
1551
1552 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1553
1554 * HTML.pm.src: HTML5 revision 916 (</body>).
1555 HTML5 revision 917 (conforming bare &).
1556
1557 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1558
1559 * NanoDOM.pm (manakai_is_html): Setting to false did not work.
1560
1561 * HTML.pm.src: HTML5 revision 914 (</ in CDATA, RCDATA).
1562 HTML5 revision 915 (<nobr>).
1563
1564 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1565
1566 * HTML.pm.src: HTML5 revisions 908, 909, 912, and 913 (quirks mode).
1567
1568 * NanoDOM.pm (manakai_is_html, manakai_compat_mode, compat_mode):
1569 New attributes.
1570
1571 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1572
1573 * HTML.pm.src: HTML5 revisions 900, 901, 902, and 911 (<
1574 in tags).
1575
1576 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1577
1578 * .cvsignore: |Entities.html| is added.
1579
1580 * HTML.pm.src: |$entity_char| is removed and
1581 requires |Whatpm::_NamedEntityList| instead.
1582 HTML5 revision 898 (refc), except that lack of refc
1583 is parse error.
1584
1585 * mkentitylist.pl: New script.
1586
1587 * Makefile (all): |_NamedEntityList.pm| is added.
1588 (_NamedEntityList.pm, Entities.html): New rules.
1589
1590 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1591
1592 * HTML.pm.src: Parse errors immediately after U+000D
1593 were ignored and U+000D immediately following another
1594 U+000D was not converted to U+000A.
1595
1596 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1597
1598 * HTML.pm.src (set_inner_html): HTML5 revision 892 (adopt
1599 nodes before appended). Parser was not ready for NULL
1600 parse error and escape flag.
1601
1602 * NanoDOM.pm (adopt_node): New.
1603
1604 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1605
1606 * HTML.pm.src: HTML5 revision 886 (insane comment in
1607 CDATA and RCDATA). Note that current implementation
1608 is simply repeating what the spec says and it is maybe not
1609 a best way to do it.
1610
1611 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1612
1613 * HTML.pm.src: HTML5 revision 884 (</form> don't close
1614 the form element if a descendant element without implied end tag has
1615 still been open).
1616
1617 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1618
1619 * ContentChecker.pm: HTML5 revision 881 (Make |id| attribute with
1620 space characters non-conforming).
1621
1622 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1623
1624 * HTML.pm.src: An error message was incorrect.
1625 HTML5 revision 869 (C1 character references).
1626
1627 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1628
1629 * HTML.pm.src: HTML5 revision 867 (a LF at the beginning of
1630 a |textarea| is removed).
1631
1632 2007-06-05 Wakaba <wakaba@suika.fam.cx>
1633
1634 * NanoDOM.pm (get_attribute_node_ns): New method.
1635
1636 * ContentChecker.pm: |script| |async| and |defer|
1637 no longer require |src|. |async| MUST NOT be
1638 specified if |defer|. (HTML5 revision 858).
1639
1640 2007-05-30 Wakaba <wakaba@suika.fam.cx>
1641
1642 * HTML.pm.src: |<form><form>| went to inifinite loop.
1643
1644 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1645
1646 * ContentChecker.pm (html): Set |is_root| (allowed
1647 as a document element) flag on.
1648 (new): Removed.
1649 (check_document): New method.
1650
1651 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1652
1653 * ContentChecker.pm (thead, tfoot): Checker specifications
1654 were incorrect.
1655
1656 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1657
1658 * ContentChecker.pm ($HTMLURIAttrChecker): Include
1659 error position in the |type| option of the error.
1660
1661 * HTMLTable.pm (form_table): The |$onerror| parameter
1662 is now optional. Some bugs are fixed.
1663
1664 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1665
1666 * HTMLTable.pm: New module.
1667
1668 * ContentChecker.pm (table): Invoke table model error checker.
1669
1670 * NanoDOM.pm (first_child, get_attribute_ns): New.
1671
1672 2007-05-26 Wakaba <wakaba@suika.fam.cx>
1673
1674 * ContentChecker.pm ($HTMLLinkTypesAttrChecker): New checker.
1675 (link/@rel, a/@rel, area/@rel): Use new checker.
1676
1677 * Makefile (_LinkTypeList.pm, RelExtensions.html): New rules.
1678
1679 * _LinkTypeList.pm: New file.
1680
1681 * mklinktypelist.pl: New file.
1682
1683 * .cvsignore: |RelExtensions.html| added.
1684
1685 * NanoDOM.pm (child_nodes): Returns an empty array
1686 for non-child-containing node types.
1687 (text_content): New attribute.
1688
1689 2007-05-26 Wakaba <wakaba@suika.fam.cx>
1690
1691 * IMTChecker.pm: New module.
1692
1693 * ContentChecker.pm ($HTMLIMTAttrChecker): Call IMTChecker
1694 to test parameter value validity.
1695
1696 * HTML.pm.src ($style_start_tag): Attributes were
1697 discarded.
1698
1699 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1700
1701 * ContentChecker.pm ($HTMLURIAttrChecker): Implemented.
1702
1703 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1704
1705 * URIChecker.pm: All recommendations from RFC 3986
1706 and RFC 3987 are listed (not all testable items are checked yet).
1707
1708 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1709
1710 * URIChecker.pm: New module.
1711
1712 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1713
1714 * ContentChecker.pm: Now most attributes are implemented
1715 or associated to some placeholder.
1716 ($ElementDefault): Warn unknown attributes
1717 for unknown elements as "attribute not supported".
1718 ($HTMLLanguageTagAttrChecker, $HTMLMQAttrChecker): New placeholders.
1719 ($HTMLUsemapAttrChecker, $HTMLTargetAttrChecker): New checkers.
1720 (|a| attribute checker): Reimplemented.
1721
1722 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1723
1724 * ContentChecker.pm ($HTMLEventHandlerAttrChecker): New placeholder.
1725 ($HTMLAttrChecker): Event handler content attributes
1726 are added.
1727 (link, embed): Required attribute is now checked.
1728 (embed): Unknown local attributes are no longer warned.
1729
1730 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1731
1732 * ContentChecker.pm ($HTMLSpaceURIsAttrChecker): New placeholder.
1733 ($HTMLIMTAttrChecker): New checker.
1734 (link@rel, link@href, link@type, style@type,
1735 a@href, a@ping, a@ping, a@type, embed@src, embed@type,
1736 object@data, object@type, source@src, source@type, area@alt,
1737 area@shape, area@coords, area@href,
1738 area@ping, area@rel, area@type, script@src,
1739 script@defer, script@async, script@type): Checkers added.
1740
1741 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1742
1743 * ContentChecker.pm: Descendant checking was incorrect.
1744
1745 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1746
1747 * ContentChecker.pm: Support |xml:*| and |xmlns:*|
1748 attributes. Report an error if |Element.prefix|
1749 is |xmlns|.
1750
1751 * NanoDOM.pm (prefix): New attribute.
1752
1753 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1754
1755 * HTML.pm.src: In |main| phase, |in body| insertion
1756 mode, action for |<iframe>| was missing.
1757
1758 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1759
1760 * ContentChecker.pm: Support for many of HTML5 elements.
1761 ($GetHTMLNonNegativeIntegerAttrChecker): New.
1762
1763 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1764
1765 * ContentChecker.pm: Support for most elements up to |progress|.
1766 ($HTMLURIAttrChecker): Placeholder.
1767 ($HTMLIntegerAttrChecker, $GetHTMLFloatingPointNumberAttrChecker):
1768 New.
1769
1770 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1771
1772 * ContentChecker.pm: Attribute checkers
1773 for global attributes, |html|, |base|, |style|, and |meta|.
1774
1775 * NanoDOM.pm (insert_before): Weaken reference
1776 to the parent node.
1777 (Attr::new): Set |owner_element| attribute.
1778 (namespace_uri, manakai_local_name): New attribute implementations.
1779 (owner_element): New attribute.
1780
1781 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1782
1783 * ContentChecker.pm ($AttrChecker, $HTMLAttrChecker,
1784 $AnyChecker->{attr_checker}, $HTMLAttrsChecker,
1785 $Element->{$HTML_NS}->{''}): New.
1786 (check_element): Invoke attrs_checker for each element.
1787
1788 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1789
1790 * ContentChecker.pm: Don't use |manakai_element_type_match|.
1791
1792 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1793
1794 * ContentChecker.pm: Use hashs rather than lists for
1795 element type testings.
1796
1797 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1798
1799 * ContentChecker.pm: Don't generate duplicate
1800 error when an element type is put in the "minus" list
1801 and the element type is not allowed explicitly in the particular
1802 element content model.
1803 (html:a checker): New checker.
1804 (html:details, html:datagrid): New checkers.
1805 (html:legend): New checker.
1806
1807 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1808
1809 * ContentChecker.pm (html:li checker): Implemented.
1810
1811 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1812
1813 * ContentChecker.pm ($HTMLInlineOrStriclyInlineChecker): New
1814 checker.
1815 (html:dd checker): New checker.
1816 (html:q, html:em, html:strong, html:small,
1817 html:m, html:dfn, html:code, html:samp, html:span): New checkers.
1818
1819 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1820
1821 * ContentChecker.pm ($AnyChecker): Renamed
1822 from |$ElementDefault->{checker}|.
1823 ($ElementDefault->{checker}): Throw an error that
1824 the element type is not supported by the checker.
1825 ($HTMLMetadataElement): |html:base| was missing.
1826 ($HTMLEmptyChecker): Don't throw an error
1827 for inter-element whitespace nodes.
1828 (html:html checker): Errors were not
1829 thrown even if |html:head| and/or |html:body|
1830 children were missing.
1831 (html:head checker): An error was not
1832 thrown if <meta charset> appered after other
1833 elements.
1834
1835 2007-05-05 Wakaba <wakaba@suika.fam.cx>
1836
1837 * ContentChecker.pm: |footer|, |video|, |audio|, |script|,
1838 and |noscript| elements are implemented.
1839 (new): New method.
1840
1841 2007-05-04 Wakaba <wakaba@suika.fam.cx>
1842
1843 * ContentChecker.pm: New module.
1844
1845 2007-05-04 Wakaba <wakaba@suika.fam.cx>
1846
1847 * NanoDOM.pm (manakai_parent_element,
1848 document_element, manakai_local_name,
1849 manakai_element_type_match): New method.
1850
1851 2007-05-03 Wakaba <wakaba@suika.fam.cx>
1852
1853 * HTML.pm.src: Replace decimal and hexadecimal numeric
1854 entities in C1 range using Windows-1252 mapping. Bare LF
1855 did not count as new line for error reporting.
1856
1857 2007-05-02 Wakaba <wakaba@suika.fam.cx>
1858
1859 * NanoDOM.pm (DOMImplementation): New class.
1860 (append_child): Weaken the |parent_node| reference.
1861 (create_element_ns, Element new): Set the |owner_document|
1862 reference.
1863 (implementation): New attribute.
1864 (owner_document, local_name, namespace_uri): New attributes.
1865
1866 * HTML.pm.src (parse_string): Line and column numbers
1867 are now provided to error handler.
1868 (!!!parse-error): Short descriptions are added.
1869 (_construct_tree): Split into three methods; support
1870 for innerHTML mode.
1871 (set_inner_html): New method.
1872
1873 2007-05-01 Wakaba <wakaba@suika.fam.cx>
1874
1875 * NanoDOM.html: Documentation is added.
1876
1877 * HTML.pod, ContentType.html: Documentation is revised.
1878
1879 * .cvsignore: Pod2html temporary files are added.
1880
1881 * Makefile: Make |NanoDOM.html|.
1882
1883 2007-05-01 Wakaba <wakaba@suika.fam.cx>
1884
1885 * HTML.pm.src (parse_string): New method.
1886 (get_inner_html): Renamed from |inner_html|.
1887
1888 * Makefile: A rule for |HTML.html| is added.
1889
1890 * HTML.pod: New documentation.
1891
1892 2007-05-01 Wakaba <wakaba@suika.fam.cx>
1893
1894 * NanoDOM.pm (last_child, previous_sibling): New attributes.
1895 (clone_node): Attribute nodes were not completely copied.
1896
1897 * HTML.pm.src: Many bugs are fixed.
1898
1899 2007-04-30 Wakaba <wakaba@suika.fam.cx>
1900
1901 * HTML.pm.src: Some typos are fixed.
1902
1903 2007-04-30 Wakaba <wakaba@suika.fam.cx>
1904
1905 * mkhtmlparser.pl, Makefile: References to the |HTML-consume-entity.src|
1906 are removed.
1907
1908 * HTML.pm.src: Tokenizer's handling on named entities are rewritten.
1909
1910 * HTML-consume-entity.src: Removed.
1911
1912 2007-04-30 Wakaba <wakaba@suika.fam.cx>
1913
1914 * HTML.pm.src: Tokenizer's handling on hexadecimal
1915 numeric entities are rewritten.
1916
1917 2007-04-30 Wakaba <wakaba@suika.fam.cx>
1918
1919 * HTML.pm.src: Some tokenizer bugs are fixed.
1920
1921 2007-04-30 Wakaba <wakaba@suika.fam.cx>
1922
1923 * HTML.pm.src: The tree construction stage is implemented.
1924
1925 * mkhtmlparser.pl: New macros are added.
1926
1927 2007-04-28 Wakaba <wakaba@suika.fam.cx>
1928
1929 * ContentType.pm: A note on bug in the specification
1930 is removed since it's been now fixed.
1931
1932 * .cvsignore: New file.
1933
1934 2007-04-28 Wakaba <wakaba@suika.fam.cx>
1935
1936 * HTML.pm.src, HTML-consume-entity.src: New files.
1937
1938 * Makefile (HTML.pm): New rule.
1939
1940 * mkhtmlparser.pl: New script.
1941
1942 2007-04-25 Wakaba <wakaba@suika.fam.cx>
1943
1944 * Makefile: New file.
1945
1946 2007-04-24 Wakaba <wakaba@suika.fam.cx>
1947
1948 * ContentType.pm: An error in pod is fixed.
1949
1950 2007-04-24 Wakaba <wakaba@suika.fam.cx>
1951
1952 * ContentType.pm: Documentation is added.
1953
1954 2007-04-24 Wakaba <wakaba@suika.fam.cx>
1955
1956 * ContentType.pm: New Perl module.
1957
1958 * ChangeLog: New file.
1959
1960

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24