/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Contents of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.312 - (hide annotations) (download)
Mon Sep 15 08:09:39 2008 UTC (16 years, 9 months ago) by wakaba
Branch: MAIN
Changes since 1.311: +4 -0 lines
++ whatpm/Whatpm/ChangeLog	15 Sep 2008 08:09:35 -0000
	* HTML.pm.src: Shorten keys.

2008-09-15  Wakaba  <wakaba@suika.fam.cx>

1 wakaba 1.310 2008-09-15 Wakaba <wakaba@suika.fam.cx>
2    
3 wakaba 1.312 * HTML.pm.src: Shorten keys.
4    
5     2008-09-15 Wakaba <wakaba@suika.fam.cx>
6    
7 wakaba 1.311 * HTML.pm.src: Remove checking for control character, surrogate
8     pair, or noncharacter code points and non-Unicode code
9     points (they should be handled by Whatpm::Charset::UnicodeChecker).
10     (parse_char_stream): Support for the |$get_wrapper| argument and
11     character stream error handlers.
12    
13     2008-09-15 Wakaba <wakaba@suika.fam.cx>
14    
15 wakaba 1.310 * ContentChecker.pm: Don't call |loda_ns_module|
16     for null-namespace elements/attributes.
17    
18     * HTML.pm.src: Fact out $disallowed_control_chars
19     as a hash.
20    
21 wakaba 1.300 2008-09-14 Wakaba <wakaba@suika.fam.cx>
22    
23 wakaba 1.309 * HTML.pm.src: Regexp typo fixed. |{prev_char}|
24     and |{next_char}| initializations are moved to initialization
25     method. |{read_until}| now supports buffering. Sync |set_inner_html|
26     with |parse_char_stream|.
27    
28     2008-09-14 Wakaba <wakaba@suika.fam.cx>
29    
30 wakaba 1.308 * HTML.pm.src (parse_char_stream): Make |set_next_char|
31     invoke |manakai_read_until|, not only |read|, where
32     possible, to decrease the number of |read| method calls.
33    
34     * mkhtmlparser.pl: Related changes to the aforementioned
35     modification.
36    
37     2008-09-14 Wakaba <wakaba@suika.fam.cx>
38    
39 wakaba 1.307 * HTML.pm.src: Use |read| instead of |getc|. |set_inner_html|
40     would report character error from now.
41    
42     2008-09-14 Wakaba <wakaba@suika.fam.cx>
43    
44 wakaba 1.306 * HTML.pm.src: White-space-leaded non-white-space character
45     tokens in "before head insertion mode" was not
46     correctly handled.
47     (set_inner_html): Reimplemented using CharString decodehandle
48     class. Support for $get_wrapper argument. Support
49     for |{read_until}| feature.
50    
51     2008-09-14 Wakaba <wakaba@suika.fam.cx>
52    
53 wakaba 1.305 * HTML.pm.src: Make a "bare ero" error for unknown
54     entities point the "&" character.
55    
56     2008-09-14 Wakaba <wakaba@suika.fam.cx>
57    
58 wakaba 1.304 * HTML.pm.src: It turns out that U+FFFD don't have to
59     be added to the list of excluded characters.
60    
61     2008-09-14 Wakaba <wakaba@suika.fam.cx>
62    
63 wakaba 1.303 * HTML.pm.src ($char_onerror): Have character decoder's |line|
64     and |column| a higher priority than the one set by the
65     tokenizer's input handler.
66     ($self->{read_until}): Exclude U+FFFD (but this might
67     not be necessary, since now we do line/column fixup in
68     the character decode handle).
69    
70     2008-09-14 Wakaba <wakaba@suika.fam.cx>
71    
72 wakaba 1.302 * HTML.pm.src: Use |{read_until}| where possible.
73    
74     2008-09-14 Wakaba <wakaba@suika.fam.cx>
75    
76 wakaba 1.301 * HTML.pm.src: Change |{getc_until}| to |{read_until}|
77     and |manakai_getc_until| to |manakai_read_until| to
78     reduce the number of string copies.
79    
80     2008-09-14 Wakaba <wakaba@suika.fam.cx>
81    
82 wakaba 1.300 * HTML.pm.src (parse_char_string): Use newly created
83     |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
84     standard feature to |open| a string as a filehandle,
85     since Perl's string filehandle seems not supporting |ungetc|
86     method correctly.
87     (parse_char_stream): Define |{getc_until}| method.
88     (DATA_STATE): Experimental support for |getc_until| feature.
89    
90 wakaba 1.292 2008-09-13 Wakaba <wakaba@suika.fam.cx>
91    
92 wakaba 1.299 * HTML.pm.src: Check points added to newly added branches.
93    
94     2008-09-13 Wakaba <wakaba@suika.fam.cx>
95    
96 wakaba 1.298 * HTML.pm.src: Remove |{char}|, which is no longer used.
97     Remove |{entity_in_attr}| and |{last_attribute_value_state}|
98     and replaced by |{prev_state}|.
99    
100     * mkhtmlparser.pl: Remove |{char}| feature.
101     Remove |!!!back-next-input-character;| macro.
102    
103     2008-09-13 Wakaba <wakaba@suika.fam.cx>
104    
105 wakaba 1.297 * HTML.pm.src: Finally we get rid of all the inner loops. Remove
106     entity related tokenizer states in favor of new states
107     implementing the consume character reference algorithm.
108    
109     2008-09-13 Wakaba <wakaba@suika.fam.cx>
110    
111     * HTML.pm.src: "Consume a character reference" algorithm is
112 wakaba 1.296 now implemented as a tokenizer's state, rather than
113     a method, with minimum changes (more changes will
114     be made, in due course). "Bogus comment state"'s inner
115     loop gets removed.
116    
117     2008-09-13 Wakaba <wakaba@suika.fam.cx>
118    
119 wakaba 1.297 * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
120 wakaba 1.295 into their own tokenizer states.
121    
122     2008-09-13 Wakaba <wakaba@suika.fam.cx>
123    
124 wakaba 1.294 * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
125     is split into three states.
126    
127     2008-09-13 Wakaba <wakaba@suika.fam.cx>
128    
129 wakaba 1.293 * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
130     itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
131     no longer does the tokenizer have to push back next input
132     characters in those states.
133    
134     2008-09-13 Wakaba <wakaba@suika.fam.cx>
135    
136 wakaba 1.292 * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
137     into four states so that no longer does the tokenizer have to push
138     back next input characters in that state.
139    
140 wakaba 1.291 2008-09-11 Wakaba <wakaba@suika.fam.cx>
141    
142     * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
143     which can be used to insert some wrapper between the character
144     stream handle and the tokenizer. (It is currently not supported
145     for |set_inner_html| for |Element|s).
146    
147 wakaba 1.289 2008-09-10 Wakaba <wakaba@suika.fam.cx>
148    
149 wakaba 1.290 * HTML.pm.src: Ignore punctuations in charset names.
150    
151     2008-09-10 Wakaba <wakaba@suika.fam.cx>
152    
153 wakaba 1.289 * ContentChecker.pm: Support for charset-layer error levels.
154    
155     * HTML.pm.src: Don't specify |text| argument for the
156     |chardecode:fallback| error, since it is not the encoding
157     being used alternatively.
158    
159 wakaba 1.288 2008-09-06 Wakaba <wakaba@suika.fam.cx>
160    
161     * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
162    
163 wakaba 1.286 2008-08-31 Wakaba <wakaba@suika.fam.cx>
164    
165 wakaba 1.287 * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
166    
167     2008-08-31 Wakaba <wakaba@suika.fam.cx>
168    
169 wakaba 1.286 * HTML.pm.src: Bug fix and sync with the spec with regard
170     to after after frameset insertion mode processing (HTML5
171     revision 1909). Note that the implementation was wrong
172     per the old spec before the r1909 changes.
173    
174 wakaba 1.278 2008-08-30 Wakaba <wakaba@suika.fam.cx>
175    
176 wakaba 1.285 * HTMLTable.pm: scope=auto algorithm fix synced with the
177     spec (HTML5 revision 2093).
178     ($process_row): Algorithm step numbers synced with the
179     spec (HTML5 revision 2092).
180    
181     2008-08-30 Wakaba <wakaba@suika.fam.cx>
182    
183 wakaba 1.284 * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
184     revision 2094).
185    
186     2008-08-30 Wakaba <wakaba@suika.fam.cx>
187    
188     * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
189    
190     2008-08-30 Wakaba <wakaba@suika.fam.cx>
191    
192 wakaba 1.283 * HTML.pm.src: '"' and "'" at the end of attribute
193     name (after another attribute) now raise parse error (HTML5
194     revision 2123). Empty unquoted attribute values are no
195     longer allowed (HTML5 revision 2122).
196    
197     2008-08-30 Wakaba <wakaba@suika.fam.cx>
198    
199 wakaba 1.282 * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
200     revision 2130).
201    
202     2008-08-30 Wakaba <wakaba@suika.fam.cx>
203    
204 wakaba 1.281 * ContentChecker.pm: |xml:lang| attribute value must be same
205     as |lang| attribute value for HTML elements (HTML5 revision 2062
206     and so on).
207    
208     2008-08-30 Wakaba <wakaba@suika.fam.cx>
209    
210 wakaba 1.280 * ContentChecker.pm: Error level definition for |xml_id_error|
211     was missing.
212    
213     * URIChecker.pm: The end of the URL should be marked as the
214     error location for an empty path error. The position
215     between the userinfo and the port components should be
216     marked as the error location for an empty host error.
217    
218     2008-08-30 Wakaba <wakaba@suika.fam.cx>
219    
220 wakaba 1.279 * URIChecker.pm: Set parameters representing where in the
221     value the error occurs for errors. Report unknown
222     address format error in warning level, since address
223     formats are rarely added. Path segments starting with "/.."
224     were misinterpreted as a dot-segment.
225    
226     2008-08-30 Wakaba <wakaba@suika.fam.cx>
227    
228 wakaba 1.278 * URIChecker.pm (check_iri_reference): Requires
229     |Message::DOM::DOMImplementation|.
230    
231 wakaba 1.277 2008-08-29 Wakaba <wakaba@suika.fam.cx>
232    
233     * IMTChecker.pm: Updated for the new error reporting architecture.
234    
235     * ContentChecker.pm: Error levels for IMTs are added.
236    
237 wakaba 1.276 2008-08-17 Wakaba <wakaba@suika.fam.cx>
238    
239     * H2H.pm (_shift_token): Support for unquoted HTML attribute
240     values.
241    
242 wakaba 1.274 2008-08-16 Wakaba <wakaba@suika.fam.cx>
243    
244 wakaba 1.275 * CacheManifest.pm: Support for new style of error
245     reports.
246    
247     * HTML.pm.src: Set line=1, column=1 to the document node.
248    
249     2008-08-16 Wakaba <wakaba@suika.fam.cx>
250    
251 wakaba 1.274 * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
252     and URL checkers. Support for more error levels for bogus
253     langauge tag and URL "standards".
254    
255     * LangTag.pm, URIChecker.pm: Support for new style error
256     level reporting.
257    
258 wakaba 1.271 2008-08-15 Wakaba <wakaba@suika.fam.cx>
259    
260 wakaba 1.273 * ContentChecker.pm: Support for RDF/XML error levels.
261    
262     * HTMLTable.pm, RDFXML.pm: Support for new style of error level
263     specifying. Error types are revised.
264    
265     2008-08-15 Wakaba <wakaba@suika.fam.cx>
266    
267 wakaba 1.272 * ContentChecker.pm: All error reporting method calls are
268     renewed.
269    
270     2008-08-15 Wakaba <wakaba@suika.fam.cx>
271    
272 wakaba 1.271 * HTML.pm.src: All error type names and "text" parameters
273     are revised. Use new style for "level" specification.
274    
275     * mkhtmlparser.pl: Use new style for "level" specification.
276    
277 wakaba 1.268 2008-08-03 Wakaba <wakaba@suika.fam.cx>
278    
279 wakaba 1.270 * WebIDL.pm (parse_char_string): Simplified error
280     reporting process for broken ignored valuetype definition.
281     (Valuetype idl_text): Support for special "DOMString" name.
282    
283     2008-08-03 Wakaba <wakaba@suika.fam.cx>
284    
285 wakaba 1.269 * WebIDL.pm ($get_scoped_name): Append "::::" if the last
286     terminal of the ScopedName is "DOMString", such that whether
287     the last part of the scoped name is "DOMString" or "_DOMString"
288     later. It is necessary to determine whether a |typedef|
289     definition should be ignored or not.
290     (parse_char_string): Unescape the identifier of
291     exception members.
292     ($resolve): Return undef for builtin types and sequence<T>
293     types (we might not have to do this, however...).
294     (check): Support checking for Exceptions, Valuetypes,
295     and Typedefs.
296     ($serialize_type): Support for "DOMString::::" syntax.
297     (Typedef idl_text): Output Type as "DOMString" if it
298     is really "DOMString" (i.e. its internal representation
299     is "::DOMString::").
300    
301     2008-08-03 Wakaba <wakaba@suika.fam.cx>
302    
303 wakaba 1.268 * WebIDL.pm ($resolve): New code, based on resolve code
304     for constant types in the |check| method.
305     (check): Support for checking of attributes, operations, and
306     arguments.
307     (Attribute/Operation idl_text): Exception names in getraises,
308     setraises, and raises clauses is serizlied by |$serialize_type|
309     code.
310    
311 wakaba 1.264 2008-08-02 Wakaba <wakaba@suika.fam.cx>
312    
313 wakaba 1.267 * WebIDL.pm ($integer): Order of selections are changed to match
314     hexadecimal numbers (the original pattern, taken from the spec,
315     was not work for hexadecimal numbers, because the "0" prefix
316     matches to the [0-7]* part (as an empty string) and therefore
317     it does not match with remaining "x..." part of a "0x..." integer
318     literal.
319     ($get_type): It now returns a string, not an array reference,
320     for regular types and |sequence| types (i.e. it in any case
321     returns a string).
322     ($get_next_token): The second item in the array that represents
323     a integer or float token is now a Perl number value, not the
324     original string representation of the number.
325     (check): Support for const value consistency checking.
326     No extended attribute is defined for constants.
327     (Node subclasses): Use simple strings rather than array references
328     for default data type values.
329     ($serialize_type): Type values are now simple strings.
330     (value): If the new attribute value is a false value, then
331     a FALSE value is set to the attribute.
332    
333     2008-08-02 Wakaba <wakaba@suika.fam.cx>
334    
335 wakaba 1.266 * WebIDL.pm ($get_scoped_name): Now scoped names are stored
336     in its stringified format ("scoped name" as defined in the
337     spec). Note that future version of this module should not use
338     array references for type values and the |type_text| attribute
339     should be made obsolete.
340     (parse_char_string): Unescape attribute names.
341     (check): Support for checking of whether inherited interfaces
342     are actually defined or not. Support for checking of whether
343     interface member identifiers are duplicated or not.
344     ($serialize_type): Scoped names are returned as is. A future
345     version of this code should escape identifiers other than "DOMString",
346     otherwise the idl_text would be non-conforming.
347    
348     2008-08-02 Wakaba <wakaba@suika.fam.cx>
349    
350 wakaba 1.264 * WebIDL.pm (parse_char_string): Set line/column numbers
351 wakaba 1.265 to generated nodes. Unescape identifiers. Extended attributes
352     for Definition's were ignored.
353     (append_child): Set |parent_node| attribute.
354     (parent_node): New attribute.
355     (check): Support interface/exception members. Support
356     extended attributes. Support definition identifier uniqueness
357     constraint.
358     (qualified_name): New attribute.
359     (Interface/Exception idl_text): Extended attributes were
360     not prepended to the returned text.
361    
362     2008-08-02 Wakaba <wakaba@suika.fam.cx>
363    
364     * WebIDL.pm (parse_char_string): Set line/column numbers
365 wakaba 1.264 to interface object experimentally. s/shift/pop/g, shift
366     would make things wrong. Support for interface forward
367     declarations was missing. Broken interface declarations
368     with no block were not ignored entirely.
369     (Whatpm::WebIDL::Node): New abstract class. This class
370     makes things easier.
371     (child_nodes): New attribute. Unlike DOM's attribute with
372     same name, this attribute returns a dead list of nodes for
373     simplicity.
374     (get_user_data, set_user_data): New methods.
375     (Module idl_text): A SPACE character should be inserted
376     before the |{| character.
377     (Interface idl_text): Support for interface forward declarations.
378     (is_forward_declaration): New attribute.
379    
380 wakaba 1.259 2008-07-19 Wakaba <wakaba@suika.fam.cx>
381    
382 wakaba 1.263 * WebIDL.pm (type_text): Better serializer.
383    
384     2008-07-19 Wakaba <wakaba@suika.fam.cx>
385    
386 wakaba 1.262 * WebIDL.pm: Revise forward-compatible parsing so that
387     it now can handle broken extended attributes and as such.
388    
389     2008-07-19 Wakaba <wakaba@suika.fam.cx>
390    
391 wakaba 1.261 * WebIDL.pm: Real support for extended attributes.
392     Support for extended attributes with arguments.
393    
394     2008-07-19 Wakaba <wakaba@suika.fam.cx>
395    
396 wakaba 1.260 * WebIDL.pm: Support for |exception| syntax.
397     (Interface->idl_text): Tentative support for inheritances.
398    
399     2008-07-19 Wakaba <wakaba@suika.fam.cx>
400    
401 wakaba 1.259 * WebIDL.pm: Hierarchical scoped name support was broken.
402     Support for raises, setraises, and getraises syntaxes.
403    
404 wakaba 1.258 2008-07-18 Wakaba <wakaba@suika.fam.cx>
405    
406     * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
407     proper support for types, extended attributes, and exceptions yet).
408     WebIDL parser, version 1 (no support for exceptions yet,
409     no proper support for extended attributes yet).
410    
411 wakaba 1.257 2008-07-09 Wakaba <wakaba@suika.fam.cx>
412    
413     * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
414    
415 wakaba 1.255 2008-06-29 Wakaba <wakaba@suika.fam.cx>
416    
417 wakaba 1.256 * WebIDL.pm: Support for valuetype and const.
418    
419     2008-06-29 Wakaba <wakaba@suika.fam.cx>
420    
421 wakaba 1.255 * WebIDL.pm: New module.
422    
423 wakaba 1.254 2008-06-15 Wakaba <wakaba@suika.fam.cx>
424    
425     * Makefile (Entities.html): URI changed.
426    
427 wakaba 1.253 2008-06-08 Wakaba <wakaba@suika.fam.cx>
428    
429     * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
430    
431 wakaba 1.251 2008-06-01 Wakaba <wakaba@suika.fam.cx>
432    
433 wakaba 1.252 * HTML.pm.src (_get_next_token): A parse error was missing.
434    
435     2008-06-01 Wakaba <wakaba@suika.fam.cx>
436    
437 wakaba 1.251 * mklinktypelist.pl: rel=contact is no longer part of the HTML5
438     spec (commented out). (HTML5 revision 1711).
439    
440 wakaba 1.248 2008-05-25 Wakaba <wakaba@suika.fam.cx>
441    
442 wakaba 1.250 * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
443    
444     * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
445     as UTF-16 (HTML5 revision 1701).
446    
447     2008-05-25 Wakaba <wakaba@suika.fam.cx>
448    
449 wakaba 1.249 * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
450     1692).
451    
452     2008-05-25 Wakaba <wakaba@suika.fam.cx>
453    
454     * HTML.pm.src: The secondary insertion mode used when switching
455     to foreign content is the "in body" insertion mode (HTML5 revision
456     1696).
457    
458     2008-05-25 Wakaba <wakaba@suika.fam.cx>
459    
460 wakaba 1.248 * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
461     revision 1697).
462    
463 wakaba 1.239 2008-05-24 Wakaba <wakaba@suika.fam.cx>
464    
465 wakaba 1.247 * HTML.pm.src: Support for end-of-file token in foreign content
466     insertion mode (HTML5 revision 1693). Update SVG camelCase
467     attribute list (HTML5 revision 1700). <textarea> closes
468     </select> (HTML5 revision 1699). More start tags close in
469     foreign content insertion mode (HTML5 revision 1698).
470    
471     2008-05-24 Wakaba <wakaba@suika.fam.cx>
472    
473 wakaba 1.246 * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
474    
475     2008-05-24 Wakaba <wakaba@suika.fam.cx>
476    
477 wakaba 1.245 * HTML.pm.src: More robust charset parameter detection (HTML5
478     revision 1674).
479    
480     2008-05-24 Wakaba <wakaba@suika.fam.cx>
481    
482 wakaba 1.244 * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
483     revision 1676).
484    
485     2008-05-24 Wakaba <wakaba@suika.fam.cx>
486    
487 wakaba 1.243 * HTML.pm.src: Ignore language part of public identifiers for
488     quriks mode detection (HTML5 revision 1679).
489    
490     2008-05-24 Wakaba <wakaba@suika.fam.cx>
491    
492 wakaba 1.242 * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
493     revision 1685).
494    
495     2008-05-24 Wakaba <wakaba@suika.fam.cx>
496    
497 wakaba 1.241 * HTML.pm.src: Support for EOF in new states for tags (HTML5
498     revision 1684).
499    
500     2008-05-24 Wakaba <wakaba@suika.fam.cx>
501    
502 wakaba 1.240 * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
503     work (HTML5 revision 1690).
504    
505     2008-05-24 Wakaba <wakaba@suika.fam.cx>
506    
507 wakaba 1.239 * HTML.pm.src (_tree_construction_main): Change handling of
508     end tags in head insertion modes (HTML5 revision 1686).
509     (parse_char_string): Bug fix for non-utf8 character string handlings.
510     (parse_char_stream): |ungetc| does not work well for this context.
511    
512 wakaba 1.237 2008-05-18 Wakaba <wakaba@suika.fam.cx>
513    
514 wakaba 1.238 * HTML.pm.src (parse_byte_string): Redefined to invoke
515     |parse_byte_stream|.
516     (parse_byte_stream): New method.
517    
518     2008-05-18 Wakaba <wakaba@suika.fam.cx>
519    
520 wakaba 1.237 * HTML.pm.src (parse_byte_string): Fix the column number reported
521     by encoding layer error reporter.
522    
523 wakaba 1.233 2008-05-17 Wakaba <wakaba@suika.fam.cx>
524    
525 wakaba 1.236 * HTML.pm.src (parse_byte_string): Use streaming decoder
526     rather than converting the whole byte string and then parsing.
527     Propagate errors in character encoding layer.
528     (get_next_token): Precise error reporting for |bare stago| error.
529    
530     2008-05-17 Wakaba <wakaba@suika.fam.cx>
531    
532 wakaba 1.235 * HTML.pm.src (parse_char_stream): New method.
533     (parse_char_string): This method is now defined as an invocation
534     of the |parse_char_stream| method.
535    
536     2008-05-17 Wakaba <wakaba@suika.fam.cx>
537    
538 wakaba 1.234 * HTML.pm.src (parse_byte_string): Report various status
539     of the sniffing as info-level errors. Support for new
540     decoding framework in parser resestting.
541     (new): Various default error levels were not set.
542    
543     2008-05-17 Wakaba <wakaba@suika.fam.cx>
544    
545 wakaba 1.233 * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
546     algorithm, except for the actual sniffing, is implemented
547     with new framework with Message::Charset::Info.
548    
549 wakaba 1.232 2008-05-16 Wakaba <wakaba@suika.fam.cx>
550    
551     * CacheManifest.pm (_parse): Drop fragment identifiers from
552     URIs in fallback section (HTML5 revision 1596).
553    
554 wakaba 1.230 2008-05-10 Wakaba <wakaba@suika.fam.cx>
555    
556 wakaba 1.231 * Makefile (Entities.html): URI has changed.
557    
558     2008-05-10 Wakaba <wakaba@suika.fam.cx>
559    
560 wakaba 1.230 * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
561     1553).
562    
563 wakaba 1.228 2008-05-06 Wakaba <wakaba@suika.fam.cx>
564    
565 wakaba 1.229 * ContentChecker.pm: Noted that those returned in |table| are
566     no longer table elements, but table objects returned
567     by Whatpm::HTMLTable.
568    
569     * HTMLTable.pm (form_table): Return table element node
570     as |$table->{element}|.
571     (assign_header): Support for the |headers=""| attribute.
572    
573     2008-05-06 Wakaba <wakaba@suika.fam.cx>
574    
575 wakaba 1.228 * HTMLTable.pm (assign_header): New function; first version
576     with no support for headers="".
577     (form_table): Include table width and height to the returned
578     table object for covenience. Indexing in column assignement was wrong.
579     Set whether a data cell is empty or not for convenience.
580    
581 wakaba 1.222 2008-05-05 Wakaba <wakaba@suika.fam.cx>
582    
583 wakaba 1.227 * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
584    
585     2008-05-05 Wakaba <wakaba@suika.fam.cx>
586    
587 wakaba 1.226 * HTMLTable.pm: How table model errors are detected is
588     changed (HTML5 revision 1387).
589    
590     2008-05-05 Wakaba <wakaba@suika.fam.cx>
591    
592 wakaba 1.225 * HTMLTable.pm: The algorithm now moves |tfoot| elements
593     to the end of the table (HTML5 revision 1380).
594    
595     2008-05-05 Wakaba <wakaba@suika.fam.cx>
596    
597 wakaba 1.224 * HTMLTable.pm: The algorithm is now 0-based indexing, instead
598     of 1-based (HTML5 revision 1376).
599    
600     2008-05-05 Wakaba <wakaba@suika.fam.cx>
601    
602 wakaba 1.223 * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
603     and "Content-Encoding" no longer prevent sniffing (HTML5
604     revision 1288).
605    
606     2008-05-05 Wakaba <wakaba@suika.fam.cx>
607    
608 wakaba 1.222 * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
609     1282).
610    
611 wakaba 1.220 2008-05-03 Wakaba <wakaba@suika.fam.cx>
612    
613 wakaba 1.221 * ContentChecker.pm: Support for global attributes.
614     Status of XML specs are added.
615    
616     2008-05-03 Wakaba <wakaba@suika.fam.cx>
617    
618 wakaba 1.220 * ContentChecker.pm (check_element): Support for |template=""|
619     and |ref=""| attribute (referent element type checking).
620    
621 wakaba 1.218 2008-04-29 Wakaba <wakaba@suika.fam.cx>
622    
623 wakaba 1.219 * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
624     1500) is implemented (except for IDNA part and URI-scheme-specific
625 wakaba 1.218 knowledge). Line number counting was wrong for LF-only
626     documents.
627    
628 wakaba 1.211 2008-04-13 Wakaba <wakaba@suika.fam.cx>
629    
630 wakaba 1.217 * HTML.pm.src: Raise an parse error for any disallowed
631     character (HTML5 revision 1263).
632    
633     2008-04-13 Wakaba <wakaba@suika.fam.cx>
634    
635 wakaba 1.216 * mkentitylist.pl: Support for new HTML5 entity table format (the
636     definition for |AElig;| was missing).
637    
638     2008-04-13 Wakaba <wakaba@suika.fam.cx>
639    
640 wakaba 1.215 * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
641     name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
642    
643     2008-04-13 Wakaba <wakaba@suika.fam.cx>
644    
645 wakaba 1.214 * HTML.pm.src: List of element names that close foreign content
646     insertion mode is added (HTML5 revisions 1412 and 1418).
647    
648     2008-04-13 Wakaba <wakaba@suika.fam.cx>
649    
650 wakaba 1.213 * HTML.pm.src: Support for |mglyph| and |malignmark|
651     elements (HTML5 revision 1410).
652    
653     2008-04-13 Wakaba <wakaba@suika.fam.cx>
654    
655 wakaba 1.212 * HTML.pm.src: Support for new long MathML entities (HTML5
656     revision 1406).
657    
658     2008-04-13 Wakaba <wakaba@suika.fam.cx>
659    
660 wakaba 1.211 * HTML.pm.src: CDATA section support for MathML and SVG
661     elements (HTML5 revision 1404 and 1420).
662    
663 wakaba 1.209 2008-04-12 Wakaba <wakaba@suika.fam.cx>
664    
665 wakaba 1.210 * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
666     elements (HTML5 revision 1404). Unused !!!macro definitions
667     are removed.
668    
669     2008-04-12 Wakaba <wakaba@suika.fam.cx>
670    
671 wakaba 1.209 * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
672     are raised is changed (HTML5 revision 1404).
673    
674 wakaba 1.206 2008-04-06 Wakaba <wakaba@suika.fam.cx>
675    
676 wakaba 1.208 * HTML.pm.src: Category sets that are no longer used are removed.
677    
678     2008-04-06 Wakaba <wakaba@suika.fam.cx>
679    
680 wakaba 1.207 * HTML.pm.src: The ->[1] property of stack entries are now
681     replaced by constants representing element category.
682    
683     2008-04-06 Wakaba <wakaba@suika.fam.cx>
684    
685 wakaba 1.206 * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
686     for error reporting. (This is a preparation for using constant
687     value for ->[1].)
688    
689 wakaba 1.204 2008-03-22 Wakaba <wakaba@suika.fam.cx>
690    
691 wakaba 1.205 * RDFXML.pm: Typo fixed.
692    
693     2008-03-22 Wakaba <wakaba@suika.fam.cx>
694    
695 wakaba 1.204 * ContentChecker.pm: |fact_level| is now treated
696     as same as |must_level|, i.e. level = |m|.
697     (check_element): Make list of URIs in the DOM.
698    
699 wakaba 1.197 2008-03-21 Wakaba <wakaba@suika.fam.cx>
700    
701 wakaba 1.203 * RDFXML.pm: Language accessor implemented.
702     Local (null-namespace) attribute support.
703    
704     2008-03-21 Wakaba <wakaba@suika.fam.cx>
705    
706 wakaba 1.202 * RDFXML.pm: Factored out ID checking code.
707    
708     2008-03-21 Wakaba <wakaba@suika.fam.cx>
709    
710 wakaba 1.201 * RDFXML.pm: TODO items noted. Validation of ID
711     and URI attributes is implemented. Warn if unknown
712     value is used in rdf:parseType="" attribute.
713    
714     * URIChecker.pm (check_rdf_uri_reference): New function.
715    
716     2008-03-21 Wakaba <wakaba@suika.fam.cx>
717    
718 wakaba 1.200 * RDFXML.pm: bnodeid implemented. Relative references
719     are now resolved.
720    
721     2008-03-21 Wakaba <wakaba@suika.fam.cx>
722    
723 wakaba 1.199 * ContentChecker.pm: RDF reification implemented.
724    
725     * RDFXML.pm: undef vs false bug fixed.
726     Reification implemented.
727    
728     2008-03-21 Wakaba <wakaba@suika.fam.cx>
729    
730 wakaba 1.198 * RDFXML.pm: s/id/ID/ for attribute name.
731     The |node| arguments are added for |ontriple| calls.
732     Too many "attribute not allowed" errors were raised.
733    
734     * ContentChecker.pm: Initial experimental support for rdf:RDF
735     element.
736    
737     2008-03-21 Wakaba <wakaba@suika.fam.cx>
738    
739 wakaba 1.197 * RDFXML.pm: New module.
740    
741 wakaba 1.192 2008-03-20 Wakaba <wakaba@suika.fam.cx>
742    
743 wakaba 1.196 * HTML.pm.src (set_inner_html): Line/column number
744     code was old one yet.
745    
746     2008-03-20 Wakaba <wakaba@suika.fam.cx>
747    
748 wakaba 1.195 * HTML.pm.src: Better line/column reporting for "duplicate attribute"
749     errors. Line/column markings for DOCTYPE, comment, and
750     character tokens are reintroduced; otherwise,
751     error location for "not HTML5" error and errors
752     for implied elements are not attached.
753    
754     2008-03-20 Wakaba <wakaba@suika.fam.cx>
755    
756 wakaba 1.194 * HTML.pm.src: Set line/column numbers to attributes.
757    
758     * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
759     (value): Setter implemented.
760    
761     * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
762    
763     2008-03-20 Wakaba <wakaba@suika.fam.cx>
764    
765 wakaba 1.193 * HTML.pm.src: Unused line/column markings are removed.
766    
767     2008-03-20 Wakaba <wakaba@suika.fam.cx>
768    
769 wakaba 1.192 * HTML.pm.src (_get_next_token): Remove |first_start_tag|
770     flag, which is no longer used.
771    
772 wakaba 1.190 2008-03-17 Wakaba <wakaba@suika.fam.cx>
773    
774 wakaba 1.191 * HTML.pm.src: Set line/column information to element nodes.
775    
776     * mkhtmlparser.pl (!!!create-element, !!!insert-element,
777     and !!!insert-element-t): Set line/column information to
778     element nodes.
779    
780     2008-03-17 Wakaba <wakaba@suika.fam.cx>
781    
782 wakaba 1.190 * HTML.pm.src (_get_next_token): The first "<" character
783     in "<?", "<>", or "</>" should be the error point.
784    
785 wakaba 1.187 2008-03-16 Wakaba <wakaba@suika.fam.cx>
786    
787 wakaba 1.189 * HTML.pm.src: Some more fixes on error position reporting.
788    
789     2008-03-16 Wakaba <wakaba@suika.fam.cx>
790    
791 wakaba 1.188 * HTML.pm.src: Token-level precious error reporting.
792    
793     2008-03-16 Wakaba <wakaba@suika.fam.cx>
794    
795 wakaba 1.187 * HTML.pm.src: Preparation for more precious error point
796     reporting.
797    
798 wakaba 1.184 2008-03-11 Wakaba <wakaba@suika.fam.cx>
799    
800 wakaba 1.186 * HTML.pm.src: Error type revised.
801    
802     2008-03-11 Wakaba <wakaba@suika.fam.cx>
803    
804 wakaba 1.185 * HTML.pm.src: Similar codes are merged together, again.
805    
806     2008-03-11 Wakaba <wakaba@suika.fam.cx>
807    
808 wakaba 1.184 * HTML.pm.src: Similar codes are merged together.
809    
810 wakaba 1.183 2008-03-10 Wakaba <wakaba@suika.fam.cx>
811    
812     * mkhtmlparser.pl: Set "level" parameter to parse errors.
813    
814     * HTML.pm.src: Code refined.
815    
816 wakaba 1.175 2008-03-09 Wakaba <wakaba@suika.fam.cx>
817    
818 wakaba 1.182 * HTML.pm.src: |</body>| treatement has been changed (HTML5
819     revision 1348). Note that I really don't know this makes
820     any difference in the black-box behavior of the parser.
821    
822     2008-03-09 Wakaba <wakaba@suika.fam.cx>
823    
824 wakaba 1.181 * HTML.pm.src: New end-of-file token implementation (HTML5
825     revision 1348).
826    
827     2008-03-09 Wakaba <wakaba@suika.fam.cx>
828    
829 wakaba 1.180 * HTML.pm.src: |applet| support (HTML5 revision 1347).
830    
831     2008-03-09 Wakaba <wakaba@suika.fam.cx>
832    
833 wakaba 1.179 * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
834    
835     2008-03-09 Wakaba <wakaba@suika.fam.cx>
836    
837 wakaba 1.178 * HTML.pm.src: Support for |<input>| in the "in select" insertion
838     mode and support for the "in select in table" insertion mode (HTML5
839     revision 1342).
840    
841     2008-03-09 Wakaba <wakaba@suika.fam.cx>
842    
843 wakaba 1.177 * HTML.pm.src: No foster parenting for <script> and <script>
844     in non-tainted <table>s (HTML5 revision 1336).
845    
846     2008-03-09 Wakaba <wakaba@suika.fam.cx>
847    
848 wakaba 1.176 * HTML.pm.src: Ignore white space characters between <html>
849     and <head> (HTML5 revision 1332).
850    
851     2008-03-09 Wakaba <wakaba@suika.fam.cx>
852    
853 wakaba 1.175 * HTML.pm.src: Treat <input type=hidden> as if it were a
854     white space (HTML5 revision 1331).
855    
856 wakaba 1.164 2008-03-08 Wakaba <wakaba@suika.fam.cx>
857    
858 wakaba 1.174 * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
859     element (HTML5 revision 1330).
860    
861     2008-03-08 Wakaba <wakaba@suika.fam.cx>
862    
863 wakaba 1.173 * HTML.pm.src: <title> is always appended to the current
864     element (HTML5 revision 1328).
865    
866     2008-03-08 Wakaba <wakaba@suika.fam.cx>
867    
868 wakaba 1.172 * HTML.pm.src: White space in tainted tables are moved
869     into foster parents (HTML5 revision 1326).
870    
871     2008-03-08 Wakaba <wakaba@suika.fam.cx>
872    
873 wakaba 1.171 * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
874     revision 1321).
875    
876     2008-03-08 Wakaba <wakaba@suika.fam.cx>
877    
878 wakaba 1.170 * HTML.pm.src: |</h/n/>| case code rearranged to align with
879     the spec (HTML5 revision 1320). Note that we finally complete
880     all of HTML5 revision 1320 changes.
881    
882     2008-03-08 Wakaba <wakaba@suika.fam.cx>
883    
884 wakaba 1.169 * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
885     tags (HTML5 revision 1320).
886    
887     2008-03-08 Wakaba <wakaba@suika.fam.cx>
888    
889 wakaba 1.168 * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
890    
891     2008-03-08 Wakaba <wakaba@suika.fam.cx>
892    
893 wakaba 1.167 * HTML.pm.src: A "generate implied end tags" code (t409.1)
894     could not be reached so that it is now removed (HTML5 revision
895     1320).
896    
897     2008-03-08 Wakaba <wakaba@suika.fam.cx>
898    
899 wakaba 1.166 * HTML.pm.src: Code for the case of |</div>| and so on are revised
900     to align with new spec text (HTML5 revision 1320).
901    
902     2008-03-08 Wakaba <wakaba@suika.fam.cx>
903    
904 wakaba 1.165 * HTML.pm.src: Remove strange |if| condition; however, it should
905     have had no harm in theory.
906    
907     2008-03-08 Wakaba <wakaba@suika.fam.cx>
908    
909 wakaba 1.164 * HTML.pm.src (_tree_construction_main): '</p>' in body
910     case is split from other end tags for the preparation
911     of implementing HTML revision 1320.
912    
913 wakaba 1.162 2008-03-07 Wakaba <wakaba@suika.fam.cx>
914    
915 wakaba 1.163 * HTML.pm.src: Simplified "generate implied end tag" (HTML5
916     revision 1320).
917    
918     2008-03-07 Wakaba <wakaba@suika.fam.cx>
919    
920 wakaba 1.162 * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
921     and "div" (HTML5 revision 1318). Add comments to where
922     |form| pointer association codes should be inserted (HTML5 revision
923     1319).
924    
925 wakaba 1.161 2008-03-06 Wakaba <wakaba@suika.fam.cx>
926    
927     * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
928    
929 wakaba 1.159 2008-03-05 Wakaba <wakaba@suika.fam.cx>
930    
931 wakaba 1.160 * HTML.pm.src: Since the case t268 should never be reached (no
932     other token type, there are), it is replaced by a |die| statement.
933    
934     2008-03-05 Wakaba <wakaba@suika.fam.cx>
935    
936 wakaba 1.159 * HTML.pm.src: Typo fixed.
937    
938 wakaba 1.158 2008-03-04 Wakaba <wakaba@suika.fam.cx>
939    
940     * HTML.pm.src (_tree_construction_initial): Some limited quirks
941     doctypes were not uppercased for comparison.
942    
943 wakaba 1.152 2008-03-03 Wakaba <wakaba@suika.fam.cx>
944    
945 wakaba 1.157 * HTML.pm.src (tree construction and set_inner_html): Checkpoints
946     are added.
947    
948     2008-03-03 Wakaba <wakaba@suika.fam.cx>
949    
950 wakaba 1.156 * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
951     are set. Cases that are unlikely reached are noted as so.
952    
953     2008-03-03 Wakaba <wakaba@suika.fam.cx>
954    
955 wakaba 1.155 * HTML.pm.src: Checkpoints for debugging are added.
956    
957     * mkhtmlparser.pl: Support for |!!!cp| syntax.
958    
959     2008-03-03 Wakaba <wakaba@suika.fam.cx>
960    
961 wakaba 1.154 * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
962     for simplicity.
963    
964     2008-03-03 Wakaba <wakaba@suika.fam.cx>
965    
966 wakaba 1.153 * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
967     revision 1307).
968    
969     2008-03-03 Wakaba <wakaba@suika.fam.cx>
970    
971 wakaba 1.152 * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
972     is set are changed (HTML5 revision 1305).
973    
974 wakaba 1.149 2008-03-02 Wakaba <wakaba@suika.fam.cx>
975    
976 wakaba 1.151 * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
977     1292). Entities are not parsed in comment-like part in RCDATA
978     elements (HTML5 revision 1294). Allow bare & at the end
979     of attribute value literals (HTML5 revision 1296). More
980     quirks mode doctypes (HTML5 revision 1302). Requires spaces
981     between attributes and ban attribute names or unquoted
982     attribute values containing single or double quotes (HTML5
983     revision 1303).
984    
985     2008-03-02 Wakaba <wakaba@suika.fam.cx>
986    
987 wakaba 1.150 * ContentChecker.pm: Typo fixed. Don't raise "character encoding"
988     and related errors unless it is an HTML document (though the spec
989     is unclear on whether it is applied to XHTML document).
990    
991     * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
992     are added.
993    
994     2008-03-02 Wakaba <wakaba@suika.fam.cx>
995    
996 wakaba 1.149 * HTML.pm.src: s/local_name/manakai_local_name/g.
997    
998 wakaba 1.148 2008-03-01 Wakaba <wakaba@suika.fam.cx>
999    
1000     * _NamedEntityList.pm: Updated (HTML5 revision 1286).
1001    
1002     * HTML.pm.src: |charset| in |content| attribute is
1003     case-insensitive (HTML5 revision 1270).
1004    
1005 wakaba 1.147 2008-02-26 Wakaba <wakaba@suika.fam.cx>
1006    
1007     * ContentChecker.pm: New status constants are added.
1008     ($ElementDefault): |status| added.
1009     (check_element): Err for non-standard or deprecated elements.
1010     (_attr_status_info): For non-standard or deprecated attributes.
1011    
1012 wakaba 1.142 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1013    
1014 wakaba 1.146 * ContentChecker.pm (_attr_status_info): New internal method.
1015    
1016     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1017    
1018 wakaba 1.145 * ContentChecker.pm (check_element): Element standardized
1019     status information is now dispatched.
1020    
1021     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1022    
1023 wakaba 1.144 * ContentChecker.pm (check_element): Fix |del|-and-significant
1024     problem by adding some more arguments.
1025    
1026     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1027    
1028 wakaba 1.143 * ContentChecker.pm (check_element): Use context of
1029     container-for-the-purpose-of-content-model element (not transparent
1030     element) for |check_child_element| calling and significant
1031     text flag marking. This reintroduces |<del>|-and-significant
1032     problem again.
1033    
1034     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1035    
1036 wakaba 1.142 * ContentChecker.pm (check_element): Make semi-transparent
1037     elements ignored for the purpose of phase changes in
1038     content model checking.
1039    
1040 wakaba 1.138 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1041    
1042 wakaba 1.141 * ContentChecker.pm (check_element): In-element state
1043     was not properly managed for transparent cases.
1044    
1045     2008-02-23 Wakaba <wakaba@suika.fam.cx>
1046    
1047 wakaba 1.140 * ContentChecker.pm (check_element): Support for |video|
1048     and |audio| as semi-transparent elements.
1049    
1050     2008-02-23 Wakaba <wakaba@suika.fam.cx>
1051    
1052 wakaba 1.139 * ContentChecker.pm ($HTMLSemiTransparentElements): New.
1053     (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
1054     Support for |html:object| as a semi-transparent element.
1055    
1056     2008-02-23 Wakaba <wakaba@suika.fam.cx>
1057    
1058 wakaba 1.138 * ContentChecker.pm (check_element): The way to traverse
1059     the tree is entirely revised to make it easier to track
1060     the state of ancestors/descendants. As a result of this
1061     revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1062     support for content model checking for HTML elements |figure|,
1063     |object|, |video|, and |audio| and checking for XML elements (and
1064     some XMLNS checkings) are dropped for now. They will be
1065     reimplemented in due cource.
1066    
1067 wakaba 1.134 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1068    
1069 wakaba 1.137 * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1070     token (HTML5 revision 1225).
1071    
1072     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1073    
1074 wakaba 1.136 * ContentChecker.pm ({unsupported_level}): New value.
1075    
1076     * HTML.pm.src: Save whether |meta| |content| attribute
1077     contains character references or not.
1078    
1079     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1080    
1081 wakaba 1.135 * ContentChecker.pm (_get_children): (Incompleted) attempt to
1082     imlement significant content checking for contents
1083     with |del| elements.
1084    
1085     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1086    
1087 wakaba 1.134 * ContenteChecker.pm ($HTMLTransparentElements): More
1088     elements are added.
1089     (_get_children): HTML |object| elements are now semi-transparent.
1090    
1091     * NanoDOM.pm (manakai_html, manakai_head): New methods.
1092    
1093 wakaba 1.133 2008-02-16 Wakaba <wakaba@suika.fam.cx>
1094    
1095     * CacheManifest.pm: HTML5 revision 1211 implemented.
1096    
1097     * CacheManifest.pod: Updated.
1098    
1099 wakaba 1.132 2008-02-10 Wakaba <wakaba@suika.fam.cx>
1100    
1101     * ContentChecker.pm (check_document, check_element): Support
1102     for second argument ($onsubdoc).
1103     (_get_css_parser): Removed (now it is part of WDCC).
1104    
1105 wakaba 1.131 2008-02-09 Wakaba <wakaba@suika.fam.cx>
1106    
1107     * ContentChecker.pm (_get_css_parser): New.
1108    
1109 wakaba 1.129 2007-11-25 Wakaba <wakaba@suika.fam.cx>
1110    
1111 wakaba 1.130 * ContentChecker.pm ($AnyChecker): Old way to add child elements
1112     for checking had been used.
1113    
1114     2007-11-25 Wakaba <wakaba@suika.fam.cx>
1115    
1116 wakaba 1.129 * ContentChecker.pm (check_element): New todo item type |descendant|.
1117    
1118 wakaba 1.125 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1119    
1120 wakaba 1.128 * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1121     specced in RFC 2046 and RFC 4288.
1122     (application/atom+xml): Definition added.
1123    
1124     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1125    
1126 wakaba 1.127 * URIChecker.pm: Make RFC 3986 should-level errors
1127     warnings (rather than SHOULD-level errors).
1128    
1129     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1130    
1131 wakaba 1.126 * NanoDOM.pm (get_user_data, set_user_data): New methods.
1132    
1133     * HTML.pm.src: A flag for character references in attribute
1134     values are added. Set |manakai_has_reference| user data
1135     to |charset| attribute.
1136    
1137     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1138    
1139 wakaba 1.125 * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1140     attributes.
1141    
1142     * ContentChecker.pm (check_document): Warn if charset requirements
1143     cannot be tested.
1144    
1145 wakaba 1.124 2007-11-19 Wakaba <wakaba@suika.fam.cx>
1146    
1147     * HTML.pm.src (parse_byte_string): Detect charset
1148     by universalchardet if charset parameter is not specified.
1149    
1150     * Makefile (Charset-all, Charset-clean): New rules.
1151    
1152 wakaba 1.120 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1153    
1154 wakaba 1.123 * ContentChecker.pm (check_document): Check the existence
1155     of character encoding declaration and selection of encoding
1156     for HTML document.
1157    
1158     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1159    
1160 wakaba 1.122 * ContentType.pm (get_sniffed_type): Return also the official
1161     type in list context.
1162    
1163     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1164    
1165 wakaba 1.121 * ContentType.pm: Sniffing with leading white space
1166     ignoring (HTML5 revisions 1013 and 1016).
1167    
1168     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1169    
1170 wakaba 1.120 * ContentType.pm: HTML5 revision 1013 changes, except for leading
1171     white spaces, are implemented.
1172    
1173 wakaba 1.117 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1174    
1175 wakaba 1.119 * HTML.pm.src (parse_char_string): Set |inner_encoding|
1176     attribute if possible.
1177    
1178     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1179    
1180 wakaba 1.118 * HTML.pm.src (parse_byte_string): New method.
1181     (parse_char_string): New alias for |parse_string|.
1182     (main phase): Invoking "change the encoding" algorithm if desired.
1183    
1184     * HTML.pod: Updated.
1185    
1186     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1187    
1188 wakaba 1.117 * HTML.pod (get_inner_html): Removed.
1189    
1190     * Makefile (HTML-all, HTML-clean): New.
1191    
1192     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1193    
1194     * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1195    
1196 wakaba 1.116 2007-11-08 Wakaba <wakaba@suika.fam.cx>
1197    
1198     * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1199    
1200 wakaba 1.112 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1201    
1202 wakaba 1.115 * Makefile: |CacheManifest.html| is added.
1203    
1204     * CacheManifest.pod: New file.
1205    
1206     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1207    
1208 wakaba 1.114 * CacheManifest.pm: New module.
1209    
1210     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1211    
1212 wakaba 1.113 * HTML.pm.src: Support for application cache selection algorithm
1213     callback.
1214    
1215     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1216    
1217 wakaba 1.112 * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1218    
1219 wakaba 1.110 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1220    
1221 wakaba 1.111 * Makefile (clean): New rule.
1222    
1223     * NanoDOM.pm (public_id, system_id): New attributes.
1224    
1225     2007-10-17 Wakaba <wakaba@suika.fam.cx>
1226    
1227 wakaba 1.110 * Makefile (CSS-all, CSS-clean, clean): New rules.
1228    
1229 wakaba 1.109 2007-10-14 Wakaba <wakaba@suika.fam.cx>
1230    
1231     * ContentChecker.pm (check_document): Support for
1232     new |is_xml_root| flag.
1233     (check_element): Support for new |pluses| state.
1234     (_add_pluses): New method.
1235     (_remove_minuses): Support for new |minus| item.
1236    
1237 wakaba 1.106 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1238    
1239 wakaba 1.107 * ContentChecker.pm: Raise specific error for invalid
1240     root element.
1241    
1242     2007-09-24 Wakaba <wakaba@suika.fam.cx>
1243    
1244 wakaba 1.106 * ContentChecker.pm: Set level values for later uses.
1245    
1246 wakaba 1.103 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1247    
1248 wakaba 1.105 * ContentChecker.pm: Support for language tag validation.
1249    
1250     2007-09-09 Wakaba <wakaba@suika.fam.cx>
1251    
1252 wakaba 1.104 * LangTag.pm (check_rfc3066_language_tag): New method.
1253    
1254     2007-09-09 Wakaba <wakaba@suika.fam.cx>
1255    
1256 wakaba 1.103 * LangTag.pm: New module.
1257    
1258 wakaba 1.102 2007-09-04 Wakaba <wakaba@suika.fam.cx>
1259    
1260     * HTML.pm.src: Some error types were wrong.
1261    
1262 wakaba 1.100 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1263    
1264 wakaba 1.101 * CSS/: New directory.
1265    
1266     2007-08-17 Wakaba <wakaba@suika.fam.cx>
1267    
1268 wakaba 1.100 * ContentChecker.pm (_check_get_children): Support
1269     for |noscript| in |head|.
1270    
1271 wakaba 1.99 2007-08-12 Wakaba <wakaba@suika.fam.cx>
1272    
1273     * URI/: New directory.
1274    
1275 wakaba 1.94 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1276    
1277 wakaba 1.98 * HTML.pm.src: Tokenizer's states are now represented in
1278     number.
1279    
1280     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1281    
1282 wakaba 1.97 * HTML.pm.src: |or|s for insertion modes are replaced
1283     by |&|s.
1284    
1285     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1286    
1287 wakaba 1.96 * HTML.pm.src: Token types are now represented in number.
1288    
1289     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1290    
1291 wakaba 1.95 * ContentType.pm (SEE ALSO): Updated.
1292    
1293     * HTML.pm.src: Insertion modes are now represented in number.
1294    
1295     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1296    
1297 wakaba 1.94 * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1298     999) is implemented.
1299    
1300 wakaba 1.93 2007-08-08 Wakaba <wakaba@suika.fam.cx>
1301    
1302     * ContentType.pm: Sniffing for |<script| (HTML5 revision
1303     983) is implemented.
1304    
1305 wakaba 1.92 2007-08-06 Wakaba <wakaba@suika.fam.cx>
1306    
1307     * ContentChecker.pod: New documentation.
1308    
1309     * Makefile: A rule for |ContentChecker.html| is added.
1310    
1311     * ContentChecker.pm: A pod "LICENSE" section is added.
1312    
1313     * NanoDOM.pm ($VERSION): New variable.
1314    
1315 wakaba 1.90 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1316    
1317 wakaba 1.91 * H2H.pm: |b|, |i|, and |sub| are added to the
1318     list of allowed HTML elements.
1319    
1320     2007-08-05 Wakaba <wakaba@suika.fam.cx>
1321    
1322 wakaba 1.90 * H2H.pm: |samp| is added to the list of allowed
1323     HTML elements.
1324    
1325     * URIChecker.pm (check_iri): New.
1326     (check_iri_reference): Error type for IRI reference
1327     syntax error is changed.
1328    
1329 wakaba 1.87 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1330    
1331 wakaba 1.89 * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1332     is added.
1333     (check_document): Load appropriate module before validation.
1334    
1335     2007-08-04 Wakaba <wakaba@suika.fam.cx>
1336    
1337 wakaba 1.88 * ContentChecker/: New directory.
1338    
1339     2007-08-04 Wakaba <wakaba@suika.fam.cx>
1340    
1341 wakaba 1.87 * ContentChecker.pm: HTML |time| element is implemented.
1342    
1343     * HTMLTable.pm: Comments are updated as HTML5 is revised.
1344    
1345 wakaba 1.71 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1346    
1347 wakaba 1.86 * ContentChecker.pm (check_document): Return value
1348     even if no document element is found.
1349    
1350     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1351    
1352 wakaba 1.85 * HTML.pm.src: |$in_body| is no longer a function.
1353    
1354     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1355    
1356 wakaba 1.84 * HTML.pm.src: The |$in_body| code has been moved down.
1357    
1358     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1359    
1360 wakaba 1.83 * HTML.pm.src: The "trailing end" insertion mode
1361     is split into "after html body" and "after html frameset"
1362     insertion modes. Their codes are merged with "after body"
1363     and "after frameset" codes. |$previous_insertion_mode|
1364     has been removed. "after frameset" code is
1365     merged with "in frameset" code.
1366    
1367     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1368    
1369 wakaba 1.82 * HTML.pm.src: The "before head" insertion mode is
1370     merged with the "in head" insertion mode.
1371    
1372     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1373    
1374 wakaba 1.81 * HTML.pm.src: Cases in "in head" insertion mode are
1375     reorganized.
1376    
1377     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1378    
1379 wakaba 1.80 * HTML.pm.src: Some cases in "in table" insertion mode
1380     are merged.
1381    
1382     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1383    
1384 wakaba 1.79 * HTML.pm.src: The "in row" insertion mode is merged
1385     with "in table" insertion mode.
1386    
1387     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1388    
1389 wakaba 1.78 * HTML.pm.src: The "in table" and "in table body" insertion
1390     modes are merged.
1391    
1392     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1393    
1394 wakaba 1.77 * HTML.pm.src: There is no "in table head" or "in table foot"
1395     insertion mode!
1396    
1397     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1398    
1399 wakaba 1.76 * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
1400     now directly invoke the handler.
1401    
1402     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1403    
1404 wakaba 1.75 * HTML.pm.src: Codes for "in cell" insertion mode
1405     is merged to the "in body" insertion mode code.
1406    
1407     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1408    
1409 wakaba 1.74 * HTML.pm.src: Codes for "in body" and "in caption"
1410     insertion modes are merged.
1411    
1412     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1413    
1414 wakaba 1.73 * HTML.pm.src: Two |!!!next-token|s were missing.
1415    
1416     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1417    
1418 wakaba 1.72 * HTML.pm.src: Use numeric constant for |{content_mode}|
1419     instead of string constant for |{content_model_flag}|.
1420    
1421     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1422    
1423 wakaba 1.71 * HTML.pm.src: Add the name of the attribute
1424     to the "duplicate attribute" error.
1425    
1426 wakaba 1.69 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1427    
1428 wakaba 1.70 * ContentChecker.pm: Return the |class| node list.
1429    
1430     2007-07-17 Wakaba <wakaba@suika.fam.cx>
1431    
1432 wakaba 1.69 * ContentChecker.pm: Return the |id| node list.
1433    
1434     * HTML.pm.src: A typo is fixed.
1435    
1436 wakaba 1.63 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1437    
1438 wakaba 1.68 * ContentChecker.pm: Drop wrong |level => 'error'| specification
1439     from "in HTML:xml:lang" error. Character position
1440     is now the last part of the error type in the URI error
1441     description. Report "unsupported" status for language
1442     tags, media queries, script codes, and style sheets.
1443    
1444     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1445    
1446 wakaba 1.67 * ContentChecker.pm: Report error if |xml:lang|
1447     in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
1448     in XML.
1449    
1450     * NanoDOM.pm (Attr.owner_document): New attribute.
1451    
1452     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1453    
1454 wakaba 1.66 * HTML.pm.src: The character immediately following
1455     a bare |hcro| was discarded. Fix handling of
1456     entity references in attribute values.
1457    
1458     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1459    
1460 wakaba 1.65 * HTML.pm.src (main and trailin end phases): Token
1461     types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
1462     factored out. Error types |in html:#DOCTYPE|
1463     and |after html:#DOCTYPE| are merged
1464     into |DOCTYPE in the middle|. |</frameset>|
1465     in fragment parsing mode changed the
1466     insertion mode.
1467    
1468     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1469    
1470 wakaba 1.64 * HTML.pm.src: |$phase| has been removed; The |trailing end|
1471     phase is now an insertion mode. Treatments for white
1472     space character tokens were incorrect for some
1473     insertion modes. An old |meta| case was not removed.
1474    
1475     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1476    
1477 wakaba 1.63 * HTML.pm.src: |meta| charset declaration extraction
1478     implemented (but changing the encoding is not yet:-).
1479    
1480 wakaba 1.60 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1481    
1482 wakaba 1.62 * Charset/: New directory.
1483    
1484     2007-07-15 Wakaba <wakaba@suika.fam.cx>
1485    
1486 wakaba 1.61 * H2H.pm: New Perl module (created from
1487     manakai's H2H.dis).
1488    
1489     2007-07-15 Wakaba <wakaba@suika.fam.cx>
1490    
1491 wakaba 1.60 * XMLSerializer.pm: New Perl module (created from
1492     manakai's SimpleLS.dis).
1493    
1494 wakaba 1.59 2007-07-07 Wakaba <wakaba@suika.fam.cx>
1495    
1496     * HTML.pm.src: |<!---x-->| was not processed correctly.
1497    
1498 wakaba 1.57 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1499    
1500 wakaba 1.58 * HTML.pm.src: Report correct error message
1501     for |<body></div></body>|.
1502    
1503     2007-07-01 Wakaba <wakaba@suika.fam.cx>
1504    
1505 wakaba 1.57 * HTMLTable.pm: An error description was incorrect.
1506    
1507     2007-06-30 Wakaba <wakaba@suika.fam.cx>
1508    
1509     * ContentChecker.pm: Return |{term}| list.
1510    
1511 wakaba 1.55 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1512    
1513 wakaba 1.56 * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
1514     nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
1515     and <title> outside of head).
1516    
1517     2007-06-30 Wakaba <wakaba@suika.fam.cx>
1518    
1519 wakaba 1.55 * IMTChecker.pm: Report warning for unregistered
1520     and private types/subtypes.
1521    
1522     * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
1523     URIChecker.pm, HTMLTable.pm: Error messages are now
1524     consistent; they are all listed in
1525     <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
1526    
1527 wakaba 1.52 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1528    
1529 wakaba 1.54 * ContentChecker.pm: |<img ismap>| not in |<a></a>|
1530     is now erred. |<datalist>| is implemented.
1531     Attribute checker for |<command>| and |<menu>| are
1532     added. Support for |contextmenu| global attribute
1533     is added.
1534    
1535     2007-06-25 Wakaba <wakaba@suika.fam.cx>
1536    
1537 wakaba 1.53 * HTML.pm.src (_reset_insertion_mode): Interpretation
1538     of Step 3 has been changed.
1539    
1540     2007-06-25 Wakaba <wakaba@suika.fam.cx>
1541    
1542 wakaba 1.52 * HTML.pm.src: Late |<html>| parse error is implemented.
1543    
1544 wakaba 1.47 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1545    
1546 wakaba 1.51 * URIChecker.pm (check_iri_reference): A |decode| method name was
1547     incorrect.
1548    
1549     * ContentChecker.pm: Support for the |footer| element.
1550     Check URI syntax for space-separated URI attributes.
1551     Support for the |tabindex| attribute. Support
1552     for |datetime| attribute.
1553    
1554     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1555    
1556 wakaba 1.50 * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
1557     character references). HTML5 revision 1.146 (white space
1558     characters before root start tag). HTML5 revision
1559     1.148 (named character references in attribute values).
1560     HTML5 revision 1.152 (<plaintext>.innerHTML get).
1561    
1562     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1563    
1564     * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
1565 wakaba 1.49 in <head>).
1566    
1567     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1568    
1569 wakaba 1.48 * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>
1570     in body).
1571    
1572     * ContentChecker.pm: HTML5 revision 938 (scoped="").
1573    
1574     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1575    
1576 wakaba 1.47 * HTML.pm.src: HTML5 revision 923 (matching end tag in CDATA
1577     or RCDATA in fragment parsing mode). HTML5 revision
1578     924 (<!--> and <!--->). HTML5 revision 926 (hn in hn).
1579    
1580     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1581    
1582     * HTML.pm.src (get_inner_html): HTML5 revision 922 (inner_html
1583     for <pre> and <textarea>).
1584    
1585 wakaba 1.33 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1586    
1587 wakaba 1.46 * HTML.pm.src: HTML5 revision 920 (<isindex>).
1588    
1589     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1590    
1591 wakaba 1.45 * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).
1592    
1593     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1594    
1595 wakaba 1.44 * HTML.pm.src: HTML5 revision 916 (</body>).
1596     HTML5 revision 917 (conforming bare &).
1597    
1598     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1599    
1600 wakaba 1.43 * NanoDOM.pm (manakai_is_html): Setting to false did not work.
1601    
1602     * HTML.pm.src: HTML5 revision 914 (</ in CDATA, RCDATA).
1603     HTML5 revision 915 (<nobr>).
1604    
1605     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1606    
1607 wakaba 1.42 * HTML.pm.src: HTML5 revisions 908, 909, 912, and 913 (quirks mode).
1608    
1609     * NanoDOM.pm (manakai_is_html, manakai_compat_mode, compat_mode):
1610     New attributes.
1611    
1612     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1613    
1614 wakaba 1.41 * HTML.pm.src: HTML5 revisions 900, 901, 902, and 911 (<
1615     in tags).
1616    
1617     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1618    
1619 wakaba 1.40 * .cvsignore: |Entities.html| is added.
1620    
1621     * HTML.pm.src: |$entity_char| is removed and
1622     requires |Whatpm::_NamedEntityList| instead.
1623     HTML5 revision 898 (refc), except that lack of refc
1624     is parse error.
1625    
1626     * mkentitylist.pl: New script.
1627    
1628     * Makefile (all): |_NamedEntityList.pm| is added.
1629     (_NamedEntityList.pm, Entities.html): New rules.
1630    
1631     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1632    
1633 wakaba 1.39 * HTML.pm.src: Parse errors immediately after U+000D
1634     were ignored and U+000D immediately following another
1635     U+000D was not converted to U+000A.
1636    
1637     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1638    
1639 wakaba 1.38 * HTML.pm.src (set_inner_html): HTML5 revision 892 (adopt
1640     nodes before appended). Parser was not ready for NULL
1641     parse error and escape flag.
1642    
1643     * NanoDOM.pm (adopt_node): New.
1644    
1645     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1646    
1647 wakaba 1.37 * HTML.pm.src: HTML5 revision 886 (insane comment in
1648     CDATA and RCDATA). Note that current implementation
1649     is simply repeating what the spec says and it is maybe not
1650     a best way to do it.
1651    
1652     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1653    
1654 wakaba 1.36 * HTML.pm.src: HTML5 revision 884 (</form> don't close
1655     the form element if a descendant element without implied end tag has
1656     still been open).
1657    
1658     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1659    
1660 wakaba 1.35 * ContentChecker.pm: HTML5 revision 881 (Make |id| attribute with
1661     space characters non-conforming).
1662    
1663     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1664    
1665 wakaba 1.34 * HTML.pm.src: An error message was incorrect.
1666     HTML5 revision 869 (C1 character references).
1667    
1668     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1669    
1670 wakaba 1.33 * HTML.pm.src: HTML5 revision 867 (a LF at the beginning of
1671     a |textarea| is removed).
1672    
1673 wakaba 1.32 2007-06-05 Wakaba <wakaba@suika.fam.cx>
1674    
1675     * NanoDOM.pm (get_attribute_node_ns): New method.
1676    
1677     * ContentChecker.pm: |script| |async| and |defer|
1678     no longer require |src|. |async| MUST NOT be
1679     specified if |defer|. (HTML5 revision 858).
1680    
1681 wakaba 1.31 2007-05-30 Wakaba <wakaba@suika.fam.cx>
1682    
1683     * HTML.pm.src: |<form><form>| went to inifinite loop.
1684    
1685 wakaba 1.27 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1686    
1687 wakaba 1.30 * ContentChecker.pm (html): Set |is_root| (allowed
1688     as a document element) flag on.
1689     (new): Removed.
1690     (check_document): New method.
1691    
1692     2007-05-27 Wakaba <wakaba@suika.fam.cx>
1693    
1694 wakaba 1.29 * ContentChecker.pm (thead, tfoot): Checker specifications
1695     were incorrect.
1696    
1697     2007-05-27 Wakaba <wakaba@suika.fam.cx>
1698    
1699 wakaba 1.28 * ContentChecker.pm ($HTMLURIAttrChecker): Include
1700     error position in the |type| option of the error.
1701    
1702     * HTMLTable.pm (form_table): The |$onerror| parameter
1703     is now optional. Some bugs are fixed.
1704    
1705     2007-05-27 Wakaba <wakaba@suika.fam.cx>
1706    
1707 wakaba 1.27 * HTMLTable.pm: New module.
1708    
1709     * ContentChecker.pm (table): Invoke table model error checker.
1710    
1711     * NanoDOM.pm (first_child, get_attribute_ns): New.
1712    
1713 wakaba 1.25 2007-05-26 Wakaba <wakaba@suika.fam.cx>
1714    
1715 wakaba 1.26 * ContentChecker.pm ($HTMLLinkTypesAttrChecker): New checker.
1716     (link/@rel, a/@rel, area/@rel): Use new checker.
1717    
1718     * Makefile (_LinkTypeList.pm, RelExtensions.html): New rules.
1719    
1720     * _LinkTypeList.pm: New file.
1721    
1722     * mklinktypelist.pl: New file.
1723    
1724     * .cvsignore: |RelExtensions.html| added.
1725    
1726     * NanoDOM.pm (child_nodes): Returns an empty array
1727     for non-child-containing node types.
1728     (text_content): New attribute.
1729    
1730     2007-05-26 Wakaba <wakaba@suika.fam.cx>
1731    
1732 wakaba 1.25 * IMTChecker.pm: New module.
1733    
1734     * ContentChecker.pm ($HTMLIMTAttrChecker): Call IMTChecker
1735     to test parameter value validity.
1736    
1737     * HTML.pm.src ($style_start_tag): Attributes were
1738     discarded.
1739    
1740 wakaba 1.22 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1741    
1742 wakaba 1.24 * ContentChecker.pm ($HTMLURIAttrChecker): Implemented.
1743    
1744     2007-05-25 Wakaba <wakaba@suika.fam.cx>
1745    
1746 wakaba 1.23 * URIChecker.pm: All recommendations from RFC 3986
1747     and RFC 3987 are listed (not all testable items are checked yet).
1748    
1749     2007-05-25 Wakaba <wakaba@suika.fam.cx>
1750    
1751 wakaba 1.22 * URIChecker.pm: New module.
1752    
1753 wakaba 1.18 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1754    
1755 wakaba 1.21 * ContentChecker.pm: Now most attributes are implemented
1756     or associated to some placeholder.
1757     ($ElementDefault): Warn unknown attributes
1758     for unknown elements as "attribute not supported".
1759     ($HTMLLanguageTagAttrChecker, $HTMLMQAttrChecker): New placeholders.
1760     ($HTMLUsemapAttrChecker, $HTMLTargetAttrChecker): New checkers.
1761     (|a| attribute checker): Reimplemented.
1762    
1763     2007-05-20 Wakaba <wakaba@suika.fam.cx>
1764    
1765 wakaba 1.20 * ContentChecker.pm ($HTMLEventHandlerAttrChecker): New placeholder.
1766     ($HTMLAttrChecker): Event handler content attributes
1767     are added.
1768     (link, embed): Required attribute is now checked.
1769     (embed): Unknown local attributes are no longer warned.
1770    
1771     2007-05-20 Wakaba <wakaba@suika.fam.cx>
1772    
1773 wakaba 1.19 * ContentChecker.pm ($HTMLSpaceURIsAttrChecker): New placeholder.
1774     ($HTMLIMTAttrChecker): New checker.
1775     (link@rel, link@href, link@type, style@type,
1776     a@href, a@ping, a@ping, a@type, embed@src, embed@type,
1777     object@data, object@type, source@src, source@type, area@alt,
1778     area@shape, area@coords, area@href,
1779     area@ping, area@rel, area@type, script@src,
1780     script@defer, script@async, script@type): Checkers added.
1781    
1782     2007-05-20 Wakaba <wakaba@suika.fam.cx>
1783    
1784 wakaba 1.18 * ContentChecker.pm: Descendant checking was incorrect.
1785    
1786 wakaba 1.12 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1787    
1788 wakaba 1.17 * ContentChecker.pm: Support |xml:*| and |xmlns:*|
1789     attributes. Report an error if |Element.prefix|
1790     is |xmlns|.
1791    
1792     * NanoDOM.pm (prefix): New attribute.
1793    
1794     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1795    
1796 wakaba 1.16 * HTML.pm.src: In |main| phase, |in body| insertion
1797     mode, action for |<iframe>| was missing.
1798    
1799     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1800    
1801 wakaba 1.15 * ContentChecker.pm: Support for many of HTML5 elements.
1802     ($GetHTMLNonNegativeIntegerAttrChecker): New.
1803    
1804     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1805    
1806 wakaba 1.14 * ContentChecker.pm: Support for most elements up to |progress|.
1807     ($HTMLURIAttrChecker): Placeholder.
1808     ($HTMLIntegerAttrChecker, $GetHTMLFloatingPointNumberAttrChecker):
1809     New.
1810    
1811     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1812    
1813 wakaba 1.13 * ContentChecker.pm: Attribute checkers
1814     for global attributes, |html|, |base|, |style|, and |meta|.
1815    
1816     * NanoDOM.pm (insert_before): Weaken reference
1817     to the parent node.
1818     (Attr::new): Set |owner_element| attribute.
1819     (namespace_uri, manakai_local_name): New attribute implementations.
1820     (owner_element): New attribute.
1821    
1822     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1823    
1824 wakaba 1.12 * ContentChecker.pm ($AttrChecker, $HTMLAttrChecker,
1825     $AnyChecker->{attr_checker}, $HTMLAttrsChecker,
1826     $Element->{$HTML_NS}->{''}): New.
1827     (check_element): Invoke attrs_checker for each element.
1828    
1829 wakaba 1.6 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1830    
1831 wakaba 1.11 * ContentChecker.pm: Don't use |manakai_element_type_match|.
1832    
1833     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1834    
1835 wakaba 1.10 * ContentChecker.pm: Use hashs rather than lists for
1836     element type testings.
1837    
1838     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1839    
1840 wakaba 1.9 * ContentChecker.pm: Don't generate duplicate
1841     error when an element type is put in the "minus" list
1842     and the element type is not allowed explicitly in the particular
1843     element content model.
1844     (html:a checker): New checker.
1845     (html:details, html:datagrid): New checkers.
1846     (html:legend): New checker.
1847    
1848     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1849    
1850 wakaba 1.8 * ContentChecker.pm (html:li checker): Implemented.
1851    
1852     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1853    
1854 wakaba 1.7 * ContentChecker.pm ($HTMLInlineOrStriclyInlineChecker): New
1855     checker.
1856     (html:dd checker): New checker.
1857     (html:q, html:em, html:strong, html:small,
1858     html:m, html:dfn, html:code, html:samp, html:span): New checkers.
1859    
1860     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1861    
1862 wakaba 1.6 * ContentChecker.pm ($AnyChecker): Renamed
1863     from |$ElementDefault->{checker}|.
1864     ($ElementDefault->{checker}): Throw an error that
1865     the element type is not supported by the checker.
1866     ($HTMLMetadataElement): |html:base| was missing.
1867     ($HTMLEmptyChecker): Don't throw an error
1868     for inter-element whitespace nodes.
1869     (html:html checker): Errors were not
1870     thrown even if |html:head| and/or |html:body|
1871     children were missing.
1872     (html:head checker): An error was not
1873     thrown if <meta charset> appered after other
1874     elements.
1875    
1876 wakaba 1.5 2007-05-05 Wakaba <wakaba@suika.fam.cx>
1877    
1878     * ContentChecker.pm: |footer|, |video|, |audio|, |script|,
1879     and |noscript| elements are implemented.
1880     (new): New method.
1881    
1882 wakaba 1.3 2007-05-04 Wakaba <wakaba@suika.fam.cx>
1883    
1884 wakaba 1.4 * ContentChecker.pm: New module.
1885    
1886     2007-05-04 Wakaba <wakaba@suika.fam.cx>
1887    
1888 wakaba 1.3 * NanoDOM.pm (manakai_parent_element,
1889     document_element, manakai_local_name,
1890     manakai_element_type_match): New method.
1891    
1892     2007-05-03 Wakaba <wakaba@suika.fam.cx>
1893    
1894     * HTML.pm.src: Replace decimal and hexadecimal numeric
1895     entities in C1 range using Windows-1252 mapping. Bare LF
1896     did not count as new line for error reporting.
1897    
1898 wakaba 1.2 2007-05-02 Wakaba <wakaba@suika.fam.cx>
1899    
1900     * NanoDOM.pm (DOMImplementation): New class.
1901     (append_child): Weaken the |parent_node| reference.
1902     (create_element_ns, Element new): Set the |owner_document|
1903     reference.
1904     (implementation): New attribute.
1905     (owner_document, local_name, namespace_uri): New attributes.
1906    
1907     * HTML.pm.src (parse_string): Line and column numbers
1908     are now provided to error handler.
1909     (!!!parse-error): Short descriptions are added.
1910     (_construct_tree): Split into three methods; support
1911     for innerHTML mode.
1912     (set_inner_html): New method.
1913    
1914 wakaba 1.1 2007-05-01 Wakaba <wakaba@suika.fam.cx>
1915    
1916     * NanoDOM.html: Documentation is added.
1917    
1918     * HTML.pod, ContentType.html: Documentation is revised.
1919    
1920     * .cvsignore: Pod2html temporary files are added.
1921    
1922     * Makefile: Make |NanoDOM.html|.
1923    
1924     2007-05-01 Wakaba <wakaba@suika.fam.cx>
1925    
1926     * HTML.pm.src (parse_string): New method.
1927     (get_inner_html): Renamed from |inner_html|.
1928    
1929     * Makefile: A rule for |HTML.html| is added.
1930    
1931     * HTML.pod: New documentation.
1932    
1933     2007-05-01 Wakaba <wakaba@suika.fam.cx>
1934    
1935     * NanoDOM.pm (last_child, previous_sibling): New attributes.
1936     (clone_node): Attribute nodes were not completely copied.
1937    
1938     * HTML.pm.src: Many bugs are fixed.
1939    
1940     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1941    
1942     * HTML.pm.src: Some typos are fixed.
1943    
1944     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1945    
1946     * mkhtmlparser.pl, Makefile: References to the |HTML-consume-entity.src|
1947     are removed.
1948    
1949     * HTML.pm.src: Tokenizer's handling on named entities are rewritten.
1950    
1951     * HTML-consume-entity.src: Removed.
1952    
1953     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1954    
1955     * HTML.pm.src: Tokenizer's handling on hexadecimal
1956     numeric entities are rewritten.
1957    
1958     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1959    
1960     * HTML.pm.src: Some tokenizer bugs are fixed.
1961    
1962     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1963    
1964     * HTML.pm.src: The tree construction stage is implemented.
1965    
1966     * mkhtmlparser.pl: New macros are added.
1967    
1968     2007-04-28 Wakaba <wakaba@suika.fam.cx>
1969    
1970     * ContentType.pm: A note on bug in the specification
1971     is removed since it's been now fixed.
1972    
1973     * .cvsignore: New file.
1974    
1975     2007-04-28 Wakaba <wakaba@suika.fam.cx>
1976    
1977     * HTML.pm.src, HTML-consume-entity.src: New files.
1978    
1979     * Makefile (HTML.pm): New rule.
1980    
1981     * mkhtmlparser.pl: New script.
1982    
1983     2007-04-25 Wakaba <wakaba@suika.fam.cx>
1984    
1985     * Makefile: New file.
1986    
1987     2007-04-24 Wakaba <wakaba@suika.fam.cx>
1988    
1989     * ContentType.pm: An error in pod is fixed.
1990    
1991     2007-04-24 Wakaba <wakaba@suika.fam.cx>
1992    
1993     * ContentType.pm: Documentation is added.
1994    
1995     2007-04-24 Wakaba <wakaba@suika.fam.cx>
1996    
1997     * ContentType.pm: New Perl module.
1998    
1999     * ChangeLog: New file.
2000    
2001    

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24