/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Contents of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.311 - (hide annotations) (download)
Mon Sep 15 07:19:03 2008 UTC (16 years, 10 months ago) by wakaba
Branch: MAIN
Changes since 1.310: +8 -0 lines
++ whatpm/Whatpm/ChangeLog	15 Sep 2008 07:17:34 -0000
	* HTML.pm.src: Remove checking for control character, surrogate
	pair, or noncharacter code points and non-Unicode code
	points (they should be handled by Whatpm::Charset::UnicodeChecker).
	(parse_char_stream): Support for the |$get_wrapper| argument and
	character stream error handlers.

2008-09-15  Wakaba  <wakaba@suika.fam.cx>

++ whatpm/Whatpm/Charset/ChangeLog	15 Sep 2008 07:18:45 -0000
	* DecodeHandle.pm (onerror): Return |undef| if no explicit value
	is set.

	* UnicodeChecker.pm: Support for HTML5 parse errors.
	(onerror): Return |undef| if no explicit value is set.

2008-09-15  Wakaba  <wakaba@suika.fam.cx>

1 wakaba 1.310 2008-09-15 Wakaba <wakaba@suika.fam.cx>
2    
3 wakaba 1.311 * HTML.pm.src: Remove checking for control character, surrogate
4     pair, or noncharacter code points and non-Unicode code
5     points (they should be handled by Whatpm::Charset::UnicodeChecker).
6     (parse_char_stream): Support for the |$get_wrapper| argument and
7     character stream error handlers.
8    
9     2008-09-15 Wakaba <wakaba@suika.fam.cx>
10    
11 wakaba 1.310 * ContentChecker.pm: Don't call |loda_ns_module|
12     for null-namespace elements/attributes.
13    
14     * HTML.pm.src: Fact out $disallowed_control_chars
15     as a hash.
16    
17 wakaba 1.300 2008-09-14 Wakaba <wakaba@suika.fam.cx>
18    
19 wakaba 1.309 * HTML.pm.src: Regexp typo fixed. |{prev_char}|
20     and |{next_char}| initializations are moved to initialization
21     method. |{read_until}| now supports buffering. Sync |set_inner_html|
22     with |parse_char_stream|.
23    
24     2008-09-14 Wakaba <wakaba@suika.fam.cx>
25    
26 wakaba 1.308 * HTML.pm.src (parse_char_stream): Make |set_next_char|
27     invoke |manakai_read_until|, not only |read|, where
28     possible, to decrease the number of |read| method calls.
29    
30     * mkhtmlparser.pl: Related changes to the aforementioned
31     modification.
32    
33     2008-09-14 Wakaba <wakaba@suika.fam.cx>
34    
35 wakaba 1.307 * HTML.pm.src: Use |read| instead of |getc|. |set_inner_html|
36     would report character error from now.
37    
38     2008-09-14 Wakaba <wakaba@suika.fam.cx>
39    
40 wakaba 1.306 * HTML.pm.src: White-space-leaded non-white-space character
41     tokens in "before head insertion mode" was not
42     correctly handled.
43     (set_inner_html): Reimplemented using CharString decodehandle
44     class. Support for $get_wrapper argument. Support
45     for |{read_until}| feature.
46    
47     2008-09-14 Wakaba <wakaba@suika.fam.cx>
48    
49 wakaba 1.305 * HTML.pm.src: Make a "bare ero" error for unknown
50     entities point the "&" character.
51    
52     2008-09-14 Wakaba <wakaba@suika.fam.cx>
53    
54 wakaba 1.304 * HTML.pm.src: It turns out that U+FFFD don't have to
55     be added to the list of excluded characters.
56    
57     2008-09-14 Wakaba <wakaba@suika.fam.cx>
58    
59 wakaba 1.303 * HTML.pm.src ($char_onerror): Have character decoder's |line|
60     and |column| a higher priority than the one set by the
61     tokenizer's input handler.
62     ($self->{read_until}): Exclude U+FFFD (but this might
63     not be necessary, since now we do line/column fixup in
64     the character decode handle).
65    
66     2008-09-14 Wakaba <wakaba@suika.fam.cx>
67    
68 wakaba 1.302 * HTML.pm.src: Use |{read_until}| where possible.
69    
70     2008-09-14 Wakaba <wakaba@suika.fam.cx>
71    
72 wakaba 1.301 * HTML.pm.src: Change |{getc_until}| to |{read_until}|
73     and |manakai_getc_until| to |manakai_read_until| to
74     reduce the number of string copies.
75    
76     2008-09-14 Wakaba <wakaba@suika.fam.cx>
77    
78 wakaba 1.300 * HTML.pm.src (parse_char_string): Use newly created
79     |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
80     standard feature to |open| a string as a filehandle,
81     since Perl's string filehandle seems not supporting |ungetc|
82     method correctly.
83     (parse_char_stream): Define |{getc_until}| method.
84     (DATA_STATE): Experimental support for |getc_until| feature.
85    
86 wakaba 1.292 2008-09-13 Wakaba <wakaba@suika.fam.cx>
87    
88 wakaba 1.299 * HTML.pm.src: Check points added to newly added branches.
89    
90     2008-09-13 Wakaba <wakaba@suika.fam.cx>
91    
92 wakaba 1.298 * HTML.pm.src: Remove |{char}|, which is no longer used.
93     Remove |{entity_in_attr}| and |{last_attribute_value_state}|
94     and replaced by |{prev_state}|.
95    
96     * mkhtmlparser.pl: Remove |{char}| feature.
97     Remove |!!!back-next-input-character;| macro.
98    
99     2008-09-13 Wakaba <wakaba@suika.fam.cx>
100    
101 wakaba 1.297 * HTML.pm.src: Finally we get rid of all the inner loops. Remove
102     entity related tokenizer states in favor of new states
103     implementing the consume character reference algorithm.
104    
105     2008-09-13 Wakaba <wakaba@suika.fam.cx>
106    
107     * HTML.pm.src: "Consume a character reference" algorithm is
108 wakaba 1.296 now implemented as a tokenizer's state, rather than
109     a method, with minimum changes (more changes will
110     be made, in due course). "Bogus comment state"'s inner
111     loop gets removed.
112    
113     2008-09-13 Wakaba <wakaba@suika.fam.cx>
114    
115 wakaba 1.297 * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
116 wakaba 1.295 into their own tokenizer states.
117    
118     2008-09-13 Wakaba <wakaba@suika.fam.cx>
119    
120 wakaba 1.294 * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
121     is split into three states.
122    
123     2008-09-13 Wakaba <wakaba@suika.fam.cx>
124    
125 wakaba 1.293 * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
126     itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
127     no longer does the tokenizer have to push back next input
128     characters in those states.
129    
130     2008-09-13 Wakaba <wakaba@suika.fam.cx>
131    
132 wakaba 1.292 * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
133     into four states so that no longer does the tokenizer have to push
134     back next input characters in that state.
135    
136 wakaba 1.291 2008-09-11 Wakaba <wakaba@suika.fam.cx>
137    
138     * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
139     which can be used to insert some wrapper between the character
140     stream handle and the tokenizer. (It is currently not supported
141     for |set_inner_html| for |Element|s).
142    
143 wakaba 1.289 2008-09-10 Wakaba <wakaba@suika.fam.cx>
144    
145 wakaba 1.290 * HTML.pm.src: Ignore punctuations in charset names.
146    
147     2008-09-10 Wakaba <wakaba@suika.fam.cx>
148    
149 wakaba 1.289 * ContentChecker.pm: Support for charset-layer error levels.
150    
151     * HTML.pm.src: Don't specify |text| argument for the
152     |chardecode:fallback| error, since it is not the encoding
153     being used alternatively.
154    
155 wakaba 1.288 2008-09-06 Wakaba <wakaba@suika.fam.cx>
156    
157     * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
158    
159 wakaba 1.286 2008-08-31 Wakaba <wakaba@suika.fam.cx>
160    
161 wakaba 1.287 * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
162    
163     2008-08-31 Wakaba <wakaba@suika.fam.cx>
164    
165 wakaba 1.286 * HTML.pm.src: Bug fix and sync with the spec with regard
166     to after after frameset insertion mode processing (HTML5
167     revision 1909). Note that the implementation was wrong
168     per the old spec before the r1909 changes.
169    
170 wakaba 1.278 2008-08-30 Wakaba <wakaba@suika.fam.cx>
171    
172 wakaba 1.285 * HTMLTable.pm: scope=auto algorithm fix synced with the
173     spec (HTML5 revision 2093).
174     ($process_row): Algorithm step numbers synced with the
175     spec (HTML5 revision 2092).
176    
177     2008-08-30 Wakaba <wakaba@suika.fam.cx>
178    
179 wakaba 1.284 * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
180     revision 2094).
181    
182     2008-08-30 Wakaba <wakaba@suika.fam.cx>
183    
184     * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
185    
186     2008-08-30 Wakaba <wakaba@suika.fam.cx>
187    
188 wakaba 1.283 * HTML.pm.src: '"' and "'" at the end of attribute
189     name (after another attribute) now raise parse error (HTML5
190     revision 2123). Empty unquoted attribute values are no
191     longer allowed (HTML5 revision 2122).
192    
193     2008-08-30 Wakaba <wakaba@suika.fam.cx>
194    
195 wakaba 1.282 * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
196     revision 2130).
197    
198     2008-08-30 Wakaba <wakaba@suika.fam.cx>
199    
200 wakaba 1.281 * ContentChecker.pm: |xml:lang| attribute value must be same
201     as |lang| attribute value for HTML elements (HTML5 revision 2062
202     and so on).
203    
204     2008-08-30 Wakaba <wakaba@suika.fam.cx>
205    
206 wakaba 1.280 * ContentChecker.pm: Error level definition for |xml_id_error|
207     was missing.
208    
209     * URIChecker.pm: The end of the URL should be marked as the
210     error location for an empty path error. The position
211     between the userinfo and the port components should be
212     marked as the error location for an empty host error.
213    
214     2008-08-30 Wakaba <wakaba@suika.fam.cx>
215    
216 wakaba 1.279 * URIChecker.pm: Set parameters representing where in the
217     value the error occurs for errors. Report unknown
218     address format error in warning level, since address
219     formats are rarely added. Path segments starting with "/.."
220     were misinterpreted as a dot-segment.
221    
222     2008-08-30 Wakaba <wakaba@suika.fam.cx>
223    
224 wakaba 1.278 * URIChecker.pm (check_iri_reference): Requires
225     |Message::DOM::DOMImplementation|.
226    
227 wakaba 1.277 2008-08-29 Wakaba <wakaba@suika.fam.cx>
228    
229     * IMTChecker.pm: Updated for the new error reporting architecture.
230    
231     * ContentChecker.pm: Error levels for IMTs are added.
232    
233 wakaba 1.276 2008-08-17 Wakaba <wakaba@suika.fam.cx>
234    
235     * H2H.pm (_shift_token): Support for unquoted HTML attribute
236     values.
237    
238 wakaba 1.274 2008-08-16 Wakaba <wakaba@suika.fam.cx>
239    
240 wakaba 1.275 * CacheManifest.pm: Support for new style of error
241     reports.
242    
243     * HTML.pm.src: Set line=1, column=1 to the document node.
244    
245     2008-08-16 Wakaba <wakaba@suika.fam.cx>
246    
247 wakaba 1.274 * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
248     and URL checkers. Support for more error levels for bogus
249     langauge tag and URL "standards".
250    
251     * LangTag.pm, URIChecker.pm: Support for new style error
252     level reporting.
253    
254 wakaba 1.271 2008-08-15 Wakaba <wakaba@suika.fam.cx>
255    
256 wakaba 1.273 * ContentChecker.pm: Support for RDF/XML error levels.
257    
258     * HTMLTable.pm, RDFXML.pm: Support for new style of error level
259     specifying. Error types are revised.
260    
261     2008-08-15 Wakaba <wakaba@suika.fam.cx>
262    
263 wakaba 1.272 * ContentChecker.pm: All error reporting method calls are
264     renewed.
265    
266     2008-08-15 Wakaba <wakaba@suika.fam.cx>
267    
268 wakaba 1.271 * HTML.pm.src: All error type names and "text" parameters
269     are revised. Use new style for "level" specification.
270    
271     * mkhtmlparser.pl: Use new style for "level" specification.
272    
273 wakaba 1.268 2008-08-03 Wakaba <wakaba@suika.fam.cx>
274    
275 wakaba 1.270 * WebIDL.pm (parse_char_string): Simplified error
276     reporting process for broken ignored valuetype definition.
277     (Valuetype idl_text): Support for special "DOMString" name.
278    
279     2008-08-03 Wakaba <wakaba@suika.fam.cx>
280    
281 wakaba 1.269 * WebIDL.pm ($get_scoped_name): Append "::::" if the last
282     terminal of the ScopedName is "DOMString", such that whether
283     the last part of the scoped name is "DOMString" or "_DOMString"
284     later. It is necessary to determine whether a |typedef|
285     definition should be ignored or not.
286     (parse_char_string): Unescape the identifier of
287     exception members.
288     ($resolve): Return undef for builtin types and sequence<T>
289     types (we might not have to do this, however...).
290     (check): Support checking for Exceptions, Valuetypes,
291     and Typedefs.
292     ($serialize_type): Support for "DOMString::::" syntax.
293     (Typedef idl_text): Output Type as "DOMString" if it
294     is really "DOMString" (i.e. its internal representation
295     is "::DOMString::").
296    
297     2008-08-03 Wakaba <wakaba@suika.fam.cx>
298    
299 wakaba 1.268 * WebIDL.pm ($resolve): New code, based on resolve code
300     for constant types in the |check| method.
301     (check): Support for checking of attributes, operations, and
302     arguments.
303     (Attribute/Operation idl_text): Exception names in getraises,
304     setraises, and raises clauses is serizlied by |$serialize_type|
305     code.
306    
307 wakaba 1.264 2008-08-02 Wakaba <wakaba@suika.fam.cx>
308    
309 wakaba 1.267 * WebIDL.pm ($integer): Order of selections are changed to match
310     hexadecimal numbers (the original pattern, taken from the spec,
311     was not work for hexadecimal numbers, because the "0" prefix
312     matches to the [0-7]* part (as an empty string) and therefore
313     it does not match with remaining "x..." part of a "0x..." integer
314     literal.
315     ($get_type): It now returns a string, not an array reference,
316     for regular types and |sequence| types (i.e. it in any case
317     returns a string).
318     ($get_next_token): The second item in the array that represents
319     a integer or float token is now a Perl number value, not the
320     original string representation of the number.
321     (check): Support for const value consistency checking.
322     No extended attribute is defined for constants.
323     (Node subclasses): Use simple strings rather than array references
324     for default data type values.
325     ($serialize_type): Type values are now simple strings.
326     (value): If the new attribute value is a false value, then
327     a FALSE value is set to the attribute.
328    
329     2008-08-02 Wakaba <wakaba@suika.fam.cx>
330    
331 wakaba 1.266 * WebIDL.pm ($get_scoped_name): Now scoped names are stored
332     in its stringified format ("scoped name" as defined in the
333     spec). Note that future version of this module should not use
334     array references for type values and the |type_text| attribute
335     should be made obsolete.
336     (parse_char_string): Unescape attribute names.
337     (check): Support for checking of whether inherited interfaces
338     are actually defined or not. Support for checking of whether
339     interface member identifiers are duplicated or not.
340     ($serialize_type): Scoped names are returned as is. A future
341     version of this code should escape identifiers other than "DOMString",
342     otherwise the idl_text would be non-conforming.
343    
344     2008-08-02 Wakaba <wakaba@suika.fam.cx>
345    
346 wakaba 1.264 * WebIDL.pm (parse_char_string): Set line/column numbers
347 wakaba 1.265 to generated nodes. Unescape identifiers. Extended attributes
348     for Definition's were ignored.
349     (append_child): Set |parent_node| attribute.
350     (parent_node): New attribute.
351     (check): Support interface/exception members. Support
352     extended attributes. Support definition identifier uniqueness
353     constraint.
354     (qualified_name): New attribute.
355     (Interface/Exception idl_text): Extended attributes were
356     not prepended to the returned text.
357    
358     2008-08-02 Wakaba <wakaba@suika.fam.cx>
359    
360     * WebIDL.pm (parse_char_string): Set line/column numbers
361 wakaba 1.264 to interface object experimentally. s/shift/pop/g, shift
362     would make things wrong. Support for interface forward
363     declarations was missing. Broken interface declarations
364     with no block were not ignored entirely.
365     (Whatpm::WebIDL::Node): New abstract class. This class
366     makes things easier.
367     (child_nodes): New attribute. Unlike DOM's attribute with
368     same name, this attribute returns a dead list of nodes for
369     simplicity.
370     (get_user_data, set_user_data): New methods.
371     (Module idl_text): A SPACE character should be inserted
372     before the |{| character.
373     (Interface idl_text): Support for interface forward declarations.
374     (is_forward_declaration): New attribute.
375    
376 wakaba 1.259 2008-07-19 Wakaba <wakaba@suika.fam.cx>
377    
378 wakaba 1.263 * WebIDL.pm (type_text): Better serializer.
379    
380     2008-07-19 Wakaba <wakaba@suika.fam.cx>
381    
382 wakaba 1.262 * WebIDL.pm: Revise forward-compatible parsing so that
383     it now can handle broken extended attributes and as such.
384    
385     2008-07-19 Wakaba <wakaba@suika.fam.cx>
386    
387 wakaba 1.261 * WebIDL.pm: Real support for extended attributes.
388     Support for extended attributes with arguments.
389    
390     2008-07-19 Wakaba <wakaba@suika.fam.cx>
391    
392 wakaba 1.260 * WebIDL.pm: Support for |exception| syntax.
393     (Interface->idl_text): Tentative support for inheritances.
394    
395     2008-07-19 Wakaba <wakaba@suika.fam.cx>
396    
397 wakaba 1.259 * WebIDL.pm: Hierarchical scoped name support was broken.
398     Support for raises, setraises, and getraises syntaxes.
399    
400 wakaba 1.258 2008-07-18 Wakaba <wakaba@suika.fam.cx>
401    
402     * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
403     proper support for types, extended attributes, and exceptions yet).
404     WebIDL parser, version 1 (no support for exceptions yet,
405     no proper support for extended attributes yet).
406    
407 wakaba 1.257 2008-07-09 Wakaba <wakaba@suika.fam.cx>
408    
409     * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
410    
411 wakaba 1.255 2008-06-29 Wakaba <wakaba@suika.fam.cx>
412    
413 wakaba 1.256 * WebIDL.pm: Support for valuetype and const.
414    
415     2008-06-29 Wakaba <wakaba@suika.fam.cx>
416    
417 wakaba 1.255 * WebIDL.pm: New module.
418    
419 wakaba 1.254 2008-06-15 Wakaba <wakaba@suika.fam.cx>
420    
421     * Makefile (Entities.html): URI changed.
422    
423 wakaba 1.253 2008-06-08 Wakaba <wakaba@suika.fam.cx>
424    
425     * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
426    
427 wakaba 1.251 2008-06-01 Wakaba <wakaba@suika.fam.cx>
428    
429 wakaba 1.252 * HTML.pm.src (_get_next_token): A parse error was missing.
430    
431     2008-06-01 Wakaba <wakaba@suika.fam.cx>
432    
433 wakaba 1.251 * mklinktypelist.pl: rel=contact is no longer part of the HTML5
434     spec (commented out). (HTML5 revision 1711).
435    
436 wakaba 1.248 2008-05-25 Wakaba <wakaba@suika.fam.cx>
437    
438 wakaba 1.250 * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
439    
440     * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
441     as UTF-16 (HTML5 revision 1701).
442    
443     2008-05-25 Wakaba <wakaba@suika.fam.cx>
444    
445 wakaba 1.249 * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
446     1692).
447    
448     2008-05-25 Wakaba <wakaba@suika.fam.cx>
449    
450     * HTML.pm.src: The secondary insertion mode used when switching
451     to foreign content is the "in body" insertion mode (HTML5 revision
452     1696).
453    
454     2008-05-25 Wakaba <wakaba@suika.fam.cx>
455    
456 wakaba 1.248 * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
457     revision 1697).
458    
459 wakaba 1.239 2008-05-24 Wakaba <wakaba@suika.fam.cx>
460    
461 wakaba 1.247 * HTML.pm.src: Support for end-of-file token in foreign content
462     insertion mode (HTML5 revision 1693). Update SVG camelCase
463     attribute list (HTML5 revision 1700). <textarea> closes
464     </select> (HTML5 revision 1699). More start tags close in
465     foreign content insertion mode (HTML5 revision 1698).
466    
467     2008-05-24 Wakaba <wakaba@suika.fam.cx>
468    
469 wakaba 1.246 * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
470    
471     2008-05-24 Wakaba <wakaba@suika.fam.cx>
472    
473 wakaba 1.245 * HTML.pm.src: More robust charset parameter detection (HTML5
474     revision 1674).
475    
476     2008-05-24 Wakaba <wakaba@suika.fam.cx>
477    
478 wakaba 1.244 * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
479     revision 1676).
480    
481     2008-05-24 Wakaba <wakaba@suika.fam.cx>
482    
483 wakaba 1.243 * HTML.pm.src: Ignore language part of public identifiers for
484     quriks mode detection (HTML5 revision 1679).
485    
486     2008-05-24 Wakaba <wakaba@suika.fam.cx>
487    
488 wakaba 1.242 * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
489     revision 1685).
490    
491     2008-05-24 Wakaba <wakaba@suika.fam.cx>
492    
493 wakaba 1.241 * HTML.pm.src: Support for EOF in new states for tags (HTML5
494     revision 1684).
495    
496     2008-05-24 Wakaba <wakaba@suika.fam.cx>
497    
498 wakaba 1.240 * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
499     work (HTML5 revision 1690).
500    
501     2008-05-24 Wakaba <wakaba@suika.fam.cx>
502    
503 wakaba 1.239 * HTML.pm.src (_tree_construction_main): Change handling of
504     end tags in head insertion modes (HTML5 revision 1686).
505     (parse_char_string): Bug fix for non-utf8 character string handlings.
506     (parse_char_stream): |ungetc| does not work well for this context.
507    
508 wakaba 1.237 2008-05-18 Wakaba <wakaba@suika.fam.cx>
509    
510 wakaba 1.238 * HTML.pm.src (parse_byte_string): Redefined to invoke
511     |parse_byte_stream|.
512     (parse_byte_stream): New method.
513    
514     2008-05-18 Wakaba <wakaba@suika.fam.cx>
515    
516 wakaba 1.237 * HTML.pm.src (parse_byte_string): Fix the column number reported
517     by encoding layer error reporter.
518    
519 wakaba 1.233 2008-05-17 Wakaba <wakaba@suika.fam.cx>
520    
521 wakaba 1.236 * HTML.pm.src (parse_byte_string): Use streaming decoder
522     rather than converting the whole byte string and then parsing.
523     Propagate errors in character encoding layer.
524     (get_next_token): Precise error reporting for |bare stago| error.
525    
526     2008-05-17 Wakaba <wakaba@suika.fam.cx>
527    
528 wakaba 1.235 * HTML.pm.src (parse_char_stream): New method.
529     (parse_char_string): This method is now defined as an invocation
530     of the |parse_char_stream| method.
531    
532     2008-05-17 Wakaba <wakaba@suika.fam.cx>
533    
534 wakaba 1.234 * HTML.pm.src (parse_byte_string): Report various status
535     of the sniffing as info-level errors. Support for new
536     decoding framework in parser resestting.
537     (new): Various default error levels were not set.
538    
539     2008-05-17 Wakaba <wakaba@suika.fam.cx>
540    
541 wakaba 1.233 * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
542     algorithm, except for the actual sniffing, is implemented
543     with new framework with Message::Charset::Info.
544    
545 wakaba 1.232 2008-05-16 Wakaba <wakaba@suika.fam.cx>
546    
547     * CacheManifest.pm (_parse): Drop fragment identifiers from
548     URIs in fallback section (HTML5 revision 1596).
549    
550 wakaba 1.230 2008-05-10 Wakaba <wakaba@suika.fam.cx>
551    
552 wakaba 1.231 * Makefile (Entities.html): URI has changed.
553    
554     2008-05-10 Wakaba <wakaba@suika.fam.cx>
555    
556 wakaba 1.230 * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
557     1553).
558    
559 wakaba 1.228 2008-05-06 Wakaba <wakaba@suika.fam.cx>
560    
561 wakaba 1.229 * ContentChecker.pm: Noted that those returned in |table| are
562     no longer table elements, but table objects returned
563     by Whatpm::HTMLTable.
564    
565     * HTMLTable.pm (form_table): Return table element node
566     as |$table->{element}|.
567     (assign_header): Support for the |headers=""| attribute.
568    
569     2008-05-06 Wakaba <wakaba@suika.fam.cx>
570    
571 wakaba 1.228 * HTMLTable.pm (assign_header): New function; first version
572     with no support for headers="".
573     (form_table): Include table width and height to the returned
574     table object for covenience. Indexing in column assignement was wrong.
575     Set whether a data cell is empty or not for convenience.
576    
577 wakaba 1.222 2008-05-05 Wakaba <wakaba@suika.fam.cx>
578    
579 wakaba 1.227 * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
580    
581     2008-05-05 Wakaba <wakaba@suika.fam.cx>
582    
583 wakaba 1.226 * HTMLTable.pm: How table model errors are detected is
584     changed (HTML5 revision 1387).
585    
586     2008-05-05 Wakaba <wakaba@suika.fam.cx>
587    
588 wakaba 1.225 * HTMLTable.pm: The algorithm now moves |tfoot| elements
589     to the end of the table (HTML5 revision 1380).
590    
591     2008-05-05 Wakaba <wakaba@suika.fam.cx>
592    
593 wakaba 1.224 * HTMLTable.pm: The algorithm is now 0-based indexing, instead
594     of 1-based (HTML5 revision 1376).
595    
596     2008-05-05 Wakaba <wakaba@suika.fam.cx>
597    
598 wakaba 1.223 * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
599     and "Content-Encoding" no longer prevent sniffing (HTML5
600     revision 1288).
601    
602     2008-05-05 Wakaba <wakaba@suika.fam.cx>
603    
604 wakaba 1.222 * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
605     1282).
606    
607 wakaba 1.220 2008-05-03 Wakaba <wakaba@suika.fam.cx>
608    
609 wakaba 1.221 * ContentChecker.pm: Support for global attributes.
610     Status of XML specs are added.
611    
612     2008-05-03 Wakaba <wakaba@suika.fam.cx>
613    
614 wakaba 1.220 * ContentChecker.pm (check_element): Support for |template=""|
615     and |ref=""| attribute (referent element type checking).
616    
617 wakaba 1.218 2008-04-29 Wakaba <wakaba@suika.fam.cx>
618    
619 wakaba 1.219 * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
620     1500) is implemented (except for IDNA part and URI-scheme-specific
621 wakaba 1.218 knowledge). Line number counting was wrong for LF-only
622     documents.
623    
624 wakaba 1.211 2008-04-13 Wakaba <wakaba@suika.fam.cx>
625    
626 wakaba 1.217 * HTML.pm.src: Raise an parse error for any disallowed
627     character (HTML5 revision 1263).
628    
629     2008-04-13 Wakaba <wakaba@suika.fam.cx>
630    
631 wakaba 1.216 * mkentitylist.pl: Support for new HTML5 entity table format (the
632     definition for |AElig;| was missing).
633    
634     2008-04-13 Wakaba <wakaba@suika.fam.cx>
635    
636 wakaba 1.215 * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
637     name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
638    
639     2008-04-13 Wakaba <wakaba@suika.fam.cx>
640    
641 wakaba 1.214 * HTML.pm.src: List of element names that close foreign content
642     insertion mode is added (HTML5 revisions 1412 and 1418).
643    
644     2008-04-13 Wakaba <wakaba@suika.fam.cx>
645    
646 wakaba 1.213 * HTML.pm.src: Support for |mglyph| and |malignmark|
647     elements (HTML5 revision 1410).
648    
649     2008-04-13 Wakaba <wakaba@suika.fam.cx>
650    
651 wakaba 1.212 * HTML.pm.src: Support for new long MathML entities (HTML5
652     revision 1406).
653    
654     2008-04-13 Wakaba <wakaba@suika.fam.cx>
655    
656 wakaba 1.211 * HTML.pm.src: CDATA section support for MathML and SVG
657     elements (HTML5 revision 1404 and 1420).
658    
659 wakaba 1.209 2008-04-12 Wakaba <wakaba@suika.fam.cx>
660    
661 wakaba 1.210 * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
662     elements (HTML5 revision 1404). Unused !!!macro definitions
663     are removed.
664    
665     2008-04-12 Wakaba <wakaba@suika.fam.cx>
666    
667 wakaba 1.209 * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
668     are raised is changed (HTML5 revision 1404).
669    
670 wakaba 1.206 2008-04-06 Wakaba <wakaba@suika.fam.cx>
671    
672 wakaba 1.208 * HTML.pm.src: Category sets that are no longer used are removed.
673    
674     2008-04-06 Wakaba <wakaba@suika.fam.cx>
675    
676 wakaba 1.207 * HTML.pm.src: The ->[1] property of stack entries are now
677     replaced by constants representing element category.
678    
679     2008-04-06 Wakaba <wakaba@suika.fam.cx>
680    
681 wakaba 1.206 * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
682     for error reporting. (This is a preparation for using constant
683     value for ->[1].)
684    
685 wakaba 1.204 2008-03-22 Wakaba <wakaba@suika.fam.cx>
686    
687 wakaba 1.205 * RDFXML.pm: Typo fixed.
688    
689     2008-03-22 Wakaba <wakaba@suika.fam.cx>
690    
691 wakaba 1.204 * ContentChecker.pm: |fact_level| is now treated
692     as same as |must_level|, i.e. level = |m|.
693     (check_element): Make list of URIs in the DOM.
694    
695 wakaba 1.197 2008-03-21 Wakaba <wakaba@suika.fam.cx>
696    
697 wakaba 1.203 * RDFXML.pm: Language accessor implemented.
698     Local (null-namespace) attribute support.
699    
700     2008-03-21 Wakaba <wakaba@suika.fam.cx>
701    
702 wakaba 1.202 * RDFXML.pm: Factored out ID checking code.
703    
704     2008-03-21 Wakaba <wakaba@suika.fam.cx>
705    
706 wakaba 1.201 * RDFXML.pm: TODO items noted. Validation of ID
707     and URI attributes is implemented. Warn if unknown
708     value is used in rdf:parseType="" attribute.
709    
710     * URIChecker.pm (check_rdf_uri_reference): New function.
711    
712     2008-03-21 Wakaba <wakaba@suika.fam.cx>
713    
714 wakaba 1.200 * RDFXML.pm: bnodeid implemented. Relative references
715     are now resolved.
716    
717     2008-03-21 Wakaba <wakaba@suika.fam.cx>
718    
719 wakaba 1.199 * ContentChecker.pm: RDF reification implemented.
720    
721     * RDFXML.pm: undef vs false bug fixed.
722     Reification implemented.
723    
724     2008-03-21 Wakaba <wakaba@suika.fam.cx>
725    
726 wakaba 1.198 * RDFXML.pm: s/id/ID/ for attribute name.
727     The |node| arguments are added for |ontriple| calls.
728     Too many "attribute not allowed" errors were raised.
729    
730     * ContentChecker.pm: Initial experimental support for rdf:RDF
731     element.
732    
733     2008-03-21 Wakaba <wakaba@suika.fam.cx>
734    
735 wakaba 1.197 * RDFXML.pm: New module.
736    
737 wakaba 1.192 2008-03-20 Wakaba <wakaba@suika.fam.cx>
738    
739 wakaba 1.196 * HTML.pm.src (set_inner_html): Line/column number
740     code was old one yet.
741    
742     2008-03-20 Wakaba <wakaba@suika.fam.cx>
743    
744 wakaba 1.195 * HTML.pm.src: Better line/column reporting for "duplicate attribute"
745     errors. Line/column markings for DOCTYPE, comment, and
746     character tokens are reintroduced; otherwise,
747     error location for "not HTML5" error and errors
748     for implied elements are not attached.
749    
750     2008-03-20 Wakaba <wakaba@suika.fam.cx>
751    
752 wakaba 1.194 * HTML.pm.src: Set line/column numbers to attributes.
753    
754     * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
755     (value): Setter implemented.
756    
757     * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
758    
759     2008-03-20 Wakaba <wakaba@suika.fam.cx>
760    
761 wakaba 1.193 * HTML.pm.src: Unused line/column markings are removed.
762    
763     2008-03-20 Wakaba <wakaba@suika.fam.cx>
764    
765 wakaba 1.192 * HTML.pm.src (_get_next_token): Remove |first_start_tag|
766     flag, which is no longer used.
767    
768 wakaba 1.190 2008-03-17 Wakaba <wakaba@suika.fam.cx>
769    
770 wakaba 1.191 * HTML.pm.src: Set line/column information to element nodes.
771    
772     * mkhtmlparser.pl (!!!create-element, !!!insert-element,
773     and !!!insert-element-t): Set line/column information to
774     element nodes.
775    
776     2008-03-17 Wakaba <wakaba@suika.fam.cx>
777    
778 wakaba 1.190 * HTML.pm.src (_get_next_token): The first "<" character
779     in "<?", "<>", or "</>" should be the error point.
780    
781 wakaba 1.187 2008-03-16 Wakaba <wakaba@suika.fam.cx>
782    
783 wakaba 1.189 * HTML.pm.src: Some more fixes on error position reporting.
784    
785     2008-03-16 Wakaba <wakaba@suika.fam.cx>
786    
787 wakaba 1.188 * HTML.pm.src: Token-level precious error reporting.
788    
789     2008-03-16 Wakaba <wakaba@suika.fam.cx>
790    
791 wakaba 1.187 * HTML.pm.src: Preparation for more precious error point
792     reporting.
793    
794 wakaba 1.184 2008-03-11 Wakaba <wakaba@suika.fam.cx>
795    
796 wakaba 1.186 * HTML.pm.src: Error type revised.
797    
798     2008-03-11 Wakaba <wakaba@suika.fam.cx>
799    
800 wakaba 1.185 * HTML.pm.src: Similar codes are merged together, again.
801    
802     2008-03-11 Wakaba <wakaba@suika.fam.cx>
803    
804 wakaba 1.184 * HTML.pm.src: Similar codes are merged together.
805    
806 wakaba 1.183 2008-03-10 Wakaba <wakaba@suika.fam.cx>
807    
808     * mkhtmlparser.pl: Set "level" parameter to parse errors.
809    
810     * HTML.pm.src: Code refined.
811    
812 wakaba 1.175 2008-03-09 Wakaba <wakaba@suika.fam.cx>
813    
814 wakaba 1.182 * HTML.pm.src: |</body>| treatement has been changed (HTML5
815     revision 1348). Note that I really don't know this makes
816     any difference in the black-box behavior of the parser.
817    
818     2008-03-09 Wakaba <wakaba@suika.fam.cx>
819    
820 wakaba 1.181 * HTML.pm.src: New end-of-file token implementation (HTML5
821     revision 1348).
822    
823     2008-03-09 Wakaba <wakaba@suika.fam.cx>
824    
825 wakaba 1.180 * HTML.pm.src: |applet| support (HTML5 revision 1347).
826    
827     2008-03-09 Wakaba <wakaba@suika.fam.cx>
828    
829 wakaba 1.179 * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
830    
831     2008-03-09 Wakaba <wakaba@suika.fam.cx>
832    
833 wakaba 1.178 * HTML.pm.src: Support for |<input>| in the "in select" insertion
834     mode and support for the "in select in table" insertion mode (HTML5
835     revision 1342).
836    
837     2008-03-09 Wakaba <wakaba@suika.fam.cx>
838    
839 wakaba 1.177 * HTML.pm.src: No foster parenting for <script> and <script>
840     in non-tainted <table>s (HTML5 revision 1336).
841    
842     2008-03-09 Wakaba <wakaba@suika.fam.cx>
843    
844 wakaba 1.176 * HTML.pm.src: Ignore white space characters between <html>
845     and <head> (HTML5 revision 1332).
846    
847     2008-03-09 Wakaba <wakaba@suika.fam.cx>
848    
849 wakaba 1.175 * HTML.pm.src: Treat <input type=hidden> as if it were a
850     white space (HTML5 revision 1331).
851    
852 wakaba 1.164 2008-03-08 Wakaba <wakaba@suika.fam.cx>
853    
854 wakaba 1.174 * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
855     element (HTML5 revision 1330).
856    
857     2008-03-08 Wakaba <wakaba@suika.fam.cx>
858    
859 wakaba 1.173 * HTML.pm.src: <title> is always appended to the current
860     element (HTML5 revision 1328).
861    
862     2008-03-08 Wakaba <wakaba@suika.fam.cx>
863    
864 wakaba 1.172 * HTML.pm.src: White space in tainted tables are moved
865     into foster parents (HTML5 revision 1326).
866    
867     2008-03-08 Wakaba <wakaba@suika.fam.cx>
868    
869 wakaba 1.171 * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
870     revision 1321).
871    
872     2008-03-08 Wakaba <wakaba@suika.fam.cx>
873    
874 wakaba 1.170 * HTML.pm.src: |</h/n/>| case code rearranged to align with
875     the spec (HTML5 revision 1320). Note that we finally complete
876     all of HTML5 revision 1320 changes.
877    
878     2008-03-08 Wakaba <wakaba@suika.fam.cx>
879    
880 wakaba 1.169 * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
881     tags (HTML5 revision 1320).
882    
883     2008-03-08 Wakaba <wakaba@suika.fam.cx>
884    
885 wakaba 1.168 * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
886    
887     2008-03-08 Wakaba <wakaba@suika.fam.cx>
888    
889 wakaba 1.167 * HTML.pm.src: A "generate implied end tags" code (t409.1)
890     could not be reached so that it is now removed (HTML5 revision
891     1320).
892    
893     2008-03-08 Wakaba <wakaba@suika.fam.cx>
894    
895 wakaba 1.166 * HTML.pm.src: Code for the case of |</div>| and so on are revised
896     to align with new spec text (HTML5 revision 1320).
897    
898     2008-03-08 Wakaba <wakaba@suika.fam.cx>
899    
900 wakaba 1.165 * HTML.pm.src: Remove strange |if| condition; however, it should
901     have had no harm in theory.
902    
903     2008-03-08 Wakaba <wakaba@suika.fam.cx>
904    
905 wakaba 1.164 * HTML.pm.src (_tree_construction_main): '</p>' in body
906     case is split from other end tags for the preparation
907     of implementing HTML revision 1320.
908    
909 wakaba 1.162 2008-03-07 Wakaba <wakaba@suika.fam.cx>
910    
911 wakaba 1.163 * HTML.pm.src: Simplified "generate implied end tag" (HTML5
912     revision 1320).
913    
914     2008-03-07 Wakaba <wakaba@suika.fam.cx>
915    
916 wakaba 1.162 * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
917     and "div" (HTML5 revision 1318). Add comments to where
918     |form| pointer association codes should be inserted (HTML5 revision
919     1319).
920    
921 wakaba 1.161 2008-03-06 Wakaba <wakaba@suika.fam.cx>
922    
923     * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
924    
925 wakaba 1.159 2008-03-05 Wakaba <wakaba@suika.fam.cx>
926    
927 wakaba 1.160 * HTML.pm.src: Since the case t268 should never be reached (no
928     other token type, there are), it is replaced by a |die| statement.
929    
930     2008-03-05 Wakaba <wakaba@suika.fam.cx>
931    
932 wakaba 1.159 * HTML.pm.src: Typo fixed.
933    
934 wakaba 1.158 2008-03-04 Wakaba <wakaba@suika.fam.cx>
935    
936     * HTML.pm.src (_tree_construction_initial): Some limited quirks
937     doctypes were not uppercased for comparison.
938    
939 wakaba 1.152 2008-03-03 Wakaba <wakaba@suika.fam.cx>
940    
941 wakaba 1.157 * HTML.pm.src (tree construction and set_inner_html): Checkpoints
942     are added.
943    
944     2008-03-03 Wakaba <wakaba@suika.fam.cx>
945    
946 wakaba 1.156 * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
947     are set. Cases that are unlikely reached are noted as so.
948    
949     2008-03-03 Wakaba <wakaba@suika.fam.cx>
950    
951 wakaba 1.155 * HTML.pm.src: Checkpoints for debugging are added.
952    
953     * mkhtmlparser.pl: Support for |!!!cp| syntax.
954    
955     2008-03-03 Wakaba <wakaba@suika.fam.cx>
956    
957 wakaba 1.154 * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
958     for simplicity.
959    
960     2008-03-03 Wakaba <wakaba@suika.fam.cx>
961    
962 wakaba 1.153 * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
963     revision 1307).
964    
965     2008-03-03 Wakaba <wakaba@suika.fam.cx>
966    
967 wakaba 1.152 * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
968     is set are changed (HTML5 revision 1305).
969    
970 wakaba 1.149 2008-03-02 Wakaba <wakaba@suika.fam.cx>
971    
972 wakaba 1.151 * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
973     1292). Entities are not parsed in comment-like part in RCDATA
974     elements (HTML5 revision 1294). Allow bare & at the end
975     of attribute value literals (HTML5 revision 1296). More
976     quirks mode doctypes (HTML5 revision 1302). Requires spaces
977     between attributes and ban attribute names or unquoted
978     attribute values containing single or double quotes (HTML5
979     revision 1303).
980    
981     2008-03-02 Wakaba <wakaba@suika.fam.cx>
982    
983 wakaba 1.150 * ContentChecker.pm: Typo fixed. Don't raise "character encoding"
984     and related errors unless it is an HTML document (though the spec
985     is unclear on whether it is applied to XHTML document).
986    
987     * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
988     are added.
989    
990     2008-03-02 Wakaba <wakaba@suika.fam.cx>
991    
992 wakaba 1.149 * HTML.pm.src: s/local_name/manakai_local_name/g.
993    
994 wakaba 1.148 2008-03-01 Wakaba <wakaba@suika.fam.cx>
995    
996     * _NamedEntityList.pm: Updated (HTML5 revision 1286).
997    
998     * HTML.pm.src: |charset| in |content| attribute is
999     case-insensitive (HTML5 revision 1270).
1000    
1001 wakaba 1.147 2008-02-26 Wakaba <wakaba@suika.fam.cx>
1002    
1003     * ContentChecker.pm: New status constants are added.
1004     ($ElementDefault): |status| added.
1005     (check_element): Err for non-standard or deprecated elements.
1006     (_attr_status_info): For non-standard or deprecated attributes.
1007    
1008 wakaba 1.142 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1009    
1010 wakaba 1.146 * ContentChecker.pm (_attr_status_info): New internal method.
1011    
1012     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1013    
1014 wakaba 1.145 * ContentChecker.pm (check_element): Element standardized
1015     status information is now dispatched.
1016    
1017     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1018    
1019 wakaba 1.144 * ContentChecker.pm (check_element): Fix |del|-and-significant
1020     problem by adding some more arguments.
1021    
1022     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1023    
1024 wakaba 1.143 * ContentChecker.pm (check_element): Use context of
1025     container-for-the-purpose-of-content-model element (not transparent
1026     element) for |check_child_element| calling and significant
1027     text flag marking. This reintroduces |<del>|-and-significant
1028     problem again.
1029    
1030     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1031    
1032 wakaba 1.142 * ContentChecker.pm (check_element): Make semi-transparent
1033     elements ignored for the purpose of phase changes in
1034     content model checking.
1035    
1036 wakaba 1.138 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1037    
1038 wakaba 1.141 * ContentChecker.pm (check_element): In-element state
1039     was not properly managed for transparent cases.
1040    
1041     2008-02-23 Wakaba <wakaba@suika.fam.cx>
1042    
1043 wakaba 1.140 * ContentChecker.pm (check_element): Support for |video|
1044     and |audio| as semi-transparent elements.
1045    
1046     2008-02-23 Wakaba <wakaba@suika.fam.cx>
1047    
1048 wakaba 1.139 * ContentChecker.pm ($HTMLSemiTransparentElements): New.
1049     (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
1050     Support for |html:object| as a semi-transparent element.
1051    
1052     2008-02-23 Wakaba <wakaba@suika.fam.cx>
1053    
1054 wakaba 1.138 * ContentChecker.pm (check_element): The way to traverse
1055     the tree is entirely revised to make it easier to track
1056     the state of ancestors/descendants. As a result of this
1057     revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1058     support for content model checking for HTML elements |figure|,
1059     |object|, |video|, and |audio| and checking for XML elements (and
1060     some XMLNS checkings) are dropped for now. They will be
1061     reimplemented in due cource.
1062    
1063 wakaba 1.134 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1064    
1065 wakaba 1.137 * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1066     token (HTML5 revision 1225).
1067    
1068     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1069    
1070 wakaba 1.136 * ContentChecker.pm ({unsupported_level}): New value.
1071    
1072     * HTML.pm.src: Save whether |meta| |content| attribute
1073     contains character references or not.
1074    
1075     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1076    
1077 wakaba 1.135 * ContentChecker.pm (_get_children): (Incompleted) attempt to
1078     imlement significant content checking for contents
1079     with |del| elements.
1080    
1081     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1082    
1083 wakaba 1.134 * ContenteChecker.pm ($HTMLTransparentElements): More
1084     elements are added.
1085     (_get_children): HTML |object| elements are now semi-transparent.
1086    
1087     * NanoDOM.pm (manakai_html, manakai_head): New methods.
1088    
1089 wakaba 1.133 2008-02-16 Wakaba <wakaba@suika.fam.cx>
1090    
1091     * CacheManifest.pm: HTML5 revision 1211 implemented.
1092    
1093     * CacheManifest.pod: Updated.
1094    
1095 wakaba 1.132 2008-02-10 Wakaba <wakaba@suika.fam.cx>
1096    
1097     * ContentChecker.pm (check_document, check_element): Support
1098     for second argument ($onsubdoc).
1099     (_get_css_parser): Removed (now it is part of WDCC).
1100    
1101 wakaba 1.131 2008-02-09 Wakaba <wakaba@suika.fam.cx>
1102    
1103     * ContentChecker.pm (_get_css_parser): New.
1104    
1105 wakaba 1.129 2007-11-25 Wakaba <wakaba@suika.fam.cx>
1106    
1107 wakaba 1.130 * ContentChecker.pm ($AnyChecker): Old way to add child elements
1108     for checking had been used.
1109    
1110     2007-11-25 Wakaba <wakaba@suika.fam.cx>
1111    
1112 wakaba 1.129 * ContentChecker.pm (check_element): New todo item type |descendant|.
1113    
1114 wakaba 1.125 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1115    
1116 wakaba 1.128 * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1117     specced in RFC 2046 and RFC 4288.
1118     (application/atom+xml): Definition added.
1119    
1120     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1121    
1122 wakaba 1.127 * URIChecker.pm: Make RFC 3986 should-level errors
1123     warnings (rather than SHOULD-level errors).
1124    
1125     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1126    
1127 wakaba 1.126 * NanoDOM.pm (get_user_data, set_user_data): New methods.
1128    
1129     * HTML.pm.src: A flag for character references in attribute
1130     values are added. Set |manakai_has_reference| user data
1131     to |charset| attribute.
1132    
1133     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1134    
1135 wakaba 1.125 * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1136     attributes.
1137    
1138     * ContentChecker.pm (check_document): Warn if charset requirements
1139     cannot be tested.
1140    
1141 wakaba 1.124 2007-11-19 Wakaba <wakaba@suika.fam.cx>
1142    
1143     * HTML.pm.src (parse_byte_string): Detect charset
1144     by universalchardet if charset parameter is not specified.
1145    
1146     * Makefile (Charset-all, Charset-clean): New rules.
1147    
1148 wakaba 1.120 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1149    
1150 wakaba 1.123 * ContentChecker.pm (check_document): Check the existence
1151     of character encoding declaration and selection of encoding
1152     for HTML document.
1153    
1154     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1155    
1156 wakaba 1.122 * ContentType.pm (get_sniffed_type): Return also the official
1157     type in list context.
1158    
1159     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1160    
1161 wakaba 1.121 * ContentType.pm: Sniffing with leading white space
1162     ignoring (HTML5 revisions 1013 and 1016).
1163    
1164     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1165    
1166 wakaba 1.120 * ContentType.pm: HTML5 revision 1013 changes, except for leading
1167     white spaces, are implemented.
1168    
1169 wakaba 1.117 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1170    
1171 wakaba 1.119 * HTML.pm.src (parse_char_string): Set |inner_encoding|
1172     attribute if possible.
1173    
1174     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1175    
1176 wakaba 1.118 * HTML.pm.src (parse_byte_string): New method.
1177     (parse_char_string): New alias for |parse_string|.
1178     (main phase): Invoking "change the encoding" algorithm if desired.
1179    
1180     * HTML.pod: Updated.
1181    
1182     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1183    
1184 wakaba 1.117 * HTML.pod (get_inner_html): Removed.
1185    
1186     * Makefile (HTML-all, HTML-clean): New.
1187    
1188     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1189    
1190     * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1191    
1192 wakaba 1.116 2007-11-08 Wakaba <wakaba@suika.fam.cx>
1193    
1194     * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1195    
1196 wakaba 1.112 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1197    
1198 wakaba 1.115 * Makefile: |CacheManifest.html| is added.
1199    
1200     * CacheManifest.pod: New file.
1201    
1202     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1203    
1204 wakaba 1.114 * CacheManifest.pm: New module.
1205    
1206     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1207    
1208 wakaba 1.113 * HTML.pm.src: Support for application cache selection algorithm
1209     callback.
1210    
1211     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1212    
1213 wakaba 1.112 * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1214    
1215 wakaba 1.110 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1216    
1217 wakaba 1.111 * Makefile (clean): New rule.
1218    
1219     * NanoDOM.pm (public_id, system_id): New attributes.
1220    
1221     2007-10-17 Wakaba <wakaba@suika.fam.cx>
1222    
1223 wakaba 1.110 * Makefile (CSS-all, CSS-clean, clean): New rules.
1224    
1225 wakaba 1.109 2007-10-14 Wakaba <wakaba@suika.fam.cx>
1226    
1227     * ContentChecker.pm (check_document): Support for
1228     new |is_xml_root| flag.
1229     (check_element): Support for new |pluses| state.
1230     (_add_pluses): New method.
1231     (_remove_minuses): Support for new |minus| item.
1232    
1233 wakaba 1.106 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1234    
1235 wakaba 1.107 * ContentChecker.pm: Raise specific error for invalid
1236     root element.
1237    
1238     2007-09-24 Wakaba <wakaba@suika.fam.cx>
1239    
1240 wakaba 1.106 * ContentChecker.pm: Set level values for later uses.
1241    
1242 wakaba 1.103 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1243    
1244 wakaba 1.105 * ContentChecker.pm: Support for language tag validation.
1245    
1246     2007-09-09 Wakaba <wakaba@suika.fam.cx>
1247    
1248 wakaba 1.104 * LangTag.pm (check_rfc3066_language_tag): New method.
1249    
1250     2007-09-09 Wakaba <wakaba@suika.fam.cx>
1251    
1252 wakaba 1.103 * LangTag.pm: New module.
1253    
1254 wakaba 1.102 2007-09-04 Wakaba <wakaba@suika.fam.cx>
1255    
1256     * HTML.pm.src: Some error types were wrong.
1257    
1258 wakaba 1.100 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1259    
1260 wakaba 1.101 * CSS/: New directory.
1261    
1262     2007-08-17 Wakaba <wakaba@suika.fam.cx>
1263    
1264 wakaba 1.100 * ContentChecker.pm (_check_get_children): Support
1265     for |noscript| in |head|.
1266    
1267 wakaba 1.99 2007-08-12 Wakaba <wakaba@suika.fam.cx>
1268    
1269     * URI/: New directory.
1270    
1271 wakaba 1.94 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1272    
1273 wakaba 1.98 * HTML.pm.src: Tokenizer's states are now represented in
1274     number.
1275    
1276     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1277    
1278 wakaba 1.97 * HTML.pm.src: |or|s for insertion modes are replaced
1279     by |&|s.
1280    
1281     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1282    
1283 wakaba 1.96 * HTML.pm.src: Token types are now represented in number.
1284    
1285     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1286    
1287 wakaba 1.95 * ContentType.pm (SEE ALSO): Updated.
1288    
1289     * HTML.pm.src: Insertion modes are now represented in number.
1290    
1291     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1292    
1293 wakaba 1.94 * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1294     999) is implemented.
1295    
1296 wakaba 1.93 2007-08-08 Wakaba <wakaba@suika.fam.cx>
1297    
1298     * ContentType.pm: Sniffing for |<script| (HTML5 revision
1299     983) is implemented.
1300    
1301 wakaba 1.92 2007-08-06 Wakaba <wakaba@suika.fam.cx>
1302    
1303     * ContentChecker.pod: New documentation.
1304    
1305     * Makefile: A rule for |ContentChecker.html| is added.
1306    
1307     * ContentChecker.pm: A pod "LICENSE" section is added.
1308    
1309     * NanoDOM.pm ($VERSION): New variable.
1310    
1311 wakaba 1.90 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1312    
1313 wakaba 1.91 * H2H.pm: |b|, |i|, and |sub| are added to the
1314     list of allowed HTML elements.
1315    
1316     2007-08-05 Wakaba <wakaba@suika.fam.cx>
1317    
1318 wakaba 1.90 * H2H.pm: |samp| is added to the list of allowed
1319     HTML elements.
1320    
1321     * URIChecker.pm (check_iri): New.
1322     (check_iri_reference): Error type for IRI reference
1323     syntax error is changed.
1324    
1325 wakaba 1.87 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1326    
1327 wakaba 1.89 * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1328     is added.
1329     (check_document): Load appropriate module before validation.
1330    
1331     2007-08-04 Wakaba <wakaba@suika.fam.cx>
1332    
1333 wakaba 1.88 * ContentChecker/: New directory.
1334    
1335     2007-08-04 Wakaba <wakaba@suika.fam.cx>
1336    
1337 wakaba 1.87 * ContentChecker.pm: HTML |time| element is implemented.
1338    
1339     * HTMLTable.pm: Comments are updated as HTML5 is revised.
1340    
1341 wakaba 1.71 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1342    
1343 wakaba 1.86 * ContentChecker.pm (check_document): Return value
1344     even if no document element is found.
1345    
1346     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1347    
1348 wakaba 1.85 * HTML.pm.src: |$in_body| is no longer a function.
1349    
1350     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1351    
1352 wakaba 1.84 * HTML.pm.src: The |$in_body| code has been moved down.
1353    
1354     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1355    
1356 wakaba 1.83 * HTML.pm.src: The "trailing end" insertion mode
1357     is split into "after html body" and "after html frameset"
1358     insertion modes. Their codes are merged with "after body"
1359     and "after frameset" codes. |$previous_insertion_mode|
1360     has been removed. "after frameset" code is
1361     merged with "in frameset" code.
1362    
1363     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1364    
1365 wakaba 1.82 * HTML.pm.src: The "before head" insertion mode is
1366     merged with the "in head" insertion mode.
1367    
1368     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1369    
1370 wakaba 1.81 * HTML.pm.src: Cases in "in head" insertion mode are
1371     reorganized.
1372    
1373     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1374    
1375 wakaba 1.80 * HTML.pm.src: Some cases in "in table" insertion mode
1376     are merged.
1377    
1378     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1379    
1380 wakaba 1.79 * HTML.pm.src: The "in row" insertion mode is merged
1381     with "in table" insertion mode.
1382    
1383     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1384    
1385 wakaba 1.78 * HTML.pm.src: The "in table" and "in table body" insertion
1386     modes are merged.
1387    
1388     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1389    
1390 wakaba 1.77 * HTML.pm.src: There is no "in table head" or "in table foot"
1391     insertion mode!
1392    
1393     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1394    
1395 wakaba 1.76 * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
1396     now directly invoke the handler.
1397    
1398     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1399    
1400 wakaba 1.75 * HTML.pm.src: Codes for "in cell" insertion mode
1401     is merged to the "in body" insertion mode code.
1402    
1403     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1404    
1405 wakaba 1.74 * HTML.pm.src: Codes for "in body" and "in caption"
1406     insertion modes are merged.
1407    
1408     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1409    
1410 wakaba 1.73 * HTML.pm.src: Two |!!!next-token|s were missing.
1411    
1412     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1413    
1414 wakaba 1.72 * HTML.pm.src: Use numeric constant for |{content_mode}|
1415     instead of string constant for |{content_model_flag}|.
1416    
1417     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1418    
1419 wakaba 1.71 * HTML.pm.src: Add the name of the attribute
1420     to the "duplicate attribute" error.
1421    
1422 wakaba 1.69 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1423    
1424 wakaba 1.70 * ContentChecker.pm: Return the |class| node list.
1425    
1426     2007-07-17 Wakaba <wakaba@suika.fam.cx>
1427    
1428 wakaba 1.69 * ContentChecker.pm: Return the |id| node list.
1429    
1430     * HTML.pm.src: A typo is fixed.
1431    
1432 wakaba 1.63 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1433    
1434 wakaba 1.68 * ContentChecker.pm: Drop wrong |level => 'error'| specification
1435     from "in HTML:xml:lang" error. Character position
1436     is now the last part of the error type in the URI error
1437     description. Report "unsupported" status for language
1438     tags, media queries, script codes, and style sheets.
1439    
1440     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1441    
1442 wakaba 1.67 * ContentChecker.pm: Report error if |xml:lang|
1443     in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
1444     in XML.
1445    
1446     * NanoDOM.pm (Attr.owner_document): New attribute.
1447    
1448     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1449    
1450 wakaba 1.66 * HTML.pm.src: The character immediately following
1451     a bare |hcro| was discarded. Fix handling of
1452     entity references in attribute values.
1453    
1454     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1455    
1456 wakaba 1.65 * HTML.pm.src (main and trailin end phases): Token
1457     types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
1458     factored out. Error types |in html:#DOCTYPE|
1459     and |after html:#DOCTYPE| are merged
1460     into |DOCTYPE in the middle|. |</frameset>|
1461     in fragment parsing mode changed the
1462     insertion mode.
1463    
1464     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1465    
1466 wakaba 1.64 * HTML.pm.src: |$phase| has been removed; The |trailing end|
1467     phase is now an insertion mode. Treatments for white
1468     space character tokens were incorrect for some
1469     insertion modes. An old |meta| case was not removed.
1470    
1471     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1472    
1473 wakaba 1.63 * HTML.pm.src: |meta| charset declaration extraction
1474     implemented (but changing the encoding is not yet:-).
1475    
1476 wakaba 1.60 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1477    
1478 wakaba 1.62 * Charset/: New directory.
1479    
1480     2007-07-15 Wakaba <wakaba@suika.fam.cx>
1481    
1482 wakaba 1.61 * H2H.pm: New Perl module (created from
1483     manakai's H2H.dis).
1484    
1485     2007-07-15 Wakaba <wakaba@suika.fam.cx>
1486    
1487 wakaba 1.60 * XMLSerializer.pm: New Perl module (created from
1488     manakai's SimpleLS.dis).
1489    
1490 wakaba 1.59 2007-07-07 Wakaba <wakaba@suika.fam.cx>
1491    
1492     * HTML.pm.src: |<!---x-->| was not processed correctly.
1493    
1494 wakaba 1.57 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1495    
1496 wakaba 1.58 * HTML.pm.src: Report correct error message
1497     for |<body></div></body>|.
1498    
1499     2007-07-01 Wakaba <wakaba@suika.fam.cx>
1500    
1501 wakaba 1.57 * HTMLTable.pm: An error description was incorrect.
1502    
1503     2007-06-30 Wakaba <wakaba@suika.fam.cx>
1504    
1505     * ContentChecker.pm: Return |{term}| list.
1506    
1507 wakaba 1.55 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1508    
1509 wakaba 1.56 * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
1510     nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
1511     and <title> outside of head).
1512    
1513     2007-06-30 Wakaba <wakaba@suika.fam.cx>
1514    
1515 wakaba 1.55 * IMTChecker.pm: Report warning for unregistered
1516     and private types/subtypes.
1517    
1518     * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
1519     URIChecker.pm, HTMLTable.pm: Error messages are now
1520     consistent; they are all listed in
1521     <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
1522    
1523 wakaba 1.52 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1524    
1525 wakaba 1.54 * ContentChecker.pm: |<img ismap>| not in |<a></a>|
1526     is now erred. |<datalist>| is implemented.
1527     Attribute checker for |<command>| and |<menu>| are
1528     added. Support for |contextmenu| global attribute
1529     is added.
1530    
1531     2007-06-25 Wakaba <wakaba@suika.fam.cx>
1532    
1533 wakaba 1.53 * HTML.pm.src (_reset_insertion_mode): Interpretation
1534     of Step 3 has been changed.
1535    
1536     2007-06-25 Wakaba <wakaba@suika.fam.cx>
1537    
1538 wakaba 1.52 * HTML.pm.src: Late |<html>| parse error is implemented.
1539    
1540 wakaba 1.47 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1541    
1542 wakaba 1.51 * URIChecker.pm (check_iri_reference): A |decode| method name was
1543     incorrect.
1544    
1545     * ContentChecker.pm: Support for the |footer| element.
1546     Check URI syntax for space-separated URI attributes.
1547     Support for the |tabindex| attribute. Support
1548     for |datetime| attribute.
1549    
1550     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1551    
1552 wakaba 1.50 * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
1553     character references). HTML5 revision 1.146 (white space
1554     characters before root start tag). HTML5 revision
1555     1.148 (named character references in attribute values).
1556     HTML5 revision 1.152 (<plaintext>.innerHTML get).
1557    
1558     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1559    
1560     * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
1561 wakaba 1.49 in <head>).
1562    
1563     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1564    
1565 wakaba 1.48 * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>
1566     in body).
1567    
1568     * ContentChecker.pm: HTML5 revision 938 (scoped="").
1569    
1570     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1571    
1572 wakaba 1.47 * HTML.pm.src: HTML5 revision 923 (matching end tag in CDATA
1573     or RCDATA in fragment parsing mode). HTML5 revision
1574     924 (<!--> and <!--->). HTML5 revision 926 (hn in hn).
1575    
1576     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1577    
1578     * HTML.pm.src (get_inner_html): HTML5 revision 922 (inner_html
1579     for <pre> and <textarea>).
1580    
1581 wakaba 1.33 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1582    
1583 wakaba 1.46 * HTML.pm.src: HTML5 revision 920 (<isindex>).
1584    
1585     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1586    
1587 wakaba 1.45 * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).
1588    
1589     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1590    
1591 wakaba 1.44 * HTML.pm.src: HTML5 revision 916 (</body>).
1592     HTML5 revision 917 (conforming bare &).
1593    
1594     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1595    
1596 wakaba 1.43 * NanoDOM.pm (manakai_is_html): Setting to false did not work.
1597    
1598     * HTML.pm.src: HTML5 revision 914 (</ in CDATA, RCDATA).
1599     HTML5 revision 915 (<nobr>).
1600    
1601     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1602    
1603 wakaba 1.42 * HTML.pm.src: HTML5 revisions 908, 909, 912, and 913 (quirks mode).
1604    
1605     * NanoDOM.pm (manakai_is_html, manakai_compat_mode, compat_mode):
1606     New attributes.
1607    
1608     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1609    
1610 wakaba 1.41 * HTML.pm.src: HTML5 revisions 900, 901, 902, and 911 (<
1611     in tags).
1612    
1613     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1614    
1615 wakaba 1.40 * .cvsignore: |Entities.html| is added.
1616    
1617     * HTML.pm.src: |$entity_char| is removed and
1618     requires |Whatpm::_NamedEntityList| instead.
1619     HTML5 revision 898 (refc), except that lack of refc
1620     is parse error.
1621    
1622     * mkentitylist.pl: New script.
1623    
1624     * Makefile (all): |_NamedEntityList.pm| is added.
1625     (_NamedEntityList.pm, Entities.html): New rules.
1626    
1627     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1628    
1629 wakaba 1.39 * HTML.pm.src: Parse errors immediately after U+000D
1630     were ignored and U+000D immediately following another
1631     U+000D was not converted to U+000A.
1632    
1633     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1634    
1635 wakaba 1.38 * HTML.pm.src (set_inner_html): HTML5 revision 892 (adopt
1636     nodes before appended). Parser was not ready for NULL
1637     parse error and escape flag.
1638    
1639     * NanoDOM.pm (adopt_node): New.
1640    
1641     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1642    
1643 wakaba 1.37 * HTML.pm.src: HTML5 revision 886 (insane comment in
1644     CDATA and RCDATA). Note that current implementation
1645     is simply repeating what the spec says and it is maybe not
1646     a best way to do it.
1647    
1648     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1649    
1650 wakaba 1.36 * HTML.pm.src: HTML5 revision 884 (</form> don't close
1651     the form element if a descendant element without implied end tag has
1652     still been open).
1653    
1654     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1655    
1656 wakaba 1.35 * ContentChecker.pm: HTML5 revision 881 (Make |id| attribute with
1657     space characters non-conforming).
1658    
1659     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1660    
1661 wakaba 1.34 * HTML.pm.src: An error message was incorrect.
1662     HTML5 revision 869 (C1 character references).
1663    
1664     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1665    
1666 wakaba 1.33 * HTML.pm.src: HTML5 revision 867 (a LF at the beginning of
1667     a |textarea| is removed).
1668    
1669 wakaba 1.32 2007-06-05 Wakaba <wakaba@suika.fam.cx>
1670    
1671     * NanoDOM.pm (get_attribute_node_ns): New method.
1672    
1673     * ContentChecker.pm: |script| |async| and |defer|
1674     no longer require |src|. |async| MUST NOT be
1675     specified if |defer|. (HTML5 revision 858).
1676    
1677 wakaba 1.31 2007-05-30 Wakaba <wakaba@suika.fam.cx>
1678    
1679     * HTML.pm.src: |<form><form>| went to inifinite loop.
1680    
1681 wakaba 1.27 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1682    
1683 wakaba 1.30 * ContentChecker.pm (html): Set |is_root| (allowed
1684     as a document element) flag on.
1685     (new): Removed.
1686     (check_document): New method.
1687    
1688     2007-05-27 Wakaba <wakaba@suika.fam.cx>
1689    
1690 wakaba 1.29 * ContentChecker.pm (thead, tfoot): Checker specifications
1691     were incorrect.
1692    
1693     2007-05-27 Wakaba <wakaba@suika.fam.cx>
1694    
1695 wakaba 1.28 * ContentChecker.pm ($HTMLURIAttrChecker): Include
1696     error position in the |type| option of the error.
1697    
1698     * HTMLTable.pm (form_table): The |$onerror| parameter
1699     is now optional. Some bugs are fixed.
1700    
1701     2007-05-27 Wakaba <wakaba@suika.fam.cx>
1702    
1703 wakaba 1.27 * HTMLTable.pm: New module.
1704    
1705     * ContentChecker.pm (table): Invoke table model error checker.
1706    
1707     * NanoDOM.pm (first_child, get_attribute_ns): New.
1708    
1709 wakaba 1.25 2007-05-26 Wakaba <wakaba@suika.fam.cx>
1710    
1711 wakaba 1.26 * ContentChecker.pm ($HTMLLinkTypesAttrChecker): New checker.
1712     (link/@rel, a/@rel, area/@rel): Use new checker.
1713    
1714     * Makefile (_LinkTypeList.pm, RelExtensions.html): New rules.
1715    
1716     * _LinkTypeList.pm: New file.
1717    
1718     * mklinktypelist.pl: New file.
1719    
1720     * .cvsignore: |RelExtensions.html| added.
1721    
1722     * NanoDOM.pm (child_nodes): Returns an empty array
1723     for non-child-containing node types.
1724     (text_content): New attribute.
1725    
1726     2007-05-26 Wakaba <wakaba@suika.fam.cx>
1727    
1728 wakaba 1.25 * IMTChecker.pm: New module.
1729    
1730     * ContentChecker.pm ($HTMLIMTAttrChecker): Call IMTChecker
1731     to test parameter value validity.
1732    
1733     * HTML.pm.src ($style_start_tag): Attributes were
1734     discarded.
1735    
1736 wakaba 1.22 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1737    
1738 wakaba 1.24 * ContentChecker.pm ($HTMLURIAttrChecker): Implemented.
1739    
1740     2007-05-25 Wakaba <wakaba@suika.fam.cx>
1741    
1742 wakaba 1.23 * URIChecker.pm: All recommendations from RFC 3986
1743     and RFC 3987 are listed (not all testable items are checked yet).
1744    
1745     2007-05-25 Wakaba <wakaba@suika.fam.cx>
1746    
1747 wakaba 1.22 * URIChecker.pm: New module.
1748    
1749 wakaba 1.18 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1750    
1751 wakaba 1.21 * ContentChecker.pm: Now most attributes are implemented
1752     or associated to some placeholder.
1753     ($ElementDefault): Warn unknown attributes
1754     for unknown elements as "attribute not supported".
1755     ($HTMLLanguageTagAttrChecker, $HTMLMQAttrChecker): New placeholders.
1756     ($HTMLUsemapAttrChecker, $HTMLTargetAttrChecker): New checkers.
1757     (|a| attribute checker): Reimplemented.
1758    
1759     2007-05-20 Wakaba <wakaba@suika.fam.cx>
1760    
1761 wakaba 1.20 * ContentChecker.pm ($HTMLEventHandlerAttrChecker): New placeholder.
1762     ($HTMLAttrChecker): Event handler content attributes
1763     are added.
1764     (link, embed): Required attribute is now checked.
1765     (embed): Unknown local attributes are no longer warned.
1766    
1767     2007-05-20 Wakaba <wakaba@suika.fam.cx>
1768    
1769 wakaba 1.19 * ContentChecker.pm ($HTMLSpaceURIsAttrChecker): New placeholder.
1770     ($HTMLIMTAttrChecker): New checker.
1771     (link@rel, link@href, link@type, style@type,
1772     a@href, a@ping, a@ping, a@type, embed@src, embed@type,
1773     object@data, object@type, source@src, source@type, area@alt,
1774     area@shape, area@coords, area@href,
1775     area@ping, area@rel, area@type, script@src,
1776     script@defer, script@async, script@type): Checkers added.
1777    
1778     2007-05-20 Wakaba <wakaba@suika.fam.cx>
1779    
1780 wakaba 1.18 * ContentChecker.pm: Descendant checking was incorrect.
1781    
1782 wakaba 1.12 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1783    
1784 wakaba 1.17 * ContentChecker.pm: Support |xml:*| and |xmlns:*|
1785     attributes. Report an error if |Element.prefix|
1786     is |xmlns|.
1787    
1788     * NanoDOM.pm (prefix): New attribute.
1789    
1790     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1791    
1792 wakaba 1.16 * HTML.pm.src: In |main| phase, |in body| insertion
1793     mode, action for |<iframe>| was missing.
1794    
1795     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1796    
1797 wakaba 1.15 * ContentChecker.pm: Support for many of HTML5 elements.
1798     ($GetHTMLNonNegativeIntegerAttrChecker): New.
1799    
1800     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1801    
1802 wakaba 1.14 * ContentChecker.pm: Support for most elements up to |progress|.
1803     ($HTMLURIAttrChecker): Placeholder.
1804     ($HTMLIntegerAttrChecker, $GetHTMLFloatingPointNumberAttrChecker):
1805     New.
1806    
1807     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1808    
1809 wakaba 1.13 * ContentChecker.pm: Attribute checkers
1810     for global attributes, |html|, |base|, |style|, and |meta|.
1811    
1812     * NanoDOM.pm (insert_before): Weaken reference
1813     to the parent node.
1814     (Attr::new): Set |owner_element| attribute.
1815     (namespace_uri, manakai_local_name): New attribute implementations.
1816     (owner_element): New attribute.
1817    
1818     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1819    
1820 wakaba 1.12 * ContentChecker.pm ($AttrChecker, $HTMLAttrChecker,
1821     $AnyChecker->{attr_checker}, $HTMLAttrsChecker,
1822     $Element->{$HTML_NS}->{''}): New.
1823     (check_element): Invoke attrs_checker for each element.
1824    
1825 wakaba 1.6 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1826    
1827 wakaba 1.11 * ContentChecker.pm: Don't use |manakai_element_type_match|.
1828    
1829     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1830    
1831 wakaba 1.10 * ContentChecker.pm: Use hashs rather than lists for
1832     element type testings.
1833    
1834     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1835    
1836 wakaba 1.9 * ContentChecker.pm: Don't generate duplicate
1837     error when an element type is put in the "minus" list
1838     and the element type is not allowed explicitly in the particular
1839     element content model.
1840     (html:a checker): New checker.
1841     (html:details, html:datagrid): New checkers.
1842     (html:legend): New checker.
1843    
1844     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1845    
1846 wakaba 1.8 * ContentChecker.pm (html:li checker): Implemented.
1847    
1848     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1849    
1850 wakaba 1.7 * ContentChecker.pm ($HTMLInlineOrStriclyInlineChecker): New
1851     checker.
1852     (html:dd checker): New checker.
1853     (html:q, html:em, html:strong, html:small,
1854     html:m, html:dfn, html:code, html:samp, html:span): New checkers.
1855    
1856     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1857    
1858 wakaba 1.6 * ContentChecker.pm ($AnyChecker): Renamed
1859     from |$ElementDefault->{checker}|.
1860     ($ElementDefault->{checker}): Throw an error that
1861     the element type is not supported by the checker.
1862     ($HTMLMetadataElement): |html:base| was missing.
1863     ($HTMLEmptyChecker): Don't throw an error
1864     for inter-element whitespace nodes.
1865     (html:html checker): Errors were not
1866     thrown even if |html:head| and/or |html:body|
1867     children were missing.
1868     (html:head checker): An error was not
1869     thrown if <meta charset> appered after other
1870     elements.
1871    
1872 wakaba 1.5 2007-05-05 Wakaba <wakaba@suika.fam.cx>
1873    
1874     * ContentChecker.pm: |footer|, |video|, |audio|, |script|,
1875     and |noscript| elements are implemented.
1876     (new): New method.
1877    
1878 wakaba 1.3 2007-05-04 Wakaba <wakaba@suika.fam.cx>
1879    
1880 wakaba 1.4 * ContentChecker.pm: New module.
1881    
1882     2007-05-04 Wakaba <wakaba@suika.fam.cx>
1883    
1884 wakaba 1.3 * NanoDOM.pm (manakai_parent_element,
1885     document_element, manakai_local_name,
1886     manakai_element_type_match): New method.
1887    
1888     2007-05-03 Wakaba <wakaba@suika.fam.cx>
1889    
1890     * HTML.pm.src: Replace decimal and hexadecimal numeric
1891     entities in C1 range using Windows-1252 mapping. Bare LF
1892     did not count as new line for error reporting.
1893    
1894 wakaba 1.2 2007-05-02 Wakaba <wakaba@suika.fam.cx>
1895    
1896     * NanoDOM.pm (DOMImplementation): New class.
1897     (append_child): Weaken the |parent_node| reference.
1898     (create_element_ns, Element new): Set the |owner_document|
1899     reference.
1900     (implementation): New attribute.
1901     (owner_document, local_name, namespace_uri): New attributes.
1902    
1903     * HTML.pm.src (parse_string): Line and column numbers
1904     are now provided to error handler.
1905     (!!!parse-error): Short descriptions are added.
1906     (_construct_tree): Split into three methods; support
1907     for innerHTML mode.
1908     (set_inner_html): New method.
1909    
1910 wakaba 1.1 2007-05-01 Wakaba <wakaba@suika.fam.cx>
1911    
1912     * NanoDOM.html: Documentation is added.
1913    
1914     * HTML.pod, ContentType.html: Documentation is revised.
1915    
1916     * .cvsignore: Pod2html temporary files are added.
1917    
1918     * Makefile: Make |NanoDOM.html|.
1919    
1920     2007-05-01 Wakaba <wakaba@suika.fam.cx>
1921    
1922     * HTML.pm.src (parse_string): New method.
1923     (get_inner_html): Renamed from |inner_html|.
1924    
1925     * Makefile: A rule for |HTML.html| is added.
1926    
1927     * HTML.pod: New documentation.
1928    
1929     2007-05-01 Wakaba <wakaba@suika.fam.cx>
1930    
1931     * NanoDOM.pm (last_child, previous_sibling): New attributes.
1932     (clone_node): Attribute nodes were not completely copied.
1933    
1934     * HTML.pm.src: Many bugs are fixed.
1935    
1936     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1937    
1938     * HTML.pm.src: Some typos are fixed.
1939    
1940     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1941    
1942     * mkhtmlparser.pl, Makefile: References to the |HTML-consume-entity.src|
1943     are removed.
1944    
1945     * HTML.pm.src: Tokenizer's handling on named entities are rewritten.
1946    
1947     * HTML-consume-entity.src: Removed.
1948    
1949     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1950    
1951     * HTML.pm.src: Tokenizer's handling on hexadecimal
1952     numeric entities are rewritten.
1953    
1954     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1955    
1956     * HTML.pm.src: Some tokenizer bugs are fixed.
1957    
1958     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1959    
1960     * HTML.pm.src: The tree construction stage is implemented.
1961    
1962     * mkhtmlparser.pl: New macros are added.
1963    
1964     2007-04-28 Wakaba <wakaba@suika.fam.cx>
1965    
1966     * ContentType.pm: A note on bug in the specification
1967     is removed since it's been now fixed.
1968    
1969     * .cvsignore: New file.
1970    
1971     2007-04-28 Wakaba <wakaba@suika.fam.cx>
1972    
1973     * HTML.pm.src, HTML-consume-entity.src: New files.
1974    
1975     * Makefile (HTML.pm): New rule.
1976    
1977     * mkhtmlparser.pl: New script.
1978    
1979     2007-04-25 Wakaba <wakaba@suika.fam.cx>
1980    
1981     * Makefile: New file.
1982    
1983     2007-04-24 Wakaba <wakaba@suika.fam.cx>
1984    
1985     * ContentType.pm: An error in pod is fixed.
1986    
1987     2007-04-24 Wakaba <wakaba@suika.fam.cx>
1988    
1989     * ContentType.pm: Documentation is added.
1990    
1991     2007-04-24 Wakaba <wakaba@suika.fam.cx>
1992    
1993     * ContentType.pm: New Perl module.
1994    
1995     * ChangeLog: New file.
1996    
1997    

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24