/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Contents of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.306 - (hide annotations) (download)
Sun Sep 14 09:05:54 2008 UTC (16 years, 7 months ago) by wakaba
Branch: MAIN
Changes since 1.305: +9 -0 lines
++ whatpm/Whatpm/ChangeLog	14 Sep 2008 09:05:44 -0000
	* HTML.pm.src: White-space-leaded non-white-space character
	tokens in "before head insertion mode" was not
	correctly handled.
	(set_inner_html): Reimplemented using CharString decodehandle
	class.  Support for $get_wrapper argument.  Support
	for |{read_until}| feature.

2008-09-14  Wakaba  <wakaba@suika.fam.cx>

1 wakaba 1.300 2008-09-14 Wakaba <wakaba@suika.fam.cx>
2    
3 wakaba 1.306 * HTML.pm.src: White-space-leaded non-white-space character
4     tokens in "before head insertion mode" was not
5     correctly handled.
6     (set_inner_html): Reimplemented using CharString decodehandle
7     class. Support for $get_wrapper argument. Support
8     for |{read_until}| feature.
9    
10     2008-09-14 Wakaba <wakaba@suika.fam.cx>
11    
12 wakaba 1.305 * HTML.pm.src: Make a "bare ero" error for unknown
13     entities point the "&" character.
14    
15     2008-09-14 Wakaba <wakaba@suika.fam.cx>
16    
17 wakaba 1.304 * HTML.pm.src: It turns out that U+FFFD don't have to
18     be added to the list of excluded characters.
19    
20     2008-09-14 Wakaba <wakaba@suika.fam.cx>
21    
22 wakaba 1.303 * HTML.pm.src ($char_onerror): Have character decoder's |line|
23     and |column| a higher priority than the one set by the
24     tokenizer's input handler.
25     ($self->{read_until}): Exclude U+FFFD (but this might
26     not be necessary, since now we do line/column fixup in
27     the character decode handle).
28    
29     2008-09-14 Wakaba <wakaba@suika.fam.cx>
30    
31 wakaba 1.302 * HTML.pm.src: Use |{read_until}| where possible.
32    
33     2008-09-14 Wakaba <wakaba@suika.fam.cx>
34    
35 wakaba 1.301 * HTML.pm.src: Change |{getc_until}| to |{read_until}|
36     and |manakai_getc_until| to |manakai_read_until| to
37     reduce the number of string copies.
38    
39     2008-09-14 Wakaba <wakaba@suika.fam.cx>
40    
41 wakaba 1.300 * HTML.pm.src (parse_char_string): Use newly created
42     |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
43     standard feature to |open| a string as a filehandle,
44     since Perl's string filehandle seems not supporting |ungetc|
45     method correctly.
46     (parse_char_stream): Define |{getc_until}| method.
47     (DATA_STATE): Experimental support for |getc_until| feature.
48    
49 wakaba 1.292 2008-09-13 Wakaba <wakaba@suika.fam.cx>
50    
51 wakaba 1.299 * HTML.pm.src: Check points added to newly added branches.
52    
53     2008-09-13 Wakaba <wakaba@suika.fam.cx>
54    
55 wakaba 1.298 * HTML.pm.src: Remove |{char}|, which is no longer used.
56     Remove |{entity_in_attr}| and |{last_attribute_value_state}|
57     and replaced by |{prev_state}|.
58    
59     * mkhtmlparser.pl: Remove |{char}| feature.
60     Remove |!!!back-next-input-character;| macro.
61    
62     2008-09-13 Wakaba <wakaba@suika.fam.cx>
63    
64 wakaba 1.297 * HTML.pm.src: Finally we get rid of all the inner loops. Remove
65     entity related tokenizer states in favor of new states
66     implementing the consume character reference algorithm.
67    
68     2008-09-13 Wakaba <wakaba@suika.fam.cx>
69    
70     * HTML.pm.src: "Consume a character reference" algorithm is
71 wakaba 1.296 now implemented as a tokenizer's state, rather than
72     a method, with minimum changes (more changes will
73     be made, in due course). "Bogus comment state"'s inner
74     loop gets removed.
75    
76     2008-09-13 Wakaba <wakaba@suika.fam.cx>
77    
78 wakaba 1.297 * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
79 wakaba 1.295 into their own tokenizer states.
80    
81     2008-09-13 Wakaba <wakaba@suika.fam.cx>
82    
83 wakaba 1.294 * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
84     is split into three states.
85    
86     2008-09-13 Wakaba <wakaba@suika.fam.cx>
87    
88 wakaba 1.293 * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
89     itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
90     no longer does the tokenizer have to push back next input
91     characters in those states.
92    
93     2008-09-13 Wakaba <wakaba@suika.fam.cx>
94    
95 wakaba 1.292 * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
96     into four states so that no longer does the tokenizer have to push
97     back next input characters in that state.
98    
99 wakaba 1.291 2008-09-11 Wakaba <wakaba@suika.fam.cx>
100    
101     * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
102     which can be used to insert some wrapper between the character
103     stream handle and the tokenizer. (It is currently not supported
104     for |set_inner_html| for |Element|s).
105    
106 wakaba 1.289 2008-09-10 Wakaba <wakaba@suika.fam.cx>
107    
108 wakaba 1.290 * HTML.pm.src: Ignore punctuations in charset names.
109    
110     2008-09-10 Wakaba <wakaba@suika.fam.cx>
111    
112 wakaba 1.289 * ContentChecker.pm: Support for charset-layer error levels.
113    
114     * HTML.pm.src: Don't specify |text| argument for the
115     |chardecode:fallback| error, since it is not the encoding
116     being used alternatively.
117    
118 wakaba 1.288 2008-09-06 Wakaba <wakaba@suika.fam.cx>
119    
120     * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
121    
122 wakaba 1.286 2008-08-31 Wakaba <wakaba@suika.fam.cx>
123    
124 wakaba 1.287 * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
125    
126     2008-08-31 Wakaba <wakaba@suika.fam.cx>
127    
128 wakaba 1.286 * HTML.pm.src: Bug fix and sync with the spec with regard
129     to after after frameset insertion mode processing (HTML5
130     revision 1909). Note that the implementation was wrong
131     per the old spec before the r1909 changes.
132    
133 wakaba 1.278 2008-08-30 Wakaba <wakaba@suika.fam.cx>
134    
135 wakaba 1.285 * HTMLTable.pm: scope=auto algorithm fix synced with the
136     spec (HTML5 revision 2093).
137     ($process_row): Algorithm step numbers synced with the
138     spec (HTML5 revision 2092).
139    
140     2008-08-30 Wakaba <wakaba@suika.fam.cx>
141    
142 wakaba 1.284 * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
143     revision 2094).
144    
145     2008-08-30 Wakaba <wakaba@suika.fam.cx>
146    
147     * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
148    
149     2008-08-30 Wakaba <wakaba@suika.fam.cx>
150    
151 wakaba 1.283 * HTML.pm.src: '"' and "'" at the end of attribute
152     name (after another attribute) now raise parse error (HTML5
153     revision 2123). Empty unquoted attribute values are no
154     longer allowed (HTML5 revision 2122).
155    
156     2008-08-30 Wakaba <wakaba@suika.fam.cx>
157    
158 wakaba 1.282 * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
159     revision 2130).
160    
161     2008-08-30 Wakaba <wakaba@suika.fam.cx>
162    
163 wakaba 1.281 * ContentChecker.pm: |xml:lang| attribute value must be same
164     as |lang| attribute value for HTML elements (HTML5 revision 2062
165     and so on).
166    
167     2008-08-30 Wakaba <wakaba@suika.fam.cx>
168    
169 wakaba 1.280 * ContentChecker.pm: Error level definition for |xml_id_error|
170     was missing.
171    
172     * URIChecker.pm: The end of the URL should be marked as the
173     error location for an empty path error. The position
174     between the userinfo and the port components should be
175     marked as the error location for an empty host error.
176    
177     2008-08-30 Wakaba <wakaba@suika.fam.cx>
178    
179 wakaba 1.279 * URIChecker.pm: Set parameters representing where in the
180     value the error occurs for errors. Report unknown
181     address format error in warning level, since address
182     formats are rarely added. Path segments starting with "/.."
183     were misinterpreted as a dot-segment.
184    
185     2008-08-30 Wakaba <wakaba@suika.fam.cx>
186    
187 wakaba 1.278 * URIChecker.pm (check_iri_reference): Requires
188     |Message::DOM::DOMImplementation|.
189    
190 wakaba 1.277 2008-08-29 Wakaba <wakaba@suika.fam.cx>
191    
192     * IMTChecker.pm: Updated for the new error reporting architecture.
193    
194     * ContentChecker.pm: Error levels for IMTs are added.
195    
196 wakaba 1.276 2008-08-17 Wakaba <wakaba@suika.fam.cx>
197    
198     * H2H.pm (_shift_token): Support for unquoted HTML attribute
199     values.
200    
201 wakaba 1.274 2008-08-16 Wakaba <wakaba@suika.fam.cx>
202    
203 wakaba 1.275 * CacheManifest.pm: Support for new style of error
204     reports.
205    
206     * HTML.pm.src: Set line=1, column=1 to the document node.
207    
208     2008-08-16 Wakaba <wakaba@suika.fam.cx>
209    
210 wakaba 1.274 * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
211     and URL checkers. Support for more error levels for bogus
212     langauge tag and URL "standards".
213    
214     * LangTag.pm, URIChecker.pm: Support for new style error
215     level reporting.
216    
217 wakaba 1.271 2008-08-15 Wakaba <wakaba@suika.fam.cx>
218    
219 wakaba 1.273 * ContentChecker.pm: Support for RDF/XML error levels.
220    
221     * HTMLTable.pm, RDFXML.pm: Support for new style of error level
222     specifying. Error types are revised.
223    
224     2008-08-15 Wakaba <wakaba@suika.fam.cx>
225    
226 wakaba 1.272 * ContentChecker.pm: All error reporting method calls are
227     renewed.
228    
229     2008-08-15 Wakaba <wakaba@suika.fam.cx>
230    
231 wakaba 1.271 * HTML.pm.src: All error type names and "text" parameters
232     are revised. Use new style for "level" specification.
233    
234     * mkhtmlparser.pl: Use new style for "level" specification.
235    
236 wakaba 1.268 2008-08-03 Wakaba <wakaba@suika.fam.cx>
237    
238 wakaba 1.270 * WebIDL.pm (parse_char_string): Simplified error
239     reporting process for broken ignored valuetype definition.
240     (Valuetype idl_text): Support for special "DOMString" name.
241    
242     2008-08-03 Wakaba <wakaba@suika.fam.cx>
243    
244 wakaba 1.269 * WebIDL.pm ($get_scoped_name): Append "::::" if the last
245     terminal of the ScopedName is "DOMString", such that whether
246     the last part of the scoped name is "DOMString" or "_DOMString"
247     later. It is necessary to determine whether a |typedef|
248     definition should be ignored or not.
249     (parse_char_string): Unescape the identifier of
250     exception members.
251     ($resolve): Return undef for builtin types and sequence<T>
252     types (we might not have to do this, however...).
253     (check): Support checking for Exceptions, Valuetypes,
254     and Typedefs.
255     ($serialize_type): Support for "DOMString::::" syntax.
256     (Typedef idl_text): Output Type as "DOMString" if it
257     is really "DOMString" (i.e. its internal representation
258     is "::DOMString::").
259    
260     2008-08-03 Wakaba <wakaba@suika.fam.cx>
261    
262 wakaba 1.268 * WebIDL.pm ($resolve): New code, based on resolve code
263     for constant types in the |check| method.
264     (check): Support for checking of attributes, operations, and
265     arguments.
266     (Attribute/Operation idl_text): Exception names in getraises,
267     setraises, and raises clauses is serizlied by |$serialize_type|
268     code.
269    
270 wakaba 1.264 2008-08-02 Wakaba <wakaba@suika.fam.cx>
271    
272 wakaba 1.267 * WebIDL.pm ($integer): Order of selections are changed to match
273     hexadecimal numbers (the original pattern, taken from the spec,
274     was not work for hexadecimal numbers, because the "0" prefix
275     matches to the [0-7]* part (as an empty string) and therefore
276     it does not match with remaining "x..." part of a "0x..." integer
277     literal.
278     ($get_type): It now returns a string, not an array reference,
279     for regular types and |sequence| types (i.e. it in any case
280     returns a string).
281     ($get_next_token): The second item in the array that represents
282     a integer or float token is now a Perl number value, not the
283     original string representation of the number.
284     (check): Support for const value consistency checking.
285     No extended attribute is defined for constants.
286     (Node subclasses): Use simple strings rather than array references
287     for default data type values.
288     ($serialize_type): Type values are now simple strings.
289     (value): If the new attribute value is a false value, then
290     a FALSE value is set to the attribute.
291    
292     2008-08-02 Wakaba <wakaba@suika.fam.cx>
293    
294 wakaba 1.266 * WebIDL.pm ($get_scoped_name): Now scoped names are stored
295     in its stringified format ("scoped name" as defined in the
296     spec). Note that future version of this module should not use
297     array references for type values and the |type_text| attribute
298     should be made obsolete.
299     (parse_char_string): Unescape attribute names.
300     (check): Support for checking of whether inherited interfaces
301     are actually defined or not. Support for checking of whether
302     interface member identifiers are duplicated or not.
303     ($serialize_type): Scoped names are returned as is. A future
304     version of this code should escape identifiers other than "DOMString",
305     otherwise the idl_text would be non-conforming.
306    
307     2008-08-02 Wakaba <wakaba@suika.fam.cx>
308    
309 wakaba 1.264 * WebIDL.pm (parse_char_string): Set line/column numbers
310 wakaba 1.265 to generated nodes. Unescape identifiers. Extended attributes
311     for Definition's were ignored.
312     (append_child): Set |parent_node| attribute.
313     (parent_node): New attribute.
314     (check): Support interface/exception members. Support
315     extended attributes. Support definition identifier uniqueness
316     constraint.
317     (qualified_name): New attribute.
318     (Interface/Exception idl_text): Extended attributes were
319     not prepended to the returned text.
320    
321     2008-08-02 Wakaba <wakaba@suika.fam.cx>
322    
323     * WebIDL.pm (parse_char_string): Set line/column numbers
324 wakaba 1.264 to interface object experimentally. s/shift/pop/g, shift
325     would make things wrong. Support for interface forward
326     declarations was missing. Broken interface declarations
327     with no block were not ignored entirely.
328     (Whatpm::WebIDL::Node): New abstract class. This class
329     makes things easier.
330     (child_nodes): New attribute. Unlike DOM's attribute with
331     same name, this attribute returns a dead list of nodes for
332     simplicity.
333     (get_user_data, set_user_data): New methods.
334     (Module idl_text): A SPACE character should be inserted
335     before the |{| character.
336     (Interface idl_text): Support for interface forward declarations.
337     (is_forward_declaration): New attribute.
338    
339 wakaba 1.259 2008-07-19 Wakaba <wakaba@suika.fam.cx>
340    
341 wakaba 1.263 * WebIDL.pm (type_text): Better serializer.
342    
343     2008-07-19 Wakaba <wakaba@suika.fam.cx>
344    
345 wakaba 1.262 * WebIDL.pm: Revise forward-compatible parsing so that
346     it now can handle broken extended attributes and as such.
347    
348     2008-07-19 Wakaba <wakaba@suika.fam.cx>
349    
350 wakaba 1.261 * WebIDL.pm: Real support for extended attributes.
351     Support for extended attributes with arguments.
352    
353     2008-07-19 Wakaba <wakaba@suika.fam.cx>
354    
355 wakaba 1.260 * WebIDL.pm: Support for |exception| syntax.
356     (Interface->idl_text): Tentative support for inheritances.
357    
358     2008-07-19 Wakaba <wakaba@suika.fam.cx>
359    
360 wakaba 1.259 * WebIDL.pm: Hierarchical scoped name support was broken.
361     Support for raises, setraises, and getraises syntaxes.
362    
363 wakaba 1.258 2008-07-18 Wakaba <wakaba@suika.fam.cx>
364    
365     * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
366     proper support for types, extended attributes, and exceptions yet).
367     WebIDL parser, version 1 (no support for exceptions yet,
368     no proper support for extended attributes yet).
369    
370 wakaba 1.257 2008-07-09 Wakaba <wakaba@suika.fam.cx>
371    
372     * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
373    
374 wakaba 1.255 2008-06-29 Wakaba <wakaba@suika.fam.cx>
375    
376 wakaba 1.256 * WebIDL.pm: Support for valuetype and const.
377    
378     2008-06-29 Wakaba <wakaba@suika.fam.cx>
379    
380 wakaba 1.255 * WebIDL.pm: New module.
381    
382 wakaba 1.254 2008-06-15 Wakaba <wakaba@suika.fam.cx>
383    
384     * Makefile (Entities.html): URI changed.
385    
386 wakaba 1.253 2008-06-08 Wakaba <wakaba@suika.fam.cx>
387    
388     * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
389    
390 wakaba 1.251 2008-06-01 Wakaba <wakaba@suika.fam.cx>
391    
392 wakaba 1.252 * HTML.pm.src (_get_next_token): A parse error was missing.
393    
394     2008-06-01 Wakaba <wakaba@suika.fam.cx>
395    
396 wakaba 1.251 * mklinktypelist.pl: rel=contact is no longer part of the HTML5
397     spec (commented out). (HTML5 revision 1711).
398    
399 wakaba 1.248 2008-05-25 Wakaba <wakaba@suika.fam.cx>
400    
401 wakaba 1.250 * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
402    
403     * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
404     as UTF-16 (HTML5 revision 1701).
405    
406     2008-05-25 Wakaba <wakaba@suika.fam.cx>
407    
408 wakaba 1.249 * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
409     1692).
410    
411     2008-05-25 Wakaba <wakaba@suika.fam.cx>
412    
413     * HTML.pm.src: The secondary insertion mode used when switching
414     to foreign content is the "in body" insertion mode (HTML5 revision
415     1696).
416    
417     2008-05-25 Wakaba <wakaba@suika.fam.cx>
418    
419 wakaba 1.248 * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
420     revision 1697).
421    
422 wakaba 1.239 2008-05-24 Wakaba <wakaba@suika.fam.cx>
423    
424 wakaba 1.247 * HTML.pm.src: Support for end-of-file token in foreign content
425     insertion mode (HTML5 revision 1693). Update SVG camelCase
426     attribute list (HTML5 revision 1700). <textarea> closes
427     </select> (HTML5 revision 1699). More start tags close in
428     foreign content insertion mode (HTML5 revision 1698).
429    
430     2008-05-24 Wakaba <wakaba@suika.fam.cx>
431    
432 wakaba 1.246 * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
433    
434     2008-05-24 Wakaba <wakaba@suika.fam.cx>
435    
436 wakaba 1.245 * HTML.pm.src: More robust charset parameter detection (HTML5
437     revision 1674).
438    
439     2008-05-24 Wakaba <wakaba@suika.fam.cx>
440    
441 wakaba 1.244 * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
442     revision 1676).
443    
444     2008-05-24 Wakaba <wakaba@suika.fam.cx>
445    
446 wakaba 1.243 * HTML.pm.src: Ignore language part of public identifiers for
447     quriks mode detection (HTML5 revision 1679).
448    
449     2008-05-24 Wakaba <wakaba@suika.fam.cx>
450    
451 wakaba 1.242 * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
452     revision 1685).
453    
454     2008-05-24 Wakaba <wakaba@suika.fam.cx>
455    
456 wakaba 1.241 * HTML.pm.src: Support for EOF in new states for tags (HTML5
457     revision 1684).
458    
459     2008-05-24 Wakaba <wakaba@suika.fam.cx>
460    
461 wakaba 1.240 * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
462     work (HTML5 revision 1690).
463    
464     2008-05-24 Wakaba <wakaba@suika.fam.cx>
465    
466 wakaba 1.239 * HTML.pm.src (_tree_construction_main): Change handling of
467     end tags in head insertion modes (HTML5 revision 1686).
468     (parse_char_string): Bug fix for non-utf8 character string handlings.
469     (parse_char_stream): |ungetc| does not work well for this context.
470    
471 wakaba 1.237 2008-05-18 Wakaba <wakaba@suika.fam.cx>
472    
473 wakaba 1.238 * HTML.pm.src (parse_byte_string): Redefined to invoke
474     |parse_byte_stream|.
475     (parse_byte_stream): New method.
476    
477     2008-05-18 Wakaba <wakaba@suika.fam.cx>
478    
479 wakaba 1.237 * HTML.pm.src (parse_byte_string): Fix the column number reported
480     by encoding layer error reporter.
481    
482 wakaba 1.233 2008-05-17 Wakaba <wakaba@suika.fam.cx>
483    
484 wakaba 1.236 * HTML.pm.src (parse_byte_string): Use streaming decoder
485     rather than converting the whole byte string and then parsing.
486     Propagate errors in character encoding layer.
487     (get_next_token): Precise error reporting for |bare stago| error.
488    
489     2008-05-17 Wakaba <wakaba@suika.fam.cx>
490    
491 wakaba 1.235 * HTML.pm.src (parse_char_stream): New method.
492     (parse_char_string): This method is now defined as an invocation
493     of the |parse_char_stream| method.
494    
495     2008-05-17 Wakaba <wakaba@suika.fam.cx>
496    
497 wakaba 1.234 * HTML.pm.src (parse_byte_string): Report various status
498     of the sniffing as info-level errors. Support for new
499     decoding framework in parser resestting.
500     (new): Various default error levels were not set.
501    
502     2008-05-17 Wakaba <wakaba@suika.fam.cx>
503    
504 wakaba 1.233 * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
505     algorithm, except for the actual sniffing, is implemented
506     with new framework with Message::Charset::Info.
507    
508 wakaba 1.232 2008-05-16 Wakaba <wakaba@suika.fam.cx>
509    
510     * CacheManifest.pm (_parse): Drop fragment identifiers from
511     URIs in fallback section (HTML5 revision 1596).
512    
513 wakaba 1.230 2008-05-10 Wakaba <wakaba@suika.fam.cx>
514    
515 wakaba 1.231 * Makefile (Entities.html): URI has changed.
516    
517     2008-05-10 Wakaba <wakaba@suika.fam.cx>
518    
519 wakaba 1.230 * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
520     1553).
521    
522 wakaba 1.228 2008-05-06 Wakaba <wakaba@suika.fam.cx>
523    
524 wakaba 1.229 * ContentChecker.pm: Noted that those returned in |table| are
525     no longer table elements, but table objects returned
526     by Whatpm::HTMLTable.
527    
528     * HTMLTable.pm (form_table): Return table element node
529     as |$table->{element}|.
530     (assign_header): Support for the |headers=""| attribute.
531    
532     2008-05-06 Wakaba <wakaba@suika.fam.cx>
533    
534 wakaba 1.228 * HTMLTable.pm (assign_header): New function; first version
535     with no support for headers="".
536     (form_table): Include table width and height to the returned
537     table object for covenience. Indexing in column assignement was wrong.
538     Set whether a data cell is empty or not for convenience.
539    
540 wakaba 1.222 2008-05-05 Wakaba <wakaba@suika.fam.cx>
541    
542 wakaba 1.227 * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
543    
544     2008-05-05 Wakaba <wakaba@suika.fam.cx>
545    
546 wakaba 1.226 * HTMLTable.pm: How table model errors are detected is
547     changed (HTML5 revision 1387).
548    
549     2008-05-05 Wakaba <wakaba@suika.fam.cx>
550    
551 wakaba 1.225 * HTMLTable.pm: The algorithm now moves |tfoot| elements
552     to the end of the table (HTML5 revision 1380).
553    
554     2008-05-05 Wakaba <wakaba@suika.fam.cx>
555    
556 wakaba 1.224 * HTMLTable.pm: The algorithm is now 0-based indexing, instead
557     of 1-based (HTML5 revision 1376).
558    
559     2008-05-05 Wakaba <wakaba@suika.fam.cx>
560    
561 wakaba 1.223 * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
562     and "Content-Encoding" no longer prevent sniffing (HTML5
563     revision 1288).
564    
565     2008-05-05 Wakaba <wakaba@suika.fam.cx>
566    
567 wakaba 1.222 * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
568     1282).
569    
570 wakaba 1.220 2008-05-03 Wakaba <wakaba@suika.fam.cx>
571    
572 wakaba 1.221 * ContentChecker.pm: Support for global attributes.
573     Status of XML specs are added.
574    
575     2008-05-03 Wakaba <wakaba@suika.fam.cx>
576    
577 wakaba 1.220 * ContentChecker.pm (check_element): Support for |template=""|
578     and |ref=""| attribute (referent element type checking).
579    
580 wakaba 1.218 2008-04-29 Wakaba <wakaba@suika.fam.cx>
581    
582 wakaba 1.219 * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
583     1500) is implemented (except for IDNA part and URI-scheme-specific
584 wakaba 1.218 knowledge). Line number counting was wrong for LF-only
585     documents.
586    
587 wakaba 1.211 2008-04-13 Wakaba <wakaba@suika.fam.cx>
588    
589 wakaba 1.217 * HTML.pm.src: Raise an parse error for any disallowed
590     character (HTML5 revision 1263).
591    
592     2008-04-13 Wakaba <wakaba@suika.fam.cx>
593    
594 wakaba 1.216 * mkentitylist.pl: Support for new HTML5 entity table format (the
595     definition for |AElig;| was missing).
596    
597     2008-04-13 Wakaba <wakaba@suika.fam.cx>
598    
599 wakaba 1.215 * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
600     name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
601    
602     2008-04-13 Wakaba <wakaba@suika.fam.cx>
603    
604 wakaba 1.214 * HTML.pm.src: List of element names that close foreign content
605     insertion mode is added (HTML5 revisions 1412 and 1418).
606    
607     2008-04-13 Wakaba <wakaba@suika.fam.cx>
608    
609 wakaba 1.213 * HTML.pm.src: Support for |mglyph| and |malignmark|
610     elements (HTML5 revision 1410).
611    
612     2008-04-13 Wakaba <wakaba@suika.fam.cx>
613    
614 wakaba 1.212 * HTML.pm.src: Support for new long MathML entities (HTML5
615     revision 1406).
616    
617     2008-04-13 Wakaba <wakaba@suika.fam.cx>
618    
619 wakaba 1.211 * HTML.pm.src: CDATA section support for MathML and SVG
620     elements (HTML5 revision 1404 and 1420).
621    
622 wakaba 1.209 2008-04-12 Wakaba <wakaba@suika.fam.cx>
623    
624 wakaba 1.210 * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
625     elements (HTML5 revision 1404). Unused !!!macro definitions
626     are removed.
627    
628     2008-04-12 Wakaba <wakaba@suika.fam.cx>
629    
630 wakaba 1.209 * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
631     are raised is changed (HTML5 revision 1404).
632    
633 wakaba 1.206 2008-04-06 Wakaba <wakaba@suika.fam.cx>
634    
635 wakaba 1.208 * HTML.pm.src: Category sets that are no longer used are removed.
636    
637     2008-04-06 Wakaba <wakaba@suika.fam.cx>
638    
639 wakaba 1.207 * HTML.pm.src: The ->[1] property of stack entries are now
640     replaced by constants representing element category.
641    
642     2008-04-06 Wakaba <wakaba@suika.fam.cx>
643    
644 wakaba 1.206 * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
645     for error reporting. (This is a preparation for using constant
646     value for ->[1].)
647    
648 wakaba 1.204 2008-03-22 Wakaba <wakaba@suika.fam.cx>
649    
650 wakaba 1.205 * RDFXML.pm: Typo fixed.
651    
652     2008-03-22 Wakaba <wakaba@suika.fam.cx>
653    
654 wakaba 1.204 * ContentChecker.pm: |fact_level| is now treated
655     as same as |must_level|, i.e. level = |m|.
656     (check_element): Make list of URIs in the DOM.
657    
658 wakaba 1.197 2008-03-21 Wakaba <wakaba@suika.fam.cx>
659    
660 wakaba 1.203 * RDFXML.pm: Language accessor implemented.
661     Local (null-namespace) attribute support.
662    
663     2008-03-21 Wakaba <wakaba@suika.fam.cx>
664    
665 wakaba 1.202 * RDFXML.pm: Factored out ID checking code.
666    
667     2008-03-21 Wakaba <wakaba@suika.fam.cx>
668    
669 wakaba 1.201 * RDFXML.pm: TODO items noted. Validation of ID
670     and URI attributes is implemented. Warn if unknown
671     value is used in rdf:parseType="" attribute.
672    
673     * URIChecker.pm (check_rdf_uri_reference): New function.
674    
675     2008-03-21 Wakaba <wakaba@suika.fam.cx>
676    
677 wakaba 1.200 * RDFXML.pm: bnodeid implemented. Relative references
678     are now resolved.
679    
680     2008-03-21 Wakaba <wakaba@suika.fam.cx>
681    
682 wakaba 1.199 * ContentChecker.pm: RDF reification implemented.
683    
684     * RDFXML.pm: undef vs false bug fixed.
685     Reification implemented.
686    
687     2008-03-21 Wakaba <wakaba@suika.fam.cx>
688    
689 wakaba 1.198 * RDFXML.pm: s/id/ID/ for attribute name.
690     The |node| arguments are added for |ontriple| calls.
691     Too many "attribute not allowed" errors were raised.
692    
693     * ContentChecker.pm: Initial experimental support for rdf:RDF
694     element.
695    
696     2008-03-21 Wakaba <wakaba@suika.fam.cx>
697    
698 wakaba 1.197 * RDFXML.pm: New module.
699    
700 wakaba 1.192 2008-03-20 Wakaba <wakaba@suika.fam.cx>
701    
702 wakaba 1.196 * HTML.pm.src (set_inner_html): Line/column number
703     code was old one yet.
704    
705     2008-03-20 Wakaba <wakaba@suika.fam.cx>
706    
707 wakaba 1.195 * HTML.pm.src: Better line/column reporting for "duplicate attribute"
708     errors. Line/column markings for DOCTYPE, comment, and
709     character tokens are reintroduced; otherwise,
710     error location for "not HTML5" error and errors
711     for implied elements are not attached.
712    
713     2008-03-20 Wakaba <wakaba@suika.fam.cx>
714    
715 wakaba 1.194 * HTML.pm.src: Set line/column numbers to attributes.
716    
717     * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
718     (value): Setter implemented.
719    
720     * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
721    
722     2008-03-20 Wakaba <wakaba@suika.fam.cx>
723    
724 wakaba 1.193 * HTML.pm.src: Unused line/column markings are removed.
725    
726     2008-03-20 Wakaba <wakaba@suika.fam.cx>
727    
728 wakaba 1.192 * HTML.pm.src (_get_next_token): Remove |first_start_tag|
729     flag, which is no longer used.
730    
731 wakaba 1.190 2008-03-17 Wakaba <wakaba@suika.fam.cx>
732    
733 wakaba 1.191 * HTML.pm.src: Set line/column information to element nodes.
734    
735     * mkhtmlparser.pl (!!!create-element, !!!insert-element,
736     and !!!insert-element-t): Set line/column information to
737     element nodes.
738    
739     2008-03-17 Wakaba <wakaba@suika.fam.cx>
740    
741 wakaba 1.190 * HTML.pm.src (_get_next_token): The first "<" character
742     in "<?", "<>", or "</>" should be the error point.
743    
744 wakaba 1.187 2008-03-16 Wakaba <wakaba@suika.fam.cx>
745    
746 wakaba 1.189 * HTML.pm.src: Some more fixes on error position reporting.
747    
748     2008-03-16 Wakaba <wakaba@suika.fam.cx>
749    
750 wakaba 1.188 * HTML.pm.src: Token-level precious error reporting.
751    
752     2008-03-16 Wakaba <wakaba@suika.fam.cx>
753    
754 wakaba 1.187 * HTML.pm.src: Preparation for more precious error point
755     reporting.
756    
757 wakaba 1.184 2008-03-11 Wakaba <wakaba@suika.fam.cx>
758    
759 wakaba 1.186 * HTML.pm.src: Error type revised.
760    
761     2008-03-11 Wakaba <wakaba@suika.fam.cx>
762    
763 wakaba 1.185 * HTML.pm.src: Similar codes are merged together, again.
764    
765     2008-03-11 Wakaba <wakaba@suika.fam.cx>
766    
767 wakaba 1.184 * HTML.pm.src: Similar codes are merged together.
768    
769 wakaba 1.183 2008-03-10 Wakaba <wakaba@suika.fam.cx>
770    
771     * mkhtmlparser.pl: Set "level" parameter to parse errors.
772    
773     * HTML.pm.src: Code refined.
774    
775 wakaba 1.175 2008-03-09 Wakaba <wakaba@suika.fam.cx>
776    
777 wakaba 1.182 * HTML.pm.src: |</body>| treatement has been changed (HTML5
778     revision 1348). Note that I really don't know this makes
779     any difference in the black-box behavior of the parser.
780    
781     2008-03-09 Wakaba <wakaba@suika.fam.cx>
782    
783 wakaba 1.181 * HTML.pm.src: New end-of-file token implementation (HTML5
784     revision 1348).
785    
786     2008-03-09 Wakaba <wakaba@suika.fam.cx>
787    
788 wakaba 1.180 * HTML.pm.src: |applet| support (HTML5 revision 1347).
789    
790     2008-03-09 Wakaba <wakaba@suika.fam.cx>
791    
792 wakaba 1.179 * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
793    
794     2008-03-09 Wakaba <wakaba@suika.fam.cx>
795    
796 wakaba 1.178 * HTML.pm.src: Support for |<input>| in the "in select" insertion
797     mode and support for the "in select in table" insertion mode (HTML5
798     revision 1342).
799    
800     2008-03-09 Wakaba <wakaba@suika.fam.cx>
801    
802 wakaba 1.177 * HTML.pm.src: No foster parenting for <script> and <script>
803     in non-tainted <table>s (HTML5 revision 1336).
804    
805     2008-03-09 Wakaba <wakaba@suika.fam.cx>
806    
807 wakaba 1.176 * HTML.pm.src: Ignore white space characters between <html>
808     and <head> (HTML5 revision 1332).
809    
810     2008-03-09 Wakaba <wakaba@suika.fam.cx>
811    
812 wakaba 1.175 * HTML.pm.src: Treat <input type=hidden> as if it were a
813     white space (HTML5 revision 1331).
814    
815 wakaba 1.164 2008-03-08 Wakaba <wakaba@suika.fam.cx>
816    
817 wakaba 1.174 * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
818     element (HTML5 revision 1330).
819    
820     2008-03-08 Wakaba <wakaba@suika.fam.cx>
821    
822 wakaba 1.173 * HTML.pm.src: <title> is always appended to the current
823     element (HTML5 revision 1328).
824    
825     2008-03-08 Wakaba <wakaba@suika.fam.cx>
826    
827 wakaba 1.172 * HTML.pm.src: White space in tainted tables are moved
828     into foster parents (HTML5 revision 1326).
829    
830     2008-03-08 Wakaba <wakaba@suika.fam.cx>
831    
832 wakaba 1.171 * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
833     revision 1321).
834    
835     2008-03-08 Wakaba <wakaba@suika.fam.cx>
836    
837 wakaba 1.170 * HTML.pm.src: |</h/n/>| case code rearranged to align with
838     the spec (HTML5 revision 1320). Note that we finally complete
839     all of HTML5 revision 1320 changes.
840    
841     2008-03-08 Wakaba <wakaba@suika.fam.cx>
842    
843 wakaba 1.169 * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
844     tags (HTML5 revision 1320).
845    
846     2008-03-08 Wakaba <wakaba@suika.fam.cx>
847    
848 wakaba 1.168 * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
849    
850     2008-03-08 Wakaba <wakaba@suika.fam.cx>
851    
852 wakaba 1.167 * HTML.pm.src: A "generate implied end tags" code (t409.1)
853     could not be reached so that it is now removed (HTML5 revision
854     1320).
855    
856     2008-03-08 Wakaba <wakaba@suika.fam.cx>
857    
858 wakaba 1.166 * HTML.pm.src: Code for the case of |</div>| and so on are revised
859     to align with new spec text (HTML5 revision 1320).
860    
861     2008-03-08 Wakaba <wakaba@suika.fam.cx>
862    
863 wakaba 1.165 * HTML.pm.src: Remove strange |if| condition; however, it should
864     have had no harm in theory.
865    
866     2008-03-08 Wakaba <wakaba@suika.fam.cx>
867    
868 wakaba 1.164 * HTML.pm.src (_tree_construction_main): '</p>' in body
869     case is split from other end tags for the preparation
870     of implementing HTML revision 1320.
871    
872 wakaba 1.162 2008-03-07 Wakaba <wakaba@suika.fam.cx>
873    
874 wakaba 1.163 * HTML.pm.src: Simplified "generate implied end tag" (HTML5
875     revision 1320).
876    
877     2008-03-07 Wakaba <wakaba@suika.fam.cx>
878    
879 wakaba 1.162 * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
880     and "div" (HTML5 revision 1318). Add comments to where
881     |form| pointer association codes should be inserted (HTML5 revision
882     1319).
883    
884 wakaba 1.161 2008-03-06 Wakaba <wakaba@suika.fam.cx>
885    
886     * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
887    
888 wakaba 1.159 2008-03-05 Wakaba <wakaba@suika.fam.cx>
889    
890 wakaba 1.160 * HTML.pm.src: Since the case t268 should never be reached (no
891     other token type, there are), it is replaced by a |die| statement.
892    
893     2008-03-05 Wakaba <wakaba@suika.fam.cx>
894    
895 wakaba 1.159 * HTML.pm.src: Typo fixed.
896    
897 wakaba 1.158 2008-03-04 Wakaba <wakaba@suika.fam.cx>
898    
899     * HTML.pm.src (_tree_construction_initial): Some limited quirks
900     doctypes were not uppercased for comparison.
901    
902 wakaba 1.152 2008-03-03 Wakaba <wakaba@suika.fam.cx>
903    
904 wakaba 1.157 * HTML.pm.src (tree construction and set_inner_html): Checkpoints
905     are added.
906    
907     2008-03-03 Wakaba <wakaba@suika.fam.cx>
908    
909 wakaba 1.156 * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
910     are set. Cases that are unlikely reached are noted as so.
911    
912     2008-03-03 Wakaba <wakaba@suika.fam.cx>
913    
914 wakaba 1.155 * HTML.pm.src: Checkpoints for debugging are added.
915    
916     * mkhtmlparser.pl: Support for |!!!cp| syntax.
917    
918     2008-03-03 Wakaba <wakaba@suika.fam.cx>
919    
920 wakaba 1.154 * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
921     for simplicity.
922    
923     2008-03-03 Wakaba <wakaba@suika.fam.cx>
924    
925 wakaba 1.153 * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
926     revision 1307).
927    
928     2008-03-03 Wakaba <wakaba@suika.fam.cx>
929    
930 wakaba 1.152 * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
931     is set are changed (HTML5 revision 1305).
932    
933 wakaba 1.149 2008-03-02 Wakaba <wakaba@suika.fam.cx>
934    
935 wakaba 1.151 * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
936     1292). Entities are not parsed in comment-like part in RCDATA
937     elements (HTML5 revision 1294). Allow bare & at the end
938     of attribute value literals (HTML5 revision 1296). More
939     quirks mode doctypes (HTML5 revision 1302). Requires spaces
940     between attributes and ban attribute names or unquoted
941     attribute values containing single or double quotes (HTML5
942     revision 1303).
943    
944     2008-03-02 Wakaba <wakaba@suika.fam.cx>
945    
946 wakaba 1.150 * ContentChecker.pm: Typo fixed. Don't raise "character encoding"
947     and related errors unless it is an HTML document (though the spec
948     is unclear on whether it is applied to XHTML document).
949    
950     * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
951     are added.
952    
953     2008-03-02 Wakaba <wakaba@suika.fam.cx>
954    
955 wakaba 1.149 * HTML.pm.src: s/local_name/manakai_local_name/g.
956    
957 wakaba 1.148 2008-03-01 Wakaba <wakaba@suika.fam.cx>
958    
959     * _NamedEntityList.pm: Updated (HTML5 revision 1286).
960    
961     * HTML.pm.src: |charset| in |content| attribute is
962     case-insensitive (HTML5 revision 1270).
963    
964 wakaba 1.147 2008-02-26 Wakaba <wakaba@suika.fam.cx>
965    
966     * ContentChecker.pm: New status constants are added.
967     ($ElementDefault): |status| added.
968     (check_element): Err for non-standard or deprecated elements.
969     (_attr_status_info): For non-standard or deprecated attributes.
970    
971 wakaba 1.142 2008-02-24 Wakaba <wakaba@suika.fam.cx>
972    
973 wakaba 1.146 * ContentChecker.pm (_attr_status_info): New internal method.
974    
975     2008-02-24 Wakaba <wakaba@suika.fam.cx>
976    
977 wakaba 1.145 * ContentChecker.pm (check_element): Element standardized
978     status information is now dispatched.
979    
980     2008-02-24 Wakaba <wakaba@suika.fam.cx>
981    
982 wakaba 1.144 * ContentChecker.pm (check_element): Fix |del|-and-significant
983     problem by adding some more arguments.
984    
985     2008-02-24 Wakaba <wakaba@suika.fam.cx>
986    
987 wakaba 1.143 * ContentChecker.pm (check_element): Use context of
988     container-for-the-purpose-of-content-model element (not transparent
989     element) for |check_child_element| calling and significant
990     text flag marking. This reintroduces |<del>|-and-significant
991     problem again.
992    
993     2008-02-24 Wakaba <wakaba@suika.fam.cx>
994    
995 wakaba 1.142 * ContentChecker.pm (check_element): Make semi-transparent
996     elements ignored for the purpose of phase changes in
997     content model checking.
998    
999 wakaba 1.138 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1000    
1001 wakaba 1.141 * ContentChecker.pm (check_element): In-element state
1002     was not properly managed for transparent cases.
1003    
1004     2008-02-23 Wakaba <wakaba@suika.fam.cx>
1005    
1006 wakaba 1.140 * ContentChecker.pm (check_element): Support for |video|
1007     and |audio| as semi-transparent elements.
1008    
1009     2008-02-23 Wakaba <wakaba@suika.fam.cx>
1010    
1011 wakaba 1.139 * ContentChecker.pm ($HTMLSemiTransparentElements): New.
1012     (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
1013     Support for |html:object| as a semi-transparent element.
1014    
1015     2008-02-23 Wakaba <wakaba@suika.fam.cx>
1016    
1017 wakaba 1.138 * ContentChecker.pm (check_element): The way to traverse
1018     the tree is entirely revised to make it easier to track
1019     the state of ancestors/descendants. As a result of this
1020     revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1021     support for content model checking for HTML elements |figure|,
1022     |object|, |video|, and |audio| and checking for XML elements (and
1023     some XMLNS checkings) are dropped for now. They will be
1024     reimplemented in due cource.
1025    
1026 wakaba 1.134 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1027    
1028 wakaba 1.137 * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1029     token (HTML5 revision 1225).
1030    
1031     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1032    
1033 wakaba 1.136 * ContentChecker.pm ({unsupported_level}): New value.
1034    
1035     * HTML.pm.src: Save whether |meta| |content| attribute
1036     contains character references or not.
1037    
1038     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1039    
1040 wakaba 1.135 * ContentChecker.pm (_get_children): (Incompleted) attempt to
1041     imlement significant content checking for contents
1042     with |del| elements.
1043    
1044     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1045    
1046 wakaba 1.134 * ContenteChecker.pm ($HTMLTransparentElements): More
1047     elements are added.
1048     (_get_children): HTML |object| elements are now semi-transparent.
1049    
1050     * NanoDOM.pm (manakai_html, manakai_head): New methods.
1051    
1052 wakaba 1.133 2008-02-16 Wakaba <wakaba@suika.fam.cx>
1053    
1054     * CacheManifest.pm: HTML5 revision 1211 implemented.
1055    
1056     * CacheManifest.pod: Updated.
1057    
1058 wakaba 1.132 2008-02-10 Wakaba <wakaba@suika.fam.cx>
1059    
1060     * ContentChecker.pm (check_document, check_element): Support
1061     for second argument ($onsubdoc).
1062     (_get_css_parser): Removed (now it is part of WDCC).
1063    
1064 wakaba 1.131 2008-02-09 Wakaba <wakaba@suika.fam.cx>
1065    
1066     * ContentChecker.pm (_get_css_parser): New.
1067    
1068 wakaba 1.129 2007-11-25 Wakaba <wakaba@suika.fam.cx>
1069    
1070 wakaba 1.130 * ContentChecker.pm ($AnyChecker): Old way to add child elements
1071     for checking had been used.
1072    
1073     2007-11-25 Wakaba <wakaba@suika.fam.cx>
1074    
1075 wakaba 1.129 * ContentChecker.pm (check_element): New todo item type |descendant|.
1076    
1077 wakaba 1.125 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1078    
1079 wakaba 1.128 * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1080     specced in RFC 2046 and RFC 4288.
1081     (application/atom+xml): Definition added.
1082    
1083     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1084    
1085 wakaba 1.127 * URIChecker.pm: Make RFC 3986 should-level errors
1086     warnings (rather than SHOULD-level errors).
1087    
1088     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1089    
1090 wakaba 1.126 * NanoDOM.pm (get_user_data, set_user_data): New methods.
1091    
1092     * HTML.pm.src: A flag for character references in attribute
1093     values are added. Set |manakai_has_reference| user data
1094     to |charset| attribute.
1095    
1096     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1097    
1098 wakaba 1.125 * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1099     attributes.
1100    
1101     * ContentChecker.pm (check_document): Warn if charset requirements
1102     cannot be tested.
1103    
1104 wakaba 1.124 2007-11-19 Wakaba <wakaba@suika.fam.cx>
1105    
1106     * HTML.pm.src (parse_byte_string): Detect charset
1107     by universalchardet if charset parameter is not specified.
1108    
1109     * Makefile (Charset-all, Charset-clean): New rules.
1110    
1111 wakaba 1.120 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1112    
1113 wakaba 1.123 * ContentChecker.pm (check_document): Check the existence
1114     of character encoding declaration and selection of encoding
1115     for HTML document.
1116    
1117     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1118    
1119 wakaba 1.122 * ContentType.pm (get_sniffed_type): Return also the official
1120     type in list context.
1121    
1122     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1123    
1124 wakaba 1.121 * ContentType.pm: Sniffing with leading white space
1125     ignoring (HTML5 revisions 1013 and 1016).
1126    
1127     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1128    
1129 wakaba 1.120 * ContentType.pm: HTML5 revision 1013 changes, except for leading
1130     white spaces, are implemented.
1131    
1132 wakaba 1.117 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1133    
1134 wakaba 1.119 * HTML.pm.src (parse_char_string): Set |inner_encoding|
1135     attribute if possible.
1136    
1137     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1138    
1139 wakaba 1.118 * HTML.pm.src (parse_byte_string): New method.
1140     (parse_char_string): New alias for |parse_string|.
1141     (main phase): Invoking "change the encoding" algorithm if desired.
1142    
1143     * HTML.pod: Updated.
1144    
1145     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1146    
1147 wakaba 1.117 * HTML.pod (get_inner_html): Removed.
1148    
1149     * Makefile (HTML-all, HTML-clean): New.
1150    
1151     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1152    
1153     * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1154    
1155 wakaba 1.116 2007-11-08 Wakaba <wakaba@suika.fam.cx>
1156    
1157     * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1158    
1159 wakaba 1.112 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1160    
1161 wakaba 1.115 * Makefile: |CacheManifest.html| is added.
1162    
1163     * CacheManifest.pod: New file.
1164    
1165     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1166    
1167 wakaba 1.114 * CacheManifest.pm: New module.
1168    
1169     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1170    
1171 wakaba 1.113 * HTML.pm.src: Support for application cache selection algorithm
1172     callback.
1173    
1174     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1175    
1176 wakaba 1.112 * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1177    
1178 wakaba 1.110 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1179    
1180 wakaba 1.111 * Makefile (clean): New rule.
1181    
1182     * NanoDOM.pm (public_id, system_id): New attributes.
1183    
1184     2007-10-17 Wakaba <wakaba@suika.fam.cx>
1185    
1186 wakaba 1.110 * Makefile (CSS-all, CSS-clean, clean): New rules.
1187    
1188 wakaba 1.109 2007-10-14 Wakaba <wakaba@suika.fam.cx>
1189    
1190     * ContentChecker.pm (check_document): Support for
1191     new |is_xml_root| flag.
1192     (check_element): Support for new |pluses| state.
1193     (_add_pluses): New method.
1194     (_remove_minuses): Support for new |minus| item.
1195    
1196 wakaba 1.106 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1197    
1198 wakaba 1.107 * ContentChecker.pm: Raise specific error for invalid
1199     root element.
1200    
1201     2007-09-24 Wakaba <wakaba@suika.fam.cx>
1202    
1203 wakaba 1.106 * ContentChecker.pm: Set level values for later uses.
1204    
1205 wakaba 1.103 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1206    
1207 wakaba 1.105 * ContentChecker.pm: Support for language tag validation.
1208    
1209     2007-09-09 Wakaba <wakaba@suika.fam.cx>
1210    
1211 wakaba 1.104 * LangTag.pm (check_rfc3066_language_tag): New method.
1212    
1213     2007-09-09 Wakaba <wakaba@suika.fam.cx>
1214    
1215 wakaba 1.103 * LangTag.pm: New module.
1216    
1217 wakaba 1.102 2007-09-04 Wakaba <wakaba@suika.fam.cx>
1218    
1219     * HTML.pm.src: Some error types were wrong.
1220    
1221 wakaba 1.100 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1222    
1223 wakaba 1.101 * CSS/: New directory.
1224    
1225     2007-08-17 Wakaba <wakaba@suika.fam.cx>
1226    
1227 wakaba 1.100 * ContentChecker.pm (_check_get_children): Support
1228     for |noscript| in |head|.
1229    
1230 wakaba 1.99 2007-08-12 Wakaba <wakaba@suika.fam.cx>
1231    
1232     * URI/: New directory.
1233    
1234 wakaba 1.94 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1235    
1236 wakaba 1.98 * HTML.pm.src: Tokenizer's states are now represented in
1237     number.
1238    
1239     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1240    
1241 wakaba 1.97 * HTML.pm.src: |or|s for insertion modes are replaced
1242     by |&|s.
1243    
1244     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1245    
1246 wakaba 1.96 * HTML.pm.src: Token types are now represented in number.
1247    
1248     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1249    
1250 wakaba 1.95 * ContentType.pm (SEE ALSO): Updated.
1251    
1252     * HTML.pm.src: Insertion modes are now represented in number.
1253    
1254     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1255    
1256 wakaba 1.94 * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1257     999) is implemented.
1258    
1259 wakaba 1.93 2007-08-08 Wakaba <wakaba@suika.fam.cx>
1260    
1261     * ContentType.pm: Sniffing for |<script| (HTML5 revision
1262     983) is implemented.
1263    
1264 wakaba 1.92 2007-08-06 Wakaba <wakaba@suika.fam.cx>
1265    
1266     * ContentChecker.pod: New documentation.
1267    
1268     * Makefile: A rule for |ContentChecker.html| is added.
1269    
1270     * ContentChecker.pm: A pod "LICENSE" section is added.
1271    
1272     * NanoDOM.pm ($VERSION): New variable.
1273    
1274 wakaba 1.90 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1275    
1276 wakaba 1.91 * H2H.pm: |b|, |i|, and |sub| are added to the
1277     list of allowed HTML elements.
1278    
1279     2007-08-05 Wakaba <wakaba@suika.fam.cx>
1280    
1281 wakaba 1.90 * H2H.pm: |samp| is added to the list of allowed
1282     HTML elements.
1283    
1284     * URIChecker.pm (check_iri): New.
1285     (check_iri_reference): Error type for IRI reference
1286     syntax error is changed.
1287    
1288 wakaba 1.87 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1289    
1290 wakaba 1.89 * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1291     is added.
1292     (check_document): Load appropriate module before validation.
1293    
1294     2007-08-04 Wakaba <wakaba@suika.fam.cx>
1295    
1296 wakaba 1.88 * ContentChecker/: New directory.
1297    
1298     2007-08-04 Wakaba <wakaba@suika.fam.cx>
1299    
1300 wakaba 1.87 * ContentChecker.pm: HTML |time| element is implemented.
1301    
1302     * HTMLTable.pm: Comments are updated as HTML5 is revised.
1303    
1304 wakaba 1.71 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1305    
1306 wakaba 1.86 * ContentChecker.pm (check_document): Return value
1307     even if no document element is found.
1308    
1309     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1310    
1311 wakaba 1.85 * HTML.pm.src: |$in_body| is no longer a function.
1312    
1313     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1314    
1315 wakaba 1.84 * HTML.pm.src: The |$in_body| code has been moved down.
1316    
1317     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1318    
1319 wakaba 1.83 * HTML.pm.src: The "trailing end" insertion mode
1320     is split into "after html body" and "after html frameset"
1321     insertion modes. Their codes are merged with "after body"
1322     and "after frameset" codes. |$previous_insertion_mode|
1323     has been removed. "after frameset" code is
1324     merged with "in frameset" code.
1325    
1326     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1327    
1328 wakaba 1.82 * HTML.pm.src: The "before head" insertion mode is
1329     merged with the "in head" insertion mode.
1330    
1331     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1332    
1333 wakaba 1.81 * HTML.pm.src: Cases in "in head" insertion mode are
1334     reorganized.
1335    
1336     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1337    
1338 wakaba 1.80 * HTML.pm.src: Some cases in "in table" insertion mode
1339     are merged.
1340    
1341     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1342    
1343 wakaba 1.79 * HTML.pm.src: The "in row" insertion mode is merged
1344     with "in table" insertion mode.
1345    
1346     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1347    
1348 wakaba 1.78 * HTML.pm.src: The "in table" and "in table body" insertion
1349     modes are merged.
1350    
1351     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1352    
1353 wakaba 1.77 * HTML.pm.src: There is no "in table head" or "in table foot"
1354     insertion mode!
1355    
1356     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1357    
1358 wakaba 1.76 * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
1359     now directly invoke the handler.
1360    
1361     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1362    
1363 wakaba 1.75 * HTML.pm.src: Codes for "in cell" insertion mode
1364     is merged to the "in body" insertion mode code.
1365    
1366     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1367    
1368 wakaba 1.74 * HTML.pm.src: Codes for "in body" and "in caption"
1369     insertion modes are merged.
1370    
1371     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1372    
1373 wakaba 1.73 * HTML.pm.src: Two |!!!next-token|s were missing.
1374    
1375     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1376    
1377 wakaba 1.72 * HTML.pm.src: Use numeric constant for |{content_mode}|
1378     instead of string constant for |{content_model_flag}|.
1379    
1380     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1381    
1382 wakaba 1.71 * HTML.pm.src: Add the name of the attribute
1383     to the "duplicate attribute" error.
1384    
1385 wakaba 1.69 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1386    
1387 wakaba 1.70 * ContentChecker.pm: Return the |class| node list.
1388    
1389     2007-07-17 Wakaba <wakaba@suika.fam.cx>
1390    
1391 wakaba 1.69 * ContentChecker.pm: Return the |id| node list.
1392    
1393     * HTML.pm.src: A typo is fixed.
1394    
1395 wakaba 1.63 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1396    
1397 wakaba 1.68 * ContentChecker.pm: Drop wrong |level => 'error'| specification
1398     from "in HTML:xml:lang" error. Character position
1399     is now the last part of the error type in the URI error
1400     description. Report "unsupported" status for language
1401     tags, media queries, script codes, and style sheets.
1402    
1403     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1404    
1405 wakaba 1.67 * ContentChecker.pm: Report error if |xml:lang|
1406     in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
1407     in XML.
1408    
1409     * NanoDOM.pm (Attr.owner_document): New attribute.
1410    
1411     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1412    
1413 wakaba 1.66 * HTML.pm.src: The character immediately following
1414     a bare |hcro| was discarded. Fix handling of
1415     entity references in attribute values.
1416    
1417     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1418    
1419 wakaba 1.65 * HTML.pm.src (main and trailin end phases): Token
1420     types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
1421     factored out. Error types |in html:#DOCTYPE|
1422     and |after html:#DOCTYPE| are merged
1423     into |DOCTYPE in the middle|. |</frameset>|
1424     in fragment parsing mode changed the
1425     insertion mode.
1426    
1427     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1428    
1429 wakaba 1.64 * HTML.pm.src: |$phase| has been removed; The |trailing end|
1430     phase is now an insertion mode. Treatments for white
1431     space character tokens were incorrect for some
1432     insertion modes. An old |meta| case was not removed.
1433    
1434     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1435    
1436 wakaba 1.63 * HTML.pm.src: |meta| charset declaration extraction
1437     implemented (but changing the encoding is not yet:-).
1438    
1439 wakaba 1.60 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1440    
1441 wakaba 1.62 * Charset/: New directory.
1442    
1443     2007-07-15 Wakaba <wakaba@suika.fam.cx>
1444    
1445 wakaba 1.61 * H2H.pm: New Perl module (created from
1446     manakai's H2H.dis).
1447    
1448     2007-07-15 Wakaba <wakaba@suika.fam.cx>
1449    
1450 wakaba 1.60 * XMLSerializer.pm: New Perl module (created from
1451     manakai's SimpleLS.dis).
1452    
1453 wakaba 1.59 2007-07-07 Wakaba <wakaba@suika.fam.cx>
1454    
1455     * HTML.pm.src: |<!---x-->| was not processed correctly.
1456    
1457 wakaba 1.57 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1458    
1459 wakaba 1.58 * HTML.pm.src: Report correct error message
1460     for |<body></div></body>|.
1461    
1462     2007-07-01 Wakaba <wakaba@suika.fam.cx>
1463    
1464 wakaba 1.57 * HTMLTable.pm: An error description was incorrect.
1465    
1466     2007-06-30 Wakaba <wakaba@suika.fam.cx>
1467    
1468     * ContentChecker.pm: Return |{term}| list.
1469    
1470 wakaba 1.55 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1471    
1472 wakaba 1.56 * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
1473     nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
1474     and <title> outside of head).
1475    
1476     2007-06-30 Wakaba <wakaba@suika.fam.cx>
1477    
1478 wakaba 1.55 * IMTChecker.pm: Report warning for unregistered
1479     and private types/subtypes.
1480    
1481     * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
1482     URIChecker.pm, HTMLTable.pm: Error messages are now
1483     consistent; they are all listed in
1484     <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
1485    
1486 wakaba 1.52 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1487    
1488 wakaba 1.54 * ContentChecker.pm: |<img ismap>| not in |<a></a>|
1489     is now erred. |<datalist>| is implemented.
1490     Attribute checker for |<command>| and |<menu>| are
1491     added. Support for |contextmenu| global attribute
1492     is added.
1493    
1494     2007-06-25 Wakaba <wakaba@suika.fam.cx>
1495    
1496 wakaba 1.53 * HTML.pm.src (_reset_insertion_mode): Interpretation
1497     of Step 3 has been changed.
1498    
1499     2007-06-25 Wakaba <wakaba@suika.fam.cx>
1500    
1501 wakaba 1.52 * HTML.pm.src: Late |<html>| parse error is implemented.
1502    
1503 wakaba 1.47 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1504    
1505 wakaba 1.51 * URIChecker.pm (check_iri_reference): A |decode| method name was
1506     incorrect.
1507    
1508     * ContentChecker.pm: Support for the |footer| element.
1509     Check URI syntax for space-separated URI attributes.
1510     Support for the |tabindex| attribute. Support
1511     for |datetime| attribute.
1512    
1513     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1514    
1515 wakaba 1.50 * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
1516     character references). HTML5 revision 1.146 (white space
1517     characters before root start tag). HTML5 revision
1518     1.148 (named character references in attribute values).
1519     HTML5 revision 1.152 (<plaintext>.innerHTML get).
1520    
1521     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1522    
1523     * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
1524 wakaba 1.49 in <head>).
1525    
1526     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1527    
1528 wakaba 1.48 * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>
1529     in body).
1530    
1531     * ContentChecker.pm: HTML5 revision 938 (scoped="").
1532    
1533     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1534    
1535 wakaba 1.47 * HTML.pm.src: HTML5 revision 923 (matching end tag in CDATA
1536     or RCDATA in fragment parsing mode). HTML5 revision
1537     924 (<!--> and <!--->). HTML5 revision 926 (hn in hn).
1538    
1539     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1540    
1541     * HTML.pm.src (get_inner_html): HTML5 revision 922 (inner_html
1542     for <pre> and <textarea>).
1543    
1544 wakaba 1.33 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1545    
1546 wakaba 1.46 * HTML.pm.src: HTML5 revision 920 (<isindex>).
1547    
1548     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1549    
1550 wakaba 1.45 * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).
1551    
1552     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1553    
1554 wakaba 1.44 * HTML.pm.src: HTML5 revision 916 (</body>).
1555     HTML5 revision 917 (conforming bare &).
1556    
1557     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1558    
1559 wakaba 1.43 * NanoDOM.pm (manakai_is_html): Setting to false did not work.
1560    
1561     * HTML.pm.src: HTML5 revision 914 (</ in CDATA, RCDATA).
1562     HTML5 revision 915 (<nobr>).
1563    
1564     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1565    
1566 wakaba 1.42 * HTML.pm.src: HTML5 revisions 908, 909, 912, and 913 (quirks mode).
1567    
1568     * NanoDOM.pm (manakai_is_html, manakai_compat_mode, compat_mode):
1569     New attributes.
1570    
1571     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1572    
1573 wakaba 1.41 * HTML.pm.src: HTML5 revisions 900, 901, 902, and 911 (<
1574     in tags).
1575    
1576     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1577    
1578 wakaba 1.40 * .cvsignore: |Entities.html| is added.
1579    
1580     * HTML.pm.src: |$entity_char| is removed and
1581     requires |Whatpm::_NamedEntityList| instead.
1582     HTML5 revision 898 (refc), except that lack of refc
1583     is parse error.
1584    
1585     * mkentitylist.pl: New script.
1586    
1587     * Makefile (all): |_NamedEntityList.pm| is added.
1588     (_NamedEntityList.pm, Entities.html): New rules.
1589    
1590     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1591    
1592 wakaba 1.39 * HTML.pm.src: Parse errors immediately after U+000D
1593     were ignored and U+000D immediately following another
1594     U+000D was not converted to U+000A.
1595    
1596     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1597    
1598 wakaba 1.38 * HTML.pm.src (set_inner_html): HTML5 revision 892 (adopt
1599     nodes before appended). Parser was not ready for NULL
1600     parse error and escape flag.
1601    
1602     * NanoDOM.pm (adopt_node): New.
1603    
1604     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1605    
1606 wakaba 1.37 * HTML.pm.src: HTML5 revision 886 (insane comment in
1607     CDATA and RCDATA). Note that current implementation
1608     is simply repeating what the spec says and it is maybe not
1609     a best way to do it.
1610    
1611     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1612    
1613 wakaba 1.36 * HTML.pm.src: HTML5 revision 884 (</form> don't close
1614     the form element if a descendant element without implied end tag has
1615     still been open).
1616    
1617     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1618    
1619 wakaba 1.35 * ContentChecker.pm: HTML5 revision 881 (Make |id| attribute with
1620     space characters non-conforming).
1621    
1622     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1623    
1624 wakaba 1.34 * HTML.pm.src: An error message was incorrect.
1625     HTML5 revision 869 (C1 character references).
1626    
1627     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1628    
1629 wakaba 1.33 * HTML.pm.src: HTML5 revision 867 (a LF at the beginning of
1630     a |textarea| is removed).
1631    
1632 wakaba 1.32 2007-06-05 Wakaba <wakaba@suika.fam.cx>
1633    
1634     * NanoDOM.pm (get_attribute_node_ns): New method.
1635    
1636     * ContentChecker.pm: |script| |async| and |defer|
1637     no longer require |src|. |async| MUST NOT be
1638     specified if |defer|. (HTML5 revision 858).
1639    
1640 wakaba 1.31 2007-05-30 Wakaba <wakaba@suika.fam.cx>
1641    
1642     * HTML.pm.src: |<form><form>| went to inifinite loop.
1643    
1644 wakaba 1.27 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1645    
1646 wakaba 1.30 * ContentChecker.pm (html): Set |is_root| (allowed
1647     as a document element) flag on.
1648     (new): Removed.
1649     (check_document): New method.
1650    
1651     2007-05-27 Wakaba <wakaba@suika.fam.cx>
1652    
1653 wakaba 1.29 * ContentChecker.pm (thead, tfoot): Checker specifications
1654     were incorrect.
1655    
1656     2007-05-27 Wakaba <wakaba@suika.fam.cx>
1657    
1658 wakaba 1.28 * ContentChecker.pm ($HTMLURIAttrChecker): Include
1659     error position in the |type| option of the error.
1660    
1661     * HTMLTable.pm (form_table): The |$onerror| parameter
1662     is now optional. Some bugs are fixed.
1663    
1664     2007-05-27 Wakaba <wakaba@suika.fam.cx>
1665    
1666 wakaba 1.27 * HTMLTable.pm: New module.
1667    
1668     * ContentChecker.pm (table): Invoke table model error checker.
1669    
1670     * NanoDOM.pm (first_child, get_attribute_ns): New.
1671    
1672 wakaba 1.25 2007-05-26 Wakaba <wakaba@suika.fam.cx>
1673    
1674 wakaba 1.26 * ContentChecker.pm ($HTMLLinkTypesAttrChecker): New checker.
1675     (link/@rel, a/@rel, area/@rel): Use new checker.
1676    
1677     * Makefile (_LinkTypeList.pm, RelExtensions.html): New rules.
1678    
1679     * _LinkTypeList.pm: New file.
1680    
1681     * mklinktypelist.pl: New file.
1682    
1683     * .cvsignore: |RelExtensions.html| added.
1684    
1685     * NanoDOM.pm (child_nodes): Returns an empty array
1686     for non-child-containing node types.
1687     (text_content): New attribute.
1688    
1689     2007-05-26 Wakaba <wakaba@suika.fam.cx>
1690    
1691 wakaba 1.25 * IMTChecker.pm: New module.
1692    
1693     * ContentChecker.pm ($HTMLIMTAttrChecker): Call IMTChecker
1694     to test parameter value validity.
1695    
1696     * HTML.pm.src ($style_start_tag): Attributes were
1697     discarded.
1698    
1699 wakaba 1.22 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1700    
1701 wakaba 1.24 * ContentChecker.pm ($HTMLURIAttrChecker): Implemented.
1702    
1703     2007-05-25 Wakaba <wakaba@suika.fam.cx>
1704    
1705 wakaba 1.23 * URIChecker.pm: All recommendations from RFC 3986
1706     and RFC 3987 are listed (not all testable items are checked yet).
1707    
1708     2007-05-25 Wakaba <wakaba@suika.fam.cx>
1709    
1710 wakaba 1.22 * URIChecker.pm: New module.
1711    
1712 wakaba 1.18 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1713    
1714 wakaba 1.21 * ContentChecker.pm: Now most attributes are implemented
1715     or associated to some placeholder.
1716     ($ElementDefault): Warn unknown attributes
1717     for unknown elements as "attribute not supported".
1718     ($HTMLLanguageTagAttrChecker, $HTMLMQAttrChecker): New placeholders.
1719     ($HTMLUsemapAttrChecker, $HTMLTargetAttrChecker): New checkers.
1720     (|a| attribute checker): Reimplemented.
1721    
1722     2007-05-20 Wakaba <wakaba@suika.fam.cx>
1723    
1724 wakaba 1.20 * ContentChecker.pm ($HTMLEventHandlerAttrChecker): New placeholder.
1725     ($HTMLAttrChecker): Event handler content attributes
1726     are added.
1727     (link, embed): Required attribute is now checked.
1728     (embed): Unknown local attributes are no longer warned.
1729    
1730     2007-05-20 Wakaba <wakaba@suika.fam.cx>
1731    
1732 wakaba 1.19 * ContentChecker.pm ($HTMLSpaceURIsAttrChecker): New placeholder.
1733     ($HTMLIMTAttrChecker): New checker.
1734     (link@rel, link@href, link@type, style@type,
1735     a@href, a@ping, a@ping, a@type, embed@src, embed@type,
1736     object@data, object@type, source@src, source@type, area@alt,
1737     area@shape, area@coords, area@href,
1738     area@ping, area@rel, area@type, script@src,
1739     script@defer, script@async, script@type): Checkers added.
1740    
1741     2007-05-20 Wakaba <wakaba@suika.fam.cx>
1742    
1743 wakaba 1.18 * ContentChecker.pm: Descendant checking was incorrect.
1744    
1745 wakaba 1.12 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1746    
1747 wakaba 1.17 * ContentChecker.pm: Support |xml:*| and |xmlns:*|
1748     attributes. Report an error if |Element.prefix|
1749     is |xmlns|.
1750    
1751     * NanoDOM.pm (prefix): New attribute.
1752    
1753     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1754    
1755 wakaba 1.16 * HTML.pm.src: In |main| phase, |in body| insertion
1756     mode, action for |<iframe>| was missing.
1757    
1758     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1759    
1760 wakaba 1.15 * ContentChecker.pm: Support for many of HTML5 elements.
1761     ($GetHTMLNonNegativeIntegerAttrChecker): New.
1762    
1763     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1764    
1765 wakaba 1.14 * ContentChecker.pm: Support for most elements up to |progress|.
1766     ($HTMLURIAttrChecker): Placeholder.
1767     ($HTMLIntegerAttrChecker, $GetHTMLFloatingPointNumberAttrChecker):
1768     New.
1769    
1770     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1771    
1772 wakaba 1.13 * ContentChecker.pm: Attribute checkers
1773     for global attributes, |html|, |base|, |style|, and |meta|.
1774    
1775     * NanoDOM.pm (insert_before): Weaken reference
1776     to the parent node.
1777     (Attr::new): Set |owner_element| attribute.
1778     (namespace_uri, manakai_local_name): New attribute implementations.
1779     (owner_element): New attribute.
1780    
1781     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1782    
1783 wakaba 1.12 * ContentChecker.pm ($AttrChecker, $HTMLAttrChecker,
1784     $AnyChecker->{attr_checker}, $HTMLAttrsChecker,
1785     $Element->{$HTML_NS}->{''}): New.
1786     (check_element): Invoke attrs_checker for each element.
1787    
1788 wakaba 1.6 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1789    
1790 wakaba 1.11 * ContentChecker.pm: Don't use |manakai_element_type_match|.
1791    
1792     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1793    
1794 wakaba 1.10 * ContentChecker.pm: Use hashs rather than lists for
1795     element type testings.
1796    
1797     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1798    
1799 wakaba 1.9 * ContentChecker.pm: Don't generate duplicate
1800     error when an element type is put in the "minus" list
1801     and the element type is not allowed explicitly in the particular
1802     element content model.
1803     (html:a checker): New checker.
1804     (html:details, html:datagrid): New checkers.
1805     (html:legend): New checker.
1806    
1807     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1808    
1809 wakaba 1.8 * ContentChecker.pm (html:li checker): Implemented.
1810    
1811     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1812    
1813 wakaba 1.7 * ContentChecker.pm ($HTMLInlineOrStriclyInlineChecker): New
1814     checker.
1815     (html:dd checker): New checker.
1816     (html:q, html:em, html:strong, html:small,
1817     html:m, html:dfn, html:code, html:samp, html:span): New checkers.
1818    
1819     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1820    
1821 wakaba 1.6 * ContentChecker.pm ($AnyChecker): Renamed
1822     from |$ElementDefault->{checker}|.
1823     ($ElementDefault->{checker}): Throw an error that
1824     the element type is not supported by the checker.
1825     ($HTMLMetadataElement): |html:base| was missing.
1826     ($HTMLEmptyChecker): Don't throw an error
1827     for inter-element whitespace nodes.
1828     (html:html checker): Errors were not
1829     thrown even if |html:head| and/or |html:body|
1830     children were missing.
1831     (html:head checker): An error was not
1832     thrown if <meta charset> appered after other
1833     elements.
1834    
1835 wakaba 1.5 2007-05-05 Wakaba <wakaba@suika.fam.cx>
1836    
1837     * ContentChecker.pm: |footer|, |video|, |audio|, |script|,
1838     and |noscript| elements are implemented.
1839     (new): New method.
1840    
1841 wakaba 1.3 2007-05-04 Wakaba <wakaba@suika.fam.cx>
1842    
1843 wakaba 1.4 * ContentChecker.pm: New module.
1844    
1845     2007-05-04 Wakaba <wakaba@suika.fam.cx>
1846    
1847 wakaba 1.3 * NanoDOM.pm (manakai_parent_element,
1848     document_element, manakai_local_name,
1849     manakai_element_type_match): New method.
1850    
1851     2007-05-03 Wakaba <wakaba@suika.fam.cx>
1852    
1853     * HTML.pm.src: Replace decimal and hexadecimal numeric
1854     entities in C1 range using Windows-1252 mapping. Bare LF
1855     did not count as new line for error reporting.
1856    
1857 wakaba 1.2 2007-05-02 Wakaba <wakaba@suika.fam.cx>
1858    
1859     * NanoDOM.pm (DOMImplementation): New class.
1860     (append_child): Weaken the |parent_node| reference.
1861     (create_element_ns, Element new): Set the |owner_document|
1862     reference.
1863     (implementation): New attribute.
1864     (owner_document, local_name, namespace_uri): New attributes.
1865    
1866     * HTML.pm.src (parse_string): Line and column numbers
1867     are now provided to error handler.
1868     (!!!parse-error): Short descriptions are added.
1869     (_construct_tree): Split into three methods; support
1870     for innerHTML mode.
1871     (set_inner_html): New method.
1872    
1873 wakaba 1.1 2007-05-01 Wakaba <wakaba@suika.fam.cx>
1874    
1875     * NanoDOM.html: Documentation is added.
1876    
1877     * HTML.pod, ContentType.html: Documentation is revised.
1878    
1879     * .cvsignore: Pod2html temporary files are added.
1880    
1881     * Makefile: Make |NanoDOM.html|.
1882    
1883     2007-05-01 Wakaba <wakaba@suika.fam.cx>
1884    
1885     * HTML.pm.src (parse_string): New method.
1886     (get_inner_html): Renamed from |inner_html|.
1887    
1888     * Makefile: A rule for |HTML.html| is added.
1889    
1890     * HTML.pod: New documentation.
1891    
1892     2007-05-01 Wakaba <wakaba@suika.fam.cx>
1893    
1894     * NanoDOM.pm (last_child, previous_sibling): New attributes.
1895     (clone_node): Attribute nodes were not completely copied.
1896    
1897     * HTML.pm.src: Many bugs are fixed.
1898    
1899     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1900    
1901     * HTML.pm.src: Some typos are fixed.
1902    
1903     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1904    
1905     * mkhtmlparser.pl, Makefile: References to the |HTML-consume-entity.src|
1906     are removed.
1907    
1908     * HTML.pm.src: Tokenizer's handling on named entities are rewritten.
1909    
1910     * HTML-consume-entity.src: Removed.
1911    
1912     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1913    
1914     * HTML.pm.src: Tokenizer's handling on hexadecimal
1915     numeric entities are rewritten.
1916    
1917     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1918    
1919     * HTML.pm.src: Some tokenizer bugs are fixed.
1920    
1921     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1922    
1923     * HTML.pm.src: The tree construction stage is implemented.
1924    
1925     * mkhtmlparser.pl: New macros are added.
1926    
1927     2007-04-28 Wakaba <wakaba@suika.fam.cx>
1928    
1929     * ContentType.pm: A note on bug in the specification
1930     is removed since it's been now fixed.
1931    
1932     * .cvsignore: New file.
1933    
1934     2007-04-28 Wakaba <wakaba@suika.fam.cx>
1935    
1936     * HTML.pm.src, HTML-consume-entity.src: New files.
1937    
1938     * Makefile (HTML.pm): New rule.
1939    
1940     * mkhtmlparser.pl: New script.
1941    
1942     2007-04-25 Wakaba <wakaba@suika.fam.cx>
1943    
1944     * Makefile: New file.
1945    
1946     2007-04-24 Wakaba <wakaba@suika.fam.cx>
1947    
1948     * ContentType.pm: An error in pod is fixed.
1949    
1950     2007-04-24 Wakaba <wakaba@suika.fam.cx>
1951    
1952     * ContentType.pm: Documentation is added.
1953    
1954     2007-04-24 Wakaba <wakaba@suika.fam.cx>
1955    
1956     * ContentType.pm: New Perl module.
1957    
1958     * ChangeLog: New file.
1959    
1960    

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24